32nd International Conference on Electrical Engineering
A Deep Learning-Based Model for House Number Detection and Recognition
Authors:
Roghaiyeh Tayefeh Younesi (1)
Jafar Tanha (2)
Samaneh Namvar (3)
Sahar Hassanzadeh Mostafaei (4)
1- University of Tabriz
2- University of Tabriz
3- University of Tabriz
4- University of Tabriz
Keywords:
Image Recognition, Image Detection, Data Augmentation, CNN, LSTM, Multi Digit
Abstract:
Detecting and recognizing information in natural images remains a significant challenge in computer vision, with implications for many future applications. In recent years, applying deep learning techniques to real-world image datasets has produced notable results in detection and pattern recognition. In this paper, we address number detection and recognition in real-world scenes by proposing deep learning models trained on the Street View House Numbers (SVHN) dataset. To improve accuracy, we applied preprocessing steps to the training dataset, including data augmentation techniques such as resizing, random rotation, random horizontal flip, and angle changes, and we also optimized the hyperparameters and model layers. In the first model, we applied a fully connected Convolutional Neural Network (CNN) model to sequences of digit images and achieved an accuracy of 95 percent. The second model, a Convolutional Long Short-Term Memory (CNN-LSTM) network that combines CNN and LSTM layers to model temporal information, achieved an accuracy of 93 percent. Both models perform well at recognizing numbers in complex, real-world environments. Our results show that combining CNN models with data augmentation significantly improves the accuracy of number recognition in real-world images on the SVHN dataset. We also compare the results of our proposed models with other state-of-the-art methods.
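The augmentation steps named in the abstract (resizing, random rotation, random horizontal flip) can be sketched roughly as follows. This is a minimal illustration in plain Python on grayscale images stored as lists of rows, not the authors' pipeline: the function names, the 32x32 target size, the 10-degree rotation range, and the 0.5 flip probability are all assumptions chosen for the example.

```python
import math
import random


def resize_nearest(img, out_h, out_w):
    """Resize a 2-D image (list of rows) with nearest-neighbour sampling."""
    in_h, in_w = len(img), len(img[0])
    return [
        [img[r * in_h // out_h][c * in_w // out_w] for c in range(out_w)]
        for r in range(out_h)
    ]


def horizontal_flip(img):
    """Mirror the image left to right."""
    return [row[::-1] for row in img]


def rotate(img, degrees, fill=0):
    """Rotate about the image centre; pixels with no source get `fill`."""
    h, w = len(img), len(img[0])
    cy, cx = (h - 1) / 2, (w - 1) / 2
    theta = math.radians(degrees)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    out = [[fill] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            # Inverse mapping: look up the source pixel for each output pixel.
            dy, dx = r - cy, c - cx
            src_r = round(cy + dy * cos_t - dx * sin_t)
            src_c = round(cx + dy * sin_t + dx * cos_t)
            if 0 <= src_r < h and 0 <= src_c < w:
                out[r][c] = img[src_r][src_c]
    return out


def augment(img, rng, size=(32, 32), max_degrees=10.0, flip_prob=0.5):
    """Apply resize, random rotation, and random horizontal flip.

    The default size, angle range, and flip probability are assumed
    values for illustration, not settings reported in the paper.
    """
    out = resize_nearest(img, *size)
    out = rotate(out, rng.uniform(-max_degrees, max_degrees))
    if rng.random() < flip_prob:
        out = horizontal_flip(out)
    return out
```

Passing a seeded generator, for example `augment(image, random.Random(0))`, makes the random rotation and flip reproducible across runs.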