0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
A Deep Learning-Based Model for House Number Detection And Recognition
نویسندگان :
Roghaiyeh Tayefeh Younesi
1
Jafar Tanha
2
Samaneh Namvar
3
Sahar Hassanzadeh Mostafaei
4
1- دانشگاه تبریز
2- دانشگاه تبریز
3- دانشگاه تبریز
4- دانشگاه تبریز
کلمات کلیدی :
Image Recognition،Image Detection،Data Augmentation،CNN،LSTM،Multi Digit
چکیده :
Abstract— Detection and recognition of information from natural images pose significant challenges in computer vision, with far-reaching implications for future applications. In recent years, the application of deep learning techniques to real-world image datasets has yielded notable achievements in the realms of recognition, detection, and pattern recognition. In this paper, we specifically tackle the challenge of number detection and recognition in real-world scenes by proposing deep learning models on the Street View House Numbers (SVHN) dataset. In the proposed models, to boost accuracy, we applied preprocessing steps to the training dataset. These steps included data augmentation techniques such as resizing, random rotation, random horizontal flip, angle degree changes, and optimization of hyperparameters and model layers. In the initial model, we utilized a fully connected Convolutional Neural Network (CNN) model on sequences of digit images, achieving an impressive accuracy of 95 percent. Subsequently, a Convolutional-Long Short-Term Memory (CNN-LSTM) model was designed for temporal information modeling, utilizing a combination of CNN and LSTM layers that also achieved an accuracy of 93 percent. These models demonstrate high performance in recognizing numbers in complex and real-world environments. Our results underscore the significant enhancement in the accuracy of number recognition in real-world images achieved on the SVHN dataset by combining CNN models with data augmentation. We also compare the results of our proposed models with other state-of-the-art methods.
لیست مقالات
لیست مقالات بایگانی شده
A Dual-Band LPDA Antenna Based on MXene for High-Band 5G Application
Javad Shokri seyyedi - Reza Sarraf Shirazi - Gholamreza Moradi
A Methodology for the Design of a Single-Sided Axial-Flux Permanent Magnet (AFPM) Motor for Lathe Machines
Ali Jamali-Fard - Mojtaba Mirsalim
An Enhanced Chaotic System Based Color Image Encryption using DNA Encoding
Mobin Vaziri - Mohammad Mehdi Rahimifar - Hadi Jahanirad
طراحی روش مبتنی بر آنالیز پوش داده برای ارزیابی عملکرد ایستگاه های فوق توزیع و تعیین سطح مطلوب قابلیت اطمینان سیستم توزیع انرژی الکتریکی
محمد رستگار - زهرا یزدانپناه - محمد جوشکی
Hybrid PI-SOSM Controller for Battery and Supercapacitor Integration in Electric Vehicles
Maede Azimi - Ghasem Rezazadeh - Mohsen Hamzeh
Automotive radar target classification using micro-Doppler features
Amin Aghatabarroodbary - Mohammad Hassan Bastani - Fereidoon Behnia
Model Predictive Control for a 3-DoF Suspended Cable Robot Based on Laguerre Functions
Shiva Khoshkam - Mohammad A. Khosravi - Rasul FesharakiFard
A 5kW Bidirectional Isolated On-Board EV Battery Charger Using Hybrid PFC/Inverter
Amir Hossein Dabbagh - Hamed Arvani - Ebrahim Afjei
Improved Light-Load Efficiency of Resonant Switched-Capacitor Converter
Sajad AfsharZarandi - Reza Beiranvand
Introducing a New Phase Realization Technique for Implementation of Broadband Reflectarray Antenna
Mahmood Rafaei-booket - Seyed Mostafa Mousavi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.0.4