0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Design and Implementation of a Flexible CNN Accelerator for Fast Real-Time Object Detection on FPGA
نویسندگان :
Emadodin Sakhaee
1
Mahdi Kalbasi
2
1- دانشگاه اصفهان
2- دانشگاه اصفهان
کلمات کلیدی :
Energy efficiency،real-time image processing،convolutional neural networks،hardware accelerator
چکیده :
In edge computing systems with limited resources, such as mobile devices and the Internet of Things, the use of Convolutional Neural Network (CNN) accelerators on FPGA has increasingly expanded. Ultrascale ZYNQ FPGAs offer scalability and flexibility for implementing deep learning-based object detection applications. However, this technology has low performance and limitations in achieving real-time processing. This paper addresses the optimization of the accelerator at the Register Transfer Level (RTL) to increase processing speed using low-power techniques in FPGA implementation. Therefore, a configurable accelerator design for a CNN-based object detection system at the RTL level on FPGA is proposed. We also present RTL optimization techniques that include techniques for disabling unnecessary clock cycles to reduce energy consumption and the use of the Posit number system format to increase calculation accuracy. The proposed system was tested with ResNet-20 and trained with the CIFAR-10 dataset. The weight data used for this test was provided by Tensil. Experimental results show that the proposed design process improves energy consumption, hardware utilization, and computational accuracy by 11%, up to 25%, and 4%, respectively.
لیست مقالات
لیست مقالات بایگانی شده
Sliding Mode Control for a Platoon of vehicular with DoS attacks and Obstacles
Tara Rajabi Nezhad Siahpoosh - Hanie Marufkhani - Mohammad A. Khosravi
Fragmentation-aware Coordinated Virtual Optical Network Embedding Algorithm Over Elastic Optical Networks
Niusha Sabri Kadijani - Lotfollah Beygi
Family of Soft-Switched Single-Switch Switched-Resonator Converters with Low Component Count
Maryam Hajilou - Siamak Khalili - Hosein Farzanehfard
A New Protocol to Improve Effect of repetitive Transcranial Magnetic Stimulation in Treatment of Alzheimer's Disease
Ali Abedi - Gholamreza Moradi - Reza Sarraf Shirazi - Mehran Jahed
Estimation of the Arc Model Parameters Using Heuristic Optimization Methods
Sadegh Ghavami - Ali A Razi-kazemi
Robust H∞ Control Design for Variable-Speed Wind Turbines Using Bilinear Matrix Inequalities
Hamidreza Javanmardi - Alireza Hamedi - Mahya Rahimzadeh
طراحی بهینه چند هدفی کنترل کننده مدلغزشی مرتبه کسری برای سیستم کوادروتور
ابوالفضل انصاریان - جواد عسکری - مرضیه کمالی - محمدجواد محمودآبادی
حسگر ضریب شکست مبتنی بر فانو رزونانس در موجبرهای فلز- عایق- فلز، با رزوناتور صفحهای تزویج شده از جانب
تورج هاشمی - نسرین عبدالهی برازجان - عباس علی قنبری
Stability Improvement in Weak Grid-Tied DFIG-based WECS Employing Adaptive Virtual Impedance Strategy Based on Machine Learning Considering the LVRT Constraint
Mohammad Hossein Shaabani - Behrooz Vahidi - Navid Dehghan
Enhanced Optimal Droop Control for Effective Load Sharing in an Islanded Microgrid
Rafi Zahedi - Hassan Rastegar
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.8.0