33rd International Conference on Electrical Engineering
Design and Implementation of a Flexible CNN Accelerator for Fast Real-Time Object Detection on FPGA
Authors:
Emadodin Sakhaee 1
Mahdi Kalbasi 2
1- University of Isfahan
2- University of Isfahan
Keywords:
Energy efficiency, real-time image processing, convolutional neural networks, hardware accelerator
Abstract:
In resource-constrained edge computing systems, such as mobile devices and the Internet of Things, Convolutional Neural Network (CNN) accelerators on FPGA are increasingly widely used. UltraScale Zynq FPGAs offer scalability and flexibility for implementing deep learning-based object detection applications. However, such implementations suffer from low performance and have difficulty achieving real-time processing. This paper optimizes the accelerator at the Register Transfer Level (RTL) to increase processing speed while applying low-power techniques in the FPGA implementation. To this end, we propose a configurable RTL accelerator design for a CNN-based object detection system on FPGA. We also present RTL optimization techniques, including disabling unnecessary clock cycles to reduce energy consumption and using the Posit number format to increase computational accuracy. The proposed system was tested with ResNet-20 trained on the CIFAR-10 dataset, using weight data provided by Tensil. Experimental results show that the proposed design improves energy consumption, hardware utilization, and computational accuracy by 11%, up to 25%, and 4%, respectively.
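For readers unfamiliar with the Posit format mentioned in the abstract, the following is a minimal Python sketch of how a posit bit pattern maps to a real value. It is illustrative only: the abstract does not state the bit width or exponent size used in the accelerator, so the 8-bit width with 2 exponent bits is an assumption, and the function name `decode_posit` is hypothetical rather than taken from the paper or from Tensil.

```python
from fractions import Fraction

def decode_posit(bits, n=8, es=2):
    """Decode an n-bit posit with es exponent bits into an exact Fraction.

    Assumed configuration (not from the paper): n=8, es=2.
    Returns None for NaR (Not a Real).
    """
    mask = (1 << n) - 1
    bits &= mask
    if bits == 0:
        return Fraction(0)
    if bits == 1 << (n - 1):
        return None  # NaR: sign bit set, all other bits zero

    negative = bool(bits >> (n - 1))
    if negative:
        bits = (-bits) & mask  # negative posits are stored in two's complement

    # The n-1 bits after the sign, most significant first.
    body = [(bits >> i) & 1 for i in range(n - 2, -1, -1)]

    # Regime: a run of identical bits, ended by the opposite bit or the word end.
    first = body[0]
    run = 1
    while run < len(body) and body[run] == first:
        run += 1
    k = run - 1 if first == 1 else -run

    rest = body[run + 1:]  # skip the regime terminator bit

    # Exponent: up to es bits; missing low bits are treated as zero.
    exp_bits = rest[:es] + [0] * max(0, es - len(rest[:es]))
    e = 0
    for b in exp_bits:
        e = (e << 1) | b

    # Fraction: whatever bits remain, with an implicit leading 1.
    frac_bits = rest[es:]
    f = 0
    for b in frac_bits:
        f = (f << 1) | b
    frac = Fraction(f, 1 << len(frac_bits)) if frac_bits else Fraction(0)

    scale = k * (1 << es) + e  # total power of two
    magnitude = (1 + frac) * Fraction(2) ** scale
    return -magnitude if negative else magnitude
```

Because the regime field has variable length, values near 1 keep more fraction bits than very large or very small values. This tapered precision is the usual argument for posits in neural network inference, where weights and activations tend to cluster near zero magnitude scales.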
List of papers
List of archived papers
T-type L-2L De-Embedding Method for On-Wafer T-model Transmission Line Network
Milad Seyedi - Nasser Masoumi - Samad Sheikhaei
Speech Disorder Detection Using Acoustic Attractor Modeling in Reconstructed Phase Space
عاطفه کردکاری خسروشاهی - یاسر شکفته
Control of a Three-Phase Brushless DC Motor Drive with a Four-Switch Inverter Using Finite Control Set Model Predictive Control (FCS-MPC)
ابوالفضل حلوایی نیاسر - سجاد محمدی کوجانی
Adaptive Control of a Two-Degree-of-Freedom Robotic Arm Using Ensemble Learning Based on the Randomized Weighted Majority Algorithm
علی چراغی - امیرحسین جراره - سعید شمقدری
Primary Frequency Support in Clustered Unit Commitment with Battery Energy Storage and High Renewable Penetration
Abbas Abdollahi-Veshvaee - Turaj Amraee
Super twisting sliding mode incorporated with USDE for tracking control of nonlinear robotic systems
Ahmadreza Fallahinezhad - Maryam Malekzadeh - Alireza Ariaei
Small Target Detection Using an Enhanced Optimization Based Filter and Trajectory Tracking Via Pattern Matching Algorithm
Seyedeh Mahsa Zakipour Bahambari - Saeed Khankalantary
A Multi-domain Fuzzy Ensemble Approach for Epileptic Seizure Detection
Samin Shahraki - Alireza Hajabdollah javaheri - Mahdi Pourgholi - Pedram Safarpour
RCS Calculation of a Symmetrical Microstrip Array Using Discrete Bodies of Revolution Method
Hossein Mohammadzadeh - Abolghasem Zeidaabadi Nezhad - Zaker Hossein Firouzeh
Hand Movement Decoding from EEG Signals Using Kalman Filter with Parameters Estimated via Neural Networks and Least Squares Method
Pegah Khoshkavandi - Mohammad B Shamsollahi - Ali Motie Nasrabadi