0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Design and Implementation of a Flexible CNN Accelerator for Fast Real-Time Object Detection on FPGA
نویسندگان :
Emadodin Sakhaee
1
Mahdi Kalbasi
2
1- دانشگاه اصفهان
2- دانشگاه اصفهان
کلمات کلیدی :
Energy efficiency،real-time image processing،convolutional neural networks،hardware accelerator
چکیده :
In edge computing systems with limited resources, such as mobile devices and the Internet of Things, the use of Convolutional Neural Network (CNN) accelerators on FPGA has increasingly expanded. Ultrascale ZYNQ FPGAs offer scalability and flexibility for implementing deep learning-based object detection applications. However, this technology has low performance and limitations in achieving real-time processing. This paper addresses the optimization of the accelerator at the Register Transfer Level (RTL) to increase processing speed using low-power techniques in FPGA implementation. Therefore, a configurable accelerator design for a CNN-based object detection system at the RTL level on FPGA is proposed. We also present RTL optimization techniques that include techniques for disabling unnecessary clock cycles to reduce energy consumption and the use of the Posit number system format to increase calculation accuracy. The proposed system was tested with ResNet-20 and trained with the CIFAR-10 dataset. The weight data used for this test was provided by Tensil. Experimental results show that the proposed design process improves energy consumption, hardware utilization, and computational accuracy by 11%, up to 25%, and 4%, respectively.
لیست مقالات
لیست مقالات بایگانی شده
Forecasting Tehran Stock Exchange Trend with Time Series Analysis, Fundamental Data, and Sentiment Analysis in News
Mahdi Shamisavi - Amir Jahanshahi
Peer-to-peer Energy Sharing Considering Prosumers' Preferences and Load Uncertainties
Mohammad Bagher Moradi - Mohammad Hasan Nazari - Seyed Hossein Hosseinian - Hamed Nafisi
Batch(offline) Reinforcement Learning for recommender system
Mohammad Amir Rezaei Gazik - Mehdy Roayaei
Explorable Grasp Pose Detection for Two-Finger Robot Handover
AliReza Beigy - Mehdi Tale Masouleh - Ahmad Kalhor
تشخیص و تفکیک برخط خطای مدار باز کلید در اینورترهای تک فاز PWM
مهدی اره پناهی - علی اکبر سلیمی
طراحی و پیاده سازی ژنراتور تولید کننده پالس PFN-Marx فشرده و ماژولار برای تولید پالس 25 کیلوولتی
محمد حسین رنجبر - محمدجواد گل علی پور
Decoding Trait: Using Dual Transformers to Analyze Gender, Age Range and Personality
ُSaeed Asadian - Mostafa Tanasan - Bijan Vosoughi vahdat
The Comparison of MXene and Graphene-Based Antennas for 5G/6G Communications
Javad Shokri Seyyedi - Gholamreza Moradi - Reza Sarraf Shirazi - Sepehr Sahab - Abolfazl Ebrahimpour
Net Load Forecasting of Household Prosumers Considering Deep Reinforcement Learning
Behzad Motallebi Azar - Rasool Kazemzadeh - Morteza Zare Oskouei - Behnam Mohammadi-Ivatloo
Modeling, estimation, and model predictive control for Covid-19 pandemic with finite security duration vaccine
Abolfazl Delavar - Reza Rahimi Baghbadorani
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 41.7.4