0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Design and Implementation of a Flexible CNN Accelerator for Fast Real-Time Object Detection on FPGA
نویسندگان :
Emadodin Sakhaee
1
Mahdi Kalbasi
2
1- دانشگاه اصفهان
2- دانشگاه اصفهان
کلمات کلیدی :
Energy efficiency،real-time image processing،convolutional neural networks،hardware accelerator
چکیده :
In edge computing systems with limited resources, such as mobile devices and the Internet of Things, the use of Convolutional Neural Network (CNN) accelerators on FPGA has increasingly expanded. Ultrascale ZYNQ FPGAs offer scalability and flexibility for implementing deep learning-based object detection applications. However, this technology has low performance and limitations in achieving real-time processing. This paper addresses the optimization of the accelerator at the Register Transfer Level (RTL) to increase processing speed using low-power techniques in FPGA implementation. Therefore, a configurable accelerator design for a CNN-based object detection system at the RTL level on FPGA is proposed. We also present RTL optimization techniques that include techniques for disabling unnecessary clock cycles to reduce energy consumption and the use of the Posit number system format to increase calculation accuracy. The proposed system was tested with ResNet-20 and trained with the CIFAR-10 dataset. The weight data used for this test was provided by Tensil. Experimental results show that the proposed design process improves energy consumption, hardware utilization, and computational accuracy by 11%, up to 25%, and 4%, respectively.
لیست مقالات
لیست مقالات بایگانی شده
Analysis of the RCS of Luneburg Reflector in Bistatic Mode
Mohammad Amin Abdollahi - Gholamreza Moradi
تعیین آرایش بهینه خطوط جهت کاهش فرسایش یقه پایه های بتنی ناشی از تنشهای باد
میثم پوراحمدی نخلی - حمیدرضا فیروزآبادی
ادغام حسگرهای رادار، لیدار و دوربین به منظور بهبود عملکرد در تشخیص اهداف برای کاربرد خودروهای خودران
سید مسعود معصومی زاده - محمد سجادی - طاها محقق - منصور نادرپور - صادق شاه سنایی - محمد علی مددی - زهرا کاوه وش - علی فتوت احمدی
A Time-Distributed Convolutional Long Short-Term Memory for Hand Gesture Recognition
Mehdi Fatan Serj - Mersad Asgari - Bahram Lavi - Domenec Puig Valls - Miguel Angel Garcia
Multi-Objective Particle Swarm Optimization Of Spiral Antenna for Microwave Imaging Applications
Mehdi Yousefnia - Jaber Allahgholipor - Ataollah Ebrahimzadeh
Estimation of the Arc Model Parameters Using Heuristic Optimization Methods
Sadegh Ghavami - Ali A Razi-kazemi
Distributed Data Processing for Multi-Agent Systems Via Wave Model
Saeedreza Tofighi - Masoud Shafiee
High Performance and Low Power Spintronic Binarized Neural Network Hardware Accelerator
Milad Tanavardi Nasab - Arefe Amirany - Mohammad Hossein Moaiyeri - Kian Jafari
A new LDO regulator with adaptive PSR improvement under wide load current range and fast load transient response
Mohammad Ahmadi - Emad Ebrahimi
حسگر غیرتهاجمی تشخیص قندخون با استفاده از تکنیک مایکروویو بر مبنای تشدید فرکانسی
نازنین افشاری - سید محمد هاشمی - فاطمه گران قراخیلی
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.0.4