0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Design and Implementation of a Flexible CNN Accelerator for Fast Real-Time Object Detection on FPGA
نویسندگان :
Emadodin Sakhaee
1
Mahdi Kalbasi
2
1- دانشگاه اصفهان
2- دانشگاه اصفهان
کلمات کلیدی :
Energy efficiency،real-time image processing،convolutional neural networks،hardware accelerator
چکیده :
In edge computing systems with limited resources, such as mobile devices and the Internet of Things, the use of Convolutional Neural Network (CNN) accelerators on FPGA has increasingly expanded. Ultrascale ZYNQ FPGAs offer scalability and flexibility for implementing deep learning-based object detection applications. However, this technology has low performance and limitations in achieving real-time processing. This paper addresses the optimization of the accelerator at the Register Transfer Level (RTL) to increase processing speed using low-power techniques in FPGA implementation. Therefore, a configurable accelerator design for a CNN-based object detection system at the RTL level on FPGA is proposed. We also present RTL optimization techniques that include techniques for disabling unnecessary clock cycles to reduce energy consumption and the use of the Posit number system format to increase calculation accuracy. The proposed system was tested with ResNet-20 and trained with the CIFAR-10 dataset. The weight data used for this test was provided by Tensil. Experimental results show that the proposed design process improves energy consumption, hardware utilization, and computational accuracy by 11%, up to 25%, and 4%, respectively.
لیست مقالات
لیست مقالات بایگانی شده
Virtual power plant participation in day-ahead and futures markets with a deep learning approach
Farzin Ghasemi Olanlari - Mohammad Fazel Dehghanniri - Turaj Amraee
طراحی تزویجگر پهن باند سه استابی فشرده میکرواستریپ برای استفاده در ترکیب کننده توان
صادق حیدری کاهکش - اکرم شیخی
مقایسه پارامترهای عملکردی کمپرسورهای 4:2 در تکنولوژی FinFET و GAA-NWFET
پگاه زکیان - راهبه نیارکی اصلی
Improving CycleGAN-VC2 Voice Conversion by Learning MCD-Based Evaluation and Optimization
Majid Behdad - Davood Gharavian
External Force Control with Disturbance Rejection for 6 DoF Manipulator
Zahra Bonakdar - Arefe Hamidipour - Hamed Ghafarirad
Numerical investigation of gain switching in Fano semiconductor lasers
Arash Hodaie - Hassan Kaatuzian - Aref Rasoulzadeh Zali
SGG-Net: Skeleton and Graph-Based Neural Network Approaches for Grasping Objects
AliReza Beigy - Farbod Azimmohseni - Ali Sabzejou - Mehdi Tale Masouleh - Ahmad Kalhor
Photonic Crystal-based Plasmonic Biosensor with Low-cost and High-sensitivity Properties
Mahdieh Ahmadi Motlagh - Mahdieh Bozorgi - Mahmood Rafaei-Booket
Formation of Singular Multi-Agent Systems via a New Iterative Learning Control Approach
Ali Raddanipour - Masoud Shafiee
Proposed Small Signal Dynamic Model for a Grid-Connected Battery Storage System
Zahra Moradi- Shahrbabak
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.3