The 33rd International Conference on Electrical Engineering
Design and Implementation of a Flexible CNN Accelerator for Fast Real-Time Object Detection on FPGA
Authors:
Emadodin Sakhaee (1)
Mahdi Kalbasi (2)
1- University of Isfahan
2- University of Isfahan
Keywords:
Energy efficiency, real-time image processing, convolutional neural networks, hardware accelerator
Abstract:
Convolutional Neural Network (CNN) accelerators on FPGA are increasingly used in resource-constrained edge computing systems such as mobile and Internet of Things devices. Zynq UltraScale FPGAs offer scalability and flexibility for implementing deep learning-based object detection applications. However, such implementations often deliver low performance and struggle to achieve real-time processing. This paper addresses the optimization of the accelerator at the Register Transfer Level (RTL) to increase processing speed while using low-power techniques in the FPGA implementation. To this end, we propose a configurable RTL accelerator design for a CNN-based object detection system on FPGA. We also present RTL optimization techniques, including disabling unnecessary clock cycles to reduce energy consumption and using the Posit number format to increase computational accuracy. The proposed system was tested with a ResNet-20 model trained on the CIFAR-10 dataset. The weight data used for this test was provided by Tensil. Experimental results show that the proposed design process improves energy consumption, hardware utilization, and computational accuracy by 11%, up to 25%, and 4%, respectively.
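As a rough illustration of the Posit number format mentioned in the abstract, the following Python sketch decodes a posit bit pattern into a floating-point value. It is not the authors' RTL. The function name `decode_posit` and the default configuration (8 bits, 0 exponent bits) are assumptions chosen for illustration, since the abstract does not state the posit width or exponent size used in the accelerator.

```python
def decode_posit(bits: int, n: int = 8, es: int = 0) -> float:
    """Decode an n-bit posit with es exponent bits (hypothetical helper)."""
    mask = (1 << n) - 1
    bits &= mask
    if bits == 0:
        return 0.0
    if bits == 1 << (n - 1):
        return float("nan")  # NaR (not a real)

    # Negative posits are stored in two's complement form.
    negative = (bits >> (n - 1)) == 1
    if negative:
        bits = (-bits) & mask

    # The n-1 bits after the sign: regime, then exponent, then fraction.
    body = format(bits & (mask >> 1), f"0{n - 1}b")

    # Regime: a run of identical bits ended by the opposite bit (or the end).
    first = body[0]
    run = len(body) - len(body.lstrip(first))
    k = run - 1 if first == "1" else -run

    # Skip the regime and its terminating bit.
    rest = body[run + 1:]

    # Exponent bits that fall off the end are treated as zeros.
    exp_bits = rest[:es].ljust(es, "0")
    e = int(exp_bits, 2) if es else 0

    # Whatever remains is the fraction, with an implicit leading 1.
    frac_bits = rest[es:]
    frac = int(frac_bits, 2) / (1 << len(frac_bits)) if frac_bits else 0.0

    value = 2.0 ** (k * (1 << es) + e) * (1.0 + frac)
    return -value if negative else value


if __name__ == "__main__":
    for pattern in (0x40, 0x50, 0x60, 0x20, 0xC0):
        print(f"0x{pattern:02X} -> {decode_posit(pattern)}")
```

The variable-length regime field is what gives posits more fraction bits near 1.0 than a fixed-point or small floating-point format of the same width, which is one commonly cited reason for using them with neural network weights.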