0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Design and Implementation of a Flexible CNN Accelerator for Fast Real-Time Object Detection on FPGA
نویسندگان :
Emadodin Sakhaee
1
Mahdi Kalbasi
2
1- دانشگاه اصفهان
2- دانشگاه اصفهان
کلمات کلیدی :
Energy efficiency،real-time image processing،convolutional neural networks،hardware accelerator
چکیده :
In edge computing systems with limited resources, such as mobile devices and the Internet of Things, the use of Convolutional Neural Network (CNN) accelerators on FPGA has increasingly expanded. Ultrascale ZYNQ FPGAs offer scalability and flexibility for implementing deep learning-based object detection applications. However, this technology has low performance and limitations in achieving real-time processing. This paper addresses the optimization of the accelerator at the Register Transfer Level (RTL) to increase processing speed using low-power techniques in FPGA implementation. Therefore, a configurable accelerator design for a CNN-based object detection system at the RTL level on FPGA is proposed. We also present RTL optimization techniques that include techniques for disabling unnecessary clock cycles to reduce energy consumption and the use of the Posit number system format to increase calculation accuracy. The proposed system was tested with ResNet-20 and trained with the CIFAR-10 dataset. The weight data used for this test was provided by Tensil. Experimental results show that the proposed design process improves energy consumption, hardware utilization, and computational accuracy by 11%, up to 25%, and 4%, respectively.
لیست مقالات
لیست مقالات بایگانی شده
طراحی خودرمزگذار متغیر جهت تشخیص عیب در بیرینگهای غلتشی
مریم آهنگ - مهدی علیاری شورهدلی
A New Approach to Determine Maximum Allowable Penetration level of LSPVPPs Considering Transient Angle Stability
Siavash Yari - Hamid Khoshkhoo
Design and Demonstration of a Novel Microfluidic Channel for Trapping Circulating Tumor Cells with Magnetophoresis
Atin Bakhshi - Seyed Ehsan Hosseininasab - Vahid Ghafouri - Mehdi Rahmanian - Majid Badiei Rostami
ارائه روشی جهت بهبود عملکرد شبکههای بیسیم حسگر ناهمگون مبتنی بر برداشت انرژی
محمد فرشته حکمت - علیرضا کشاورز حداد
Fast and Low Power Modified Carry Look-Ahead Adder
Sanaz Salem - Amir hossein Owji
Holographic Technique Inspired Multi-Beam Cylindrical Leaky-Wave Antenna
Mohammad Amin Chaychi Zadeh - Nader Komjani - Sajjad Zohrevand
Design and Implementation of an RF Module for UHF PD Measurement
Vahid Javandel - Asghar Akbari - Mohammad Ardebili - Peter Werle
FPGA-Based Multiplier with a New Approximate Full Adder for Error-Resilient Applications
Ali Ranjbar - Elham Esmaeili - Roghayeh Rafieisangari - Nabiollah Shiri
تشخیص و تفکیک برخط خطای مدار باز کلید در اینورترهای تک فاز PWM
مهدی اره پناهی - علی اکبر سلیمی
ارائه یک روش دو مرحلهای مبتنی بر حسگری فشرده برای تخمین زاویه ورود در آرایه
مهدی محمدی پرستو - محمود مدرس هاشمی
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.4.2