0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Design and Implementation of a Flexible CNN Accelerator for Fast Real-Time Object Detection on FPGA
نویسندگان :
Emadodin Sakhaee
1
Mahdi Kalbasi
2
1- دانشگاه اصفهان
2- دانشگاه اصفهان
کلمات کلیدی :
Energy efficiency،real-time image processing،convolutional neural networks،hardware accelerator
چکیده :
In edge computing systems with limited resources, such as mobile devices and the Internet of Things, the use of Convolutional Neural Network (CNN) accelerators on FPGA has increasingly expanded. Ultrascale ZYNQ FPGAs offer scalability and flexibility for implementing deep learning-based object detection applications. However, this technology has low performance and limitations in achieving real-time processing. This paper addresses the optimization of the accelerator at the Register Transfer Level (RTL) to increase processing speed using low-power techniques in FPGA implementation. Therefore, a configurable accelerator design for a CNN-based object detection system at the RTL level on FPGA is proposed. We also present RTL optimization techniques that include techniques for disabling unnecessary clock cycles to reduce energy consumption and the use of the Posit number system format to increase calculation accuracy. The proposed system was tested with ResNet-20 and trained with the CIFAR-10 dataset. The weight data used for this test was provided by Tensil. Experimental results show that the proposed design process improves energy consumption, hardware utilization, and computational accuracy by 11%, up to 25%, and 4%, respectively.
لیست مقالات
لیست مقالات بایگانی شده
ارائه روش بهینه سازی نوین جهت جایابی بهینه تولیدات پراکنده (DG) در شبکه توزیع بمنظور کمینه کردن اثر فروافتادگی ولتاژ
پژمان هاشمیان - عبدالرضا علیرضاپوری
حل مسئله مجموعه مستقل d-فاصله با رویکرد CombOpt Zero
فاطمه نیکبخت نصرآبادی - حسین فلسفین - مهران صفایانی
Family of Soft-Switched Single-Switch Switched-Resonator Converters with Low Component Count
Maryam Hajilou - Siamak Khalili - Hosein Farzanehfard
Insulation System Optimization in Dry-Type Transformer Using Finite Element Method
Shohreh Saberi - Mehdi Bigdeli - Davood Azizian
Design and Practical Implementation of Internal Model Controller for Temperature Regulation of Thermoelectric Cell
Parastoo Kamali - Sanaz Iman Shayan - Mahshid Mousapour - Fatemeh Abdolsamadi - Salar Zeinali - Sadra Rafatnia
DRAU-Net: Double Residual Attention Mechanism for automatic MRI brain tumor segmentation
Mohammad Soltani gol - Morteza Fattahi - Hamid Soltanian zadeh - Samd Sheikhaei
امکانسنجی اقتصادی استقرار شبکههای مخابرات صنعتی در شرکت توزیع نیروی برق شهرستان مشهد (با تاکید بر نقش هوشمندسازی شبکه و بکارگیری انرژیهای سبز)
مهدی فیل سرائی - مهدی اسماعیلی پور - علیرضا باوندپور
A Bi-Level Attack-Defense Model for the Forecasting False Data Injection Attacks on the Integrated Energy Systems
Maryam Azimi - Hamed Delkhosh - Mahdi Ghaedi
Low power SRAM using an optimal number of split bit lines and single-ended sensing
Mahdie Nazemian - Sayed Masoud Sayedi
A Two Stage Low Power 0.73-4.4 GHz LNA Using Current Reuse and Noise Reduction Techniques
Sajjad Shojaei Baghini - Seyed-Ali Samareh-TaheriNasab - Samad Sheikhaei
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0