0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
Human Action Recognition in Still Images Using ConViT
نویسندگان :
Seyed Rohollah Hosseyni
1
Sanaz Seyedin
2
Hassan Taheri
3
1- Amirkabir University of Technology
2- Amirkabir University of Technology
3- Amirkabir University of Technology
کلمات کلیدی :
Human action recognition،Still images،Convolutional Neural Network،Vision Transformer
چکیده :
Understanding the relationship between different parts of an image is crucial in a variety of applications, including object recognition, scene understanding, and image classification. Despite the fact that Convolutional Neural Networks (CNNs) have demonstrated impressive results in classifying and detecting objects, they lack the capability to extract the relationship between different parts of an image, which is a crucial factor in Human Action Recognition (HAR). To address this problem, this paper proposes a new module that functions like a convolutional layer that uses Vision Transformer (ViT). In the proposed model, the Vision Transformer can complement a convolutional neural network in a variety of tasks by helping it to effectively extract the relationship among various parts of an image. It is shown that the proposed model, compared to a simple CNN, can extract meaningful parts of an image and suppress the misleading parts. The proposed model has been evaluated on the Stanford40 and PASCAL VOC 2012 action datasets and has achieved 95.5% mean Average Precision (mAP) and 91.5% mAP results, respectively, which are promising compared to other state-of-the-art methods.
لیست مقالات
لیست مقالات بایگانی شده
A Novel Application of the Travel Profile for the Electrical Bus in Electric Systems for Transportation
Mohsen Tamaddon - Mohsen Davoodi
Compact Multiband HMSIW Antenna Loaded with Complementary Split Ring Resonators
Rasol Zayer - Mohamamd Naghi Azarmanesh - Javad Nourinia - Changiz Ghobadi - Farzad Alizadeh - Bahman Mohammadi
A Hybrid Computer-aided Diagnosis System For Central Obesity Screening In A Large Sample Of Iranian Children and Adolescents
Amirhossein Koochekian - Morteza Farahi - Hamid Reza Sadr manouchehri Naeini - Mohammad Reza Mohebian - Hamid Reza Marateb - Marjan Mansourian - Roya Kelishadi
Low Complexity Single-Snapshot DOA Estimation Using Adaptive Filtering
Mojtaba Amiri - Mohammadreza Nargesi - Ali Olfat
A Subsurface Microwave Imaging System Based on the Combination of Sub-Band-Subspace Images
Mohammad Ramezaninia - Mohammad Zoofaghari - Abolfazl Gheibollahi - Abbas Ali Heidari
Design and fabrication tip tapered fiber optic dopamine sensor based on LSPR
Roksana Esmaeilpour - Mohammad Ismail zibaii - Masoumeh Barkand - Marzieh Pajouhandeh - Soroush Rostami - Mehdi Banihashemi - Mohammad-Mahdi Babakhani-fard
Modulation Classification with Convolutional Neural Network based Deep Learning in Elastic Optical Network
Ehsan Varasteh - Seyed Sadra Kashef - Morteza Valizadeh - Mehdi Ranjbar Zefreh
Improved Low Voltage Ride Through by A STATCOM Based on Neutral Point Piloted (NPP) Multilevel Inverter
Yousef Neyshabouri - Mohammad Farhadi-kangarlu
Enhanced Optimal Droop Control for Effective Load Sharing in an Islanded Microgrid
Rafi Zahedi - Hassan Rastegar
A Two Stage Low Power 0.73-4.4 GHz LNA Using Current Reuse and Noise Reduction Techniques
Sajjad Shojaei Baghini - Seyed-Ali Samareh-TaheriNasab - Samad Sheikhaei
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0