32nd International Conference on Electrical Engineering
Human Action Recognition in Still Images Using ConViT
Authors:
Seyed Rohollah Hosseyni (1), Sanaz Seyedin (2), Hassan Taheri (3)
1- Amirkabir University of Technology
2- Amirkabir University of Technology
3- Amirkabir University of Technology
Keywords:
Human action recognition, Still images, Convolutional Neural Network, Vision Transformer
Abstract:
Understanding the relationships between different parts of an image is crucial in many applications, including object recognition, scene understanding, and image classification. Although Convolutional Neural Networks (CNNs) have demonstrated impressive results in classifying and detecting objects, they lack the capability to extract the relationships between different parts of an image, which is a crucial factor in Human Action Recognition (HAR). To address this problem, this paper proposes a new module that uses a Vision Transformer (ViT) and functions like a convolutional layer. In the proposed model, the Vision Transformer can complement a convolutional neural network in a variety of tasks by helping it extract the relationships among the parts of an image. Compared with a simple CNN, the proposed model is shown to extract the meaningful parts of an image and suppress the misleading ones. Evaluated on the Stanford40 and PASCAL VOC 2012 action datasets, the proposed model achieves 95.5% and 91.5% mean Average Precision (mAP), respectively, which is promising compared with other state-of-the-art methods.
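The results above are reported as mean Average Precision. As a reminder of what that metric measures, here is a minimal sketch in plain Python. It assumes the non-interpolated definition of AP (the mean of the precision values at the rank of each positive image); the abstract does not state which AP variant or evaluation protocol the authors used, and the function names `average_precision` and `mean_average_precision` are hypothetical, chosen only for illustration.

```python
from typing import List, Sequence


def average_precision(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Non-interpolated AP for one action class (an assumed variant).

    scores: classifier confidence per image for this class.
    labels: 1 if the image truly shows this action, else 0.
    """
    # Rank images from most to least confident.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precisions: List[float] = []
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            # Precision among the top `rank` images, recorded at each positive.
            precisions.append(hits / rank)
    # A class with no positives has no defined AP; 0.0 is a convention chosen here.
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(
    per_class_scores: Sequence[Sequence[float]],
    per_class_labels: Sequence[Sequence[int]],
) -> float:
    """Unweighted mean of the per-class AP values."""
    aps = [
        average_precision(s, l)
        for s, l in zip(per_class_scores, per_class_labels)
    ]
    return sum(aps) / len(aps)
```

For a class with scores `[0.9, 0.8, 0.7, 0.6]` and labels `[1, 0, 1, 0]`, the positives sit at ranks 1 and 3, so AP is (1/1 + 2/3) / 2 = 5/6. Averaging such values over all action classes gives the mAP figure quoted for a dataset.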