0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
Human Action Recognition in Still Images Using ConViT
نویسندگان :
Seyed Rohollah Hosseyni
1
Sanaz Seyedin
2
Hassan Taheri
3
1- Amirkabir University of Technology
2- Amirkabir University of Technology
3- Amirkabir University of Technology
کلمات کلیدی :
Human action recognition،Still images،Convolutional Neural Network،Vision Transformer
چکیده :
Understanding the relationship between different parts of an image is crucial in a variety of applications, including object recognition, scene understanding, and image classification. Despite the fact that Convolutional Neural Networks (CNNs) have demonstrated impressive results in classifying and detecting objects, they lack the capability to extract the relationship between different parts of an image, which is a crucial factor in Human Action Recognition (HAR). To address this problem, this paper proposes a new module that functions like a convolutional layer that uses Vision Transformer (ViT). In the proposed model, the Vision Transformer can complement a convolutional neural network in a variety of tasks by helping it to effectively extract the relationship among various parts of an image. It is shown that the proposed model, compared to a simple CNN, can extract meaningful parts of an image and suppress the misleading parts. The proposed model has been evaluated on the Stanford40 and PASCAL VOC 2012 action datasets and has achieved 95.5% mean Average Precision (mAP) and 91.5% mAP results, respectively, which are promising compared to other state-of-the-art methods.
لیست مقالات
لیست مقالات بایگانی شده
Ultra-Compact and Fast All-Optical Half-Subtractor Photonic Crystal Logic Gate
Ehsan Veisi - Mahmood Seifouri - Saeed Olyaee
Enhancing Disaster Communication: Multi-UAV Optimization for Efficient Coverage
Amirhossein Solati - Javad Zeraatkar Moghaddam - Mehrdad Ardebilipour
Evanescent-to-Propagating Wave Conversion Using Continuous High-Order Dielectric Metasurfaces
Hamid Akbari Chelaresi - Pooria Salami - Leila Yousefi
Effect of structural connectivity weightings in graph-based analysis in Schizophrenia
Sara Khamseh - Farzaneh Keyvanfard
Reliability Evaluation of Distribution System Considering a Modified Electric Bus as a Mobile Energy Storage (Tehran E-Bus as a Case study)
Ali Kamali - Amir Soleimani - Seyed Vahid Nourbakhsh - Hassan Nehzati - Vahid Esfahanian - Mahmoud Oukati Sadegh
Addressing Death from Heart Failure Using RACER Algorithm
Mohammad Mirsafaei - Alireza Basiri
An Improved Real-Time Implementation of Adaptive Neuro-fuzzy Controller
Iman Gholizadeh - Haniye Raziyan - Reza Javidan
A High Dynamic Range Differential Rectifier for RF Energy Harvesting
Ataollah Mahsafar - Mohammad Yavari
Design of a Controllable and State-observable MEMS Nonlinear Resonator Based on the Awl-shaped Serpentine Spring
Ehsan Ranjbar - Amirabolfazl Suratgar
A Novel Model for Student's Mental Health Monitoring Based on Hard and Soft Data Fusion
Mohammad Fatahi - Masoud Alizadeh - Behzad Moshiri
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.2