0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
Human Action Recognition in Still Images Using ConViT
نویسندگان :
Seyed Rohollah Hosseyni
1
Sanaz Seyedin
2
Hassan Taheri
3
1- Amirkabir University of Technology
2- Amirkabir University of Technology
3- Amirkabir University of Technology
کلمات کلیدی :
Human action recognition،Still images،Convolutional Neural Network،Vision Transformer
چکیده :
Understanding the relationship between different parts of an image is crucial in a variety of applications, including object recognition, scene understanding, and image classification. Despite the fact that Convolutional Neural Networks (CNNs) have demonstrated impressive results in classifying and detecting objects, they lack the capability to extract the relationship between different parts of an image, which is a crucial factor in Human Action Recognition (HAR). To address this problem, this paper proposes a new module that functions like a convolutional layer that uses Vision Transformer (ViT). In the proposed model, the Vision Transformer can complement a convolutional neural network in a variety of tasks by helping it to effectively extract the relationship among various parts of an image. It is shown that the proposed model, compared to a simple CNN, can extract meaningful parts of an image and suppress the misleading parts. The proposed model has been evaluated on the Stanford40 and PASCAL VOC 2012 action datasets and has achieved 95.5% mean Average Precision (mAP) and 91.5% mAP results, respectively, which are promising compared to other state-of-the-art methods.
لیست مقالات
لیست مقالات بایگانی شده
Optimization of Fifth Order Band-Pass Ladder Filter and Statistical Analysis of Reverse Problem
Sayyed Ali Alizadeh - Mahmoud Kamarei
Semi-supervised Deep Reinforcement Learning in Decentralized Multi-Agent Collision Avoidance and Path Planning in a Complex Environment
Marzie Parooei - Mehdi Tale Masouleh - Ahmad Kalhor
Low Complexity Single-Snapshot DOA Estimation Using Adaptive Filtering
Mojtaba Amiri - Mohammadreza Nargesi - Ali Olfat
Design and Manufacturing of a Programmable Spin Coater Based on a Brushless DC Motor
MirBehrad Mousavi - Saeed Javadizadeh - Seyed Ahmadreza Firoozabadi - Majid Badieirostami
Counterintuitive Benefits of Time Window Constraints: Enhancing Cost Efficiency in Vehicle Routing Problems
Mehdi Alimohammadi - Saeedeh Rezaee - Nasser Motahari Farimani - Mohammad Reza Akbarzadeh Totonchi
Improving Wind Turbines Blades Damage detection by using YOLO BoF and BoS
Reza Mohammadi - Saeed Sharifian
3D Modeling of a Superconducting Transition Edge Detector
Samaneh Ansari - Rana Nazifi - Mehdi Yaghoubi Arzefouni - Roya Mohajeri - Seyed Iman Mirzaei - Mehdi Fardmanesh
On the selection of superspreaders for advertising in science education using a new similarity measure
Sanaz Afsharian - Mohsen Heidari - Heidar Nosratzadeh - Mojgan Khalifeh
True Random Number Generator Relying on Multiple Entropy Source and Triple Oscillator for Cryptography Purposes
Somayeh Gholam Mehraban - Mohsen Jalali - Mostafa Azadbakht
Design and fabrication of a microstrip phase shifter based on liquid crystal
Sadegh Rajabi Doulataabadi - Seyed Hossein Hosseini Biuki - Farid Khoshkhati - Seyed Abbas Jazayeri Moghadas - Mohammad Masoudi Mohammadi - Mehdi Ahmadi-Boroujeni
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.8.0