32nd International Conference on Electrical Engineering
Human Action Recognition in Still Images Using ConViT
Authors:
Seyed Rohollah Hosseyni (1), Sanaz Seyedin (2), Hassan Taheri (3)
1- Amirkabir University of Technology
2- Amirkabir University of Technology
3- Amirkabir University of Technology
Keywords:
Human action recognition, Still images, Convolutional Neural Network, Vision Transformer
Abstract:
Understanding the relationships between different parts of an image is crucial in a variety of applications, including object recognition, scene understanding, and image classification. Although Convolutional Neural Networks (CNNs) have demonstrated impressive results in classifying and detecting objects, they lack the capability to extract the relationships between different parts of an image, which is a crucial factor in Human Action Recognition (HAR). To address this problem, this paper proposes a new module, built on the Vision Transformer (ViT), that functions like a convolutional layer. In the proposed model, the Vision Transformer complements a convolutional neural network across a variety of tasks by helping it extract the relationships among the parts of an image effectively. It is shown that the proposed model, compared to a simple CNN, can extract the meaningful parts of an image and suppress the misleading ones. The proposed model has been evaluated on the Stanford40 and PASCAL VOC 2012 action datasets, where it achieves a mean Average Precision (mAP) of 95.5% and 91.5%, respectively; these results are promising compared to other state-of-the-art methods.
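For readers unfamiliar with the reported metric, the following is a minimal sketch of how a mean Average Precision figure can be computed from per-class classifier scores. It assumes the plain non-interpolated definition of Average Precision (the mean of the precision values at the rank of each positive sample); the official evaluation protocols of the benchmarks named above may use a different variant. The function names and the toy class names and scores are hypothetical and are not taken from the paper.

```python
def average_precision(scores, labels):
    """Non-interpolated AP for one class (an assumed variant, see lead-in).

    scores: classifier confidence per image; labels: 1 if the image
    truly shows the action, 0 otherwise.
    """
    # Rank images from most to least confident.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precisions = []
    for rank, (_, is_positive) in enumerate(ranked, start=1):
        if is_positive:
            hits += 1
            # Precision among the top `rank` images, recorded at each positive.
            precisions.append(hits / rank)
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(per_class):
    """per_class maps a class name to a (scores, labels) pair."""
    aps = [average_precision(s, l) for s, l in per_class.values()]
    return sum(aps) / len(aps)


# Hypothetical toy data: two action classes, four images each.
toy = {
    "riding_bike": ([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]),
    "reading": ([0.7, 0.6, 0.2, 0.1], [1, 1, 0, 0]),
}
print(f"mAP = {mean_average_precision(toy):.3f}")
```

Averaging over classes rather than over images gives every action category equal weight, regardless of how many test images it has.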