32nd International Conference on Electrical Engineering
Human Action Recognition in Still Images Using ConViT
Authors:
Seyed Rohollah Hosseyni (1), Sanaz Seyedin (2), Hassan Taheri (3)
1- Amirkabir University of Technology
2- Amirkabir University of Technology
3- Amirkabir University of Technology
Keywords:
Human action recognition, Still images, Convolutional Neural Network, Vision Transformer
Abstract:
Understanding the relationship between different parts of an image is crucial in a variety of applications, including object recognition, scene understanding, and image classification. Although Convolutional Neural Networks (CNNs) have demonstrated impressive results in classifying and detecting objects, they lack the capability to extract the relationship between different parts of an image, which is a crucial factor in Human Action Recognition (HAR). To address this problem, this paper proposes a new module that uses a Vision Transformer (ViT) and functions like a convolutional layer. In the proposed model, the Vision Transformer can complement a convolutional neural network in a variety of tasks by helping it extract the relationships among the various parts of an image. Compared to a simple CNN, the proposed model is shown to extract the meaningful parts of an image and suppress the misleading ones. Evaluated on the Stanford40 and PASCAL VOC 2012 action datasets, the proposed model achieves 95.5% and 91.5% mean Average Precision (mAP), respectively, which is promising compared to other state-of-the-art methods.
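The abstract reports results as mean Average Precision (mAP). As a rough illustration of how such a number is typically obtained, the sketch below computes a non-interpolated AP per class and averages over classes. It is only a minimal sketch: the abstract does not state the exact evaluation protocol (for example, whether an interpolated PASCAL VOC variant is used), and the function names, class names, and toy scores are hypothetical, not taken from the paper.

```python
from typing import Dict, List, Sequence, Tuple


def average_precision(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Non-interpolated AP for one class.

    scores: classifier confidence per image for this class.
    labels: 1 if the image truly shows this class, else 0.
    """
    # Rank images from most to least confident for this class.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            # Precision measured at the rank of each true positive.
            precision_sum += hits / rank
    # With no positives, AP is undefined; this sketch returns 0.0 by convention.
    return precision_sum / hits if hits else 0.0


def mean_average_precision(
    per_class: Dict[str, Tuple[List[float], List[int]]]
) -> float:
    """Average the per-class AP values over all classes."""
    aps = [average_precision(s, l) for s, l in per_class.values()]
    return sum(aps) / len(aps)


# Toy data invented for illustration only (not results from the paper).
toy = {
    "action_a": ([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]),
    "action_b": ([0.7, 0.6, 0.2, 0.1], [1, 1, 0, 0]),
}
print(f"mAP = {mean_average_precision(toy):.4f}")
```

In this toy case, `action_a` has its two positives at ranks 1 and 3, so its AP is (1/1 + 2/3) / 2, while `action_b` ranks both positives first and gets an AP of 1.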