31st International Conference on Electrical Engineering
Vision Transformer and Parallel Convolutional Neural Network for Speech Emotion Recognition
Authors:
Saber Hashemi (1)
Mohammad Asgari (2)
1- IRIB University
2- IRIB University
Keywords:
speech emotion recognition, vision transformer, convolutional neural network, attention mechanism
Abstract:
The Vision Transformer (ViT) is a recent approach to image processing tasks. It splits an image into patches and converts them into a sequence of vectors suitable for the transformer architecture. This paper applies the ViT approach to speech emotion recognition. Unlike the original ViT, which splits the image into square patches, we use time frames as patches. Alongside the frame-based ViT, which learns global features, we use a parallel convolutional neural network that extracts local features and exploits the two-dimensional structure of the input. Mel-frequency cepstral coefficients (MFCCs) extracted from the audio files serve as input to the proposed network. On the RAVDESS dataset, this model achieves an unweighted accuracy of 79.2%.
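To make the difference between square patches and time-frame patches concrete, the following is a minimal sketch, not the authors' implementation. It assumes the MFCC input is laid out with one row per coefficient and one column per time frame. The function names `square_patches` and `frame_patches` and the toy 4 x 6 input are hypothetical, chosen only for illustration. The abstract does not state the number of coefficients, the frame settings, or how tokens are projected and position-encoded, so none of that is shown.

```python
from typing import List

# Assumed layout: rows = MFCC coefficients, columns = time frames.
Matrix = List[List[float]]


def square_patches(x: Matrix, size: int) -> List[List[float]]:
    """Standard ViT-style patching: flatten each non-overlapping size x size block."""
    n_rows, n_cols = len(x), len(x[0])
    patches = []
    for r in range(0, n_rows - size + 1, size):
        for c in range(0, n_cols - size + 1, size):
            patches.append(
                [x[r + i][c + j] for i in range(size) for j in range(size)]
            )
    return patches


def frame_patches(x: Matrix) -> List[List[float]]:
    """Frame-based patching: each time frame (one column) becomes one token."""
    n_rows, n_cols = len(x), len(x[0])
    return [[x[r][t] for r in range(n_rows)] for t in range(n_cols)]


# Toy input (assumed sizes): 4 coefficients x 6 frames, value = 10 * row + frame.
mfcc = [[float(10 * r + t) for t in range(6)] for r in range(4)]

square_tokens = square_patches(mfcc, 2)  # each token mixes 2 coefficients and 2 frames
frame_tokens = frame_patches(mfcc)       # each token holds all coefficients of one frame
```

With frame-based patching the token sequence follows the time axis directly, so attention between tokens relates whole frames to each other, while a square patch covers only part of the coefficient range over a short span of frames.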