31st International Conference on Electrical Engineering
Vision Transformer and Parallel Convolutional Neural Network for Speech Emotion Recognition
Authors:
Saber Hashemi 1
Mohammad Asgari 2
1- IRIB University
2- IRIB University
Keywords:
speech emotion recognition, vision transformer, convolutional neural network, attention mechanism
Abstract:
The Vision Transformer (ViT) is a recent approach to image processing tasks. It splits an image into patches and converts them into a sequence of vectors suitable for the transformer architecture. This paper applies ViT to speech emotion recognition. Unlike the standard ViT, which splits the image into square patches, we use time frames as patches. Alongside the frame-based ViT, which learns global features, we use a parallel convolutional neural network that extracts local features and exploits the two-dimensional structure of the input. Mel-frequency cepstral coefficients (MFCCs) extracted from the audio files serve as the input to the proposed network. On the RAVDESS dataset, the model achieves an unweighted accuracy of 79.2%.
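The difference between square patches and time-frame patches can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `square_patches` and `frame_patches` are hypothetical, the toy matrix stands in for real MFCCs, and it assumes one time frame per token, which the abstract does not state explicitly. MFCC extraction, the transformer, and the CNN branch are omitted.

```python
from typing import List

# Rows are MFCC coefficients, columns are time frames.
Matrix = List[List[float]]


def square_patches(x: Matrix, p: int) -> List[List[float]]:
    """Standard ViT-style tokens: non-overlapping p x p patches, flattened row-major."""
    n_rows, n_cols = len(x), len(x[0])
    tokens = []
    for r in range(0, n_rows - p + 1, p):
        for c in range(0, n_cols - p + 1, p):
            tokens.append([x[r + i][c + j] for i in range(p) for j in range(p)])
    return tokens


def frame_patches(x: Matrix) -> List[List[float]]:
    """Frame-based tokens: each time frame (one column) becomes one token."""
    n_rows, n_cols = len(x), len(x[0])
    return [[x[r][t] for r in range(n_rows)] for t in range(n_cols)]


# Toy matrix: 6 coefficients x 4 frames (placeholder values, not real features).
mfcc = [[float(10 * r + t) for t in range(4)] for r in range(6)]

sq = square_patches(mfcc, p=2)
fr = frame_patches(mfcc)
print(len(sq), len(sq[0]))  # number of square-patch tokens, token length
print(len(fr), len(fr[0]))  # number of frame tokens, token length
```

With frame tokens, the sequence length equals the number of time frames and each token carries the full coefficient vector of one frame, so attention operates directly across time.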
List of archived papers
Design, Simulation and Analysis of a MIM Plasmonic Sensor Based on the Cross-Shaped Resonator
Setare Farzane - Hassan Kaatuzian - Leila Hajshahvaladi
Design of an Optical Current Transformer for High-Voltage Gas-Insulated Switchgear-Part II: Focus on GIS Compartment Design
Reza Babaei - Asghar Akbari - Arash Moradi
Integrated expansion planning of the distribution network and distributed generations considering energy storage systems, electric vehicles charging stations, and daily load modeling
Ahmad Mohammadi Pour - Mehrdad Setayesh Nazar
Accurate Methods for Automatic Detection of Characteristic Points in Electrocardiograms
Seyedeh Mersedeh Bagheri - Mohammad Pooyan
Control of optical bistability in one-dimensional photonic crystals with a central layer doped with Lambda-type three-level atoms using atomic and laser parameters
Akbar Ashrafabadi - Siamak Khademi - Ghasem Naeimi
Numerical Approach on Modeling of Perovskite Solar Cells Based on Coupled Ion Vacancy and Charge Carrier Dynamics
Hamed Abnavi - Daniyal Khosh Maram - Hanieh Talati Aghdam
Fabrication of a resistive hydrogen sulfide gas sensor using a composite of titanium oxide nanoparticles and reduced graphene oxide
محمد دیانتی - سمانه حامدی
Two-Stage Stochastic Modeling for Energy Management and Control of Virtual Power Plants: Addressing Renewable Energy Challenges
Mohammadreza Mousavi Khademi - Mehdi Zareian Jahromi
Energy Efficiency Evaluation of a Line-Start Permanent Magnet Assisted Synchronous Reluctance Motor for Pump Application
Ali Jamali-Fard - Mojtaba Mirsalim
Robust controller design for a nonlinear model of COVID-19
آرمان مرزبان - الهام امینی بروجنی