0% Complete
صفحه اصلی
/
سی و یکمین کنفرانس بین المللی مهندسی برق
Vision Transformer and Parallel Convolutional Neural Network for Speech Emotion Recognition
نویسندگان :
Saber Hashemi
1
Mohammad Asgari
2
1- دانشگاه صداوسیما
2- دانشگاه صدا و سیما
کلمات کلیدی :
speech emotion recognition،vision transformer،convolutional neural network،attention mechanism
چکیده :
Vision transformer (ViT) is a new approach for image processing tasks. The vision transformer splits the image into patches and converts it into a sequence of vectors. This sequence is suitable for the transformer structure. This paper uses the ViT method for speech emotion recognition. Unlike ViT, which splits the image into square patches, we use time frames as patches. Alongside using the frame-based ViT to benefit from its ability to learn global features, we are using a convolutional neural network. The convolutional neural network extracts local features and focuses on the two-dimensional structure of the input. Mel-Frequency Cepstral Coefficients extracted from audio files are used as input for the proposed neural network. Using this model in the RAVDESS dataset, we achieved an unweighted accuracy of 79.2%.
لیست مقالات
لیست مقالات بایگانی شده
Cloudy: A Pythonic Cloud Simulator
Ahmad Siavashi - Mahmoud Momtazpour
Emotion Recognition from EEG Signals During REM Sleep
Asghar Zarei - Ali Mahmoudi
Significant Methods to Improve Control of Quadrotors, Hexarotors and Octorotors
Peyman Amiri - Nima Sina - Mohammad Danesh
Design and fabrication of a microstrip phase shifter based on liquid crystal
Sadegh Rajabi Doulataabadi - Seyed Hossein Hosseini Biuki - Farid Khoshkhati - Seyed Abbas Jazayeri Moghadas - Mohammad Masoudi Mohammadi - Mehdi Ahmadi-Boroujeni
Weak GPS Signal Acquisition Based on Wavelet Transform Denoising and Deep Learning Method
Navid Moradi - Mohsen Nezhadshahbodaghi - Mohammad-Reza Mosavi
Design and Implementation of CAN Bus Monitoring Module for Lithium Battery Management System
Shakila Kazempourdizaji - Amir Mohammad Moazami Goudarzi - Majid Shalchian
A Novel method for power transmission lines Protection Against the Sub-Synchronous Resonance Using thyristor-based reactive power compensation
Mohammadreza Mousavi Khademi - Mehdi Zareian Jahromi
Wideband and Multi-band Frequency Selective Surfaces for Microwave Shielding
Mahmoodreza Marzban - Abbas Alighanbari
Bilabial Consonants Recognition in CV Persian Syllable Based on Computer Vision
Melika Khajeh - Azam Bastanfard - Dariush Amirkhani
طراحی و پیاده سازی ژنراتور تولید کننده پالس PFN-Marx فشرده و ماژولار برای تولید پالس 25 کیلوولتی
محمد حسین رنجبر - محمدجواد گل علی پور
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0