0% Complete
صفحه اصلی
/
سی و یکمین کنفرانس بین المللی مهندسی برق
Speech Emotion Recognition Using Transfer Learning and Self-Supervised Speech Representation Learning
نویسندگان :
Marziye Azad
1
Babak Nasersharif
2
1- دانشگاه صنعتی خواجه نصیرالدین طوسی
2- دانشگاه صنعتی خواجه نصیرالدین طوسی
کلمات کلیدی :
HuBERT،Conformer،self-supervised learning،speech emotion recognition
چکیده :
Self-supervised speech representation learning (S3RL) models like wav2vec2.0, Hidden-unit BERT (HuBERT), and WavLM are trained with a great amount of speech data and subsequently give a general purpose speech representation that then needs to be finetuned for different speech processing tasks like ASR. Despite these models’ good performance, they suffer from massive structures and a great number of parameters which makes their finetuning inapplicable for low-resource tasks like speech emotion recognition. In this paper, a small model is proposed for speech emotion recognition based on the Hubert model by transferring the Hubert convolutional feature encoder and substituting all of its transformers with a simple conformer block. Then this simple model is trained with emotional speech signals. The experimental results demonstrate that the proposed model has comparable results with other well-performing S3RL models.
لیست مقالات
لیست مقالات بایگانی شده
Simulation of planar organic-inorganic perovskite light-emitting diode
Morteza Yarahmadi - Elnaz Yazdani - Mohammad Kazem Moravvej-Farshi
پیچش زمانی عمیق برای انطباق چندگانه سری های زمانی
سیدعلیرضا نوربخش - نرجس الهدی محمدزاده
A Technical-Managerial Framework for Determining Periodic Performance Indices and Operating Ranges of Power Grid Frequency
Hamed Delkhosh - Hossein Seifi - Sajjad Gholamnejad - Morteza Yousefian
Network-based functional connectivity in MDD with suicide ideation before and after TMS: An fMRI case study
Moslem Khafi - Morteza Fattahi - Hamid Soltanian-Zadeh - Reza Rostami
Kalman Filter Fusion Based on Interactive Multiple Model for Target Tracking in Wireless Sensor Networks
Zahra Zamani - Behrouz Safarinejadian
The Effect of Cavity Length on Two-State Quantum Dot Laser Performance
Gholamreza Babaabasi - Mohammad Mohsen Sheikhey - Sara Alaei
Brain Tumor Segmentation using Multimodal MRI and Convolutional Neural Network
Nazila Loghmani - Roqaie Moqadam - Armin Allahverdy
One-Way Edge Modes Induced by Synthetic Magnetic Field in Time-Varying LC Circuit
Sadeq Bahmani - Amir Nader Askarpour
A compact 5G MIMO antenna with reduced mutual coupling
Marziyeh Amiri - Ali Ghafoorzadeh-yazdi - Abbas-Ali Heidari
The Use of Additive Decomposition and Deep Neural Network for Photovoltaic Power Forecasting
Fariba Dehghan - Mohsen Parsa Moghaddam - Maryam Imani
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.3