31st International Conference on Electrical Engineering
Speech Emotion Recognition Using Transfer Learning and Self-Supervised Speech Representation Learning
Authors:
Marziye Azad (1)
Babak Nasersharif (2)
1- K. N. Toosi University of Technology
2- K. N. Toosi University of Technology
Keywords:
HuBERT, Conformer, self-supervised learning, speech emotion recognition
Abstract:
Self-supervised speech representation learning (S3RL) models such as wav2vec 2.0, Hidden-unit BERT (HuBERT), and WavLM are trained on large amounts of speech data and yield general-purpose speech representations that are then fine-tuned for downstream speech processing tasks such as automatic speech recognition (ASR). Despite their good performance, these models have very large architectures and parameter counts, which makes fine-tuning them impractical for low-resource tasks such as speech emotion recognition. This paper proposes a small speech emotion recognition model based on HuBERT: the HuBERT convolutional feature encoder is transferred, and all of its transformer layers are replaced with a simple Conformer block. The resulting model is then trained on emotional speech signals. Experimental results show that the proposed model performs comparably to other well-performing S3RL models.
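To give a feel for the component that is transferred, the following minimal sketch computes how many frames a HuBERT-style convolutional feature encoder produces and how many convolution weights it holds. The layer layout in `CONV_LAYERS` is the published HuBERT Base / wav2vec 2.0 configuration and is an assumption here, since the abstract does not say which HuBERT variant is used; the function names are illustrative only, and bias and normalization parameters are left out of the count.

```python
from math import prod

# Assumed HuBERT Base feature-encoder layout (not stated in the abstract):
# one (out_channels, kernel_size, stride) tuple per 1-D convolution layer.
CONV_LAYERS = [(512, 10, 5)] + [(512, 3, 2)] * 4 + [(512, 2, 2)] * 2


def num_frames(num_samples, layers=CONV_LAYERS):
    """Number of output frames for a waveform of num_samples samples.

    Uses the standard output-length rule for an unpadded 1-D convolution
    with dilation 1: out = (in - kernel) // stride + 1.
    """
    length = num_samples
    for _, kernel, stride in layers:
        if length < kernel:
            return 0  # input too short to produce a single frame
        length = (length - kernel) // stride + 1
    return length


def total_stride(layers=CONV_LAYERS):
    """Hop size in samples between consecutive output frames."""
    return prod(stride for _, _, stride in layers)


def conv_weight_count(layers=CONV_LAYERS, in_channels=1):
    """Convolution weights only (biases and normalization layers excluded)."""
    total = 0
    for out_channels, kernel, _ in layers:
        total += in_channels * out_channels * kernel
        in_channels = out_channels
    return total


if __name__ == "__main__":
    sample_rate = 16_000  # assumed sampling rate in Hz
    print("frames for 1 s of audio:", num_frames(sample_rate))
    print("hop size in ms:", 1000 * total_stride() / sample_rate)
    print("conv weights:", conv_weight_count())
```

Under these assumptions the encoder emits one 512-dimensional frame roughly every 20 ms, and that frame sequence is what a downstream Conformer block and emotion classifier would consume in place of the original transformer stack.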
List of papers
List of archived papers
Joint Fairness, Fragmentation, and Physical Layer Impairments Aware Routing, Spectrum and Modulation Level Allocation in Elastic Optical Networks
Hassan Khanahmadzadeh - Arash Rezaee - Lotfollah Beygi
Design of a High-Efficiency RF Energy Harvesting System
Saeed Abbasi FallahPour - Shokrollah Karimian - Esfandiar Mehrshahi
Active and Passive Beamforming for Secure Wireless Communication via Star-RIS under imperfect CSI
Seyedeh Reyhane Shahcheragh - Kamal Mohamed-pour
Vessel Segmentation in Retinal Images Using an Adaptive Thresholding Method Based on Local and Global Information
زهرا نورانی آتشگاه - محمد آراسته - آیدا فولادی وندا
An Enhanced SLAM Method Using ICP Algorithm for Autonomous Mobile Robots Navigation
Hasan Enami Eraghi - Mohammad Reza Taban - Sayed Farzad Bahreinian - Mohammad Reza Jabbari
Computational Insights into the Superior Performance of ψ-Graphene in Li-S Batteries: A DFT Study
Donna Rashidi - Maryam Abbasi - Leila Sadeghbeigy - Matin Bakhtavari - Ebrahim Nadimi
Breast tumor detection using graphene-based terahertz patch antenna
Zahra Yasaghi - Ayaz Ghorbani - Gholamreza Moradi
Forecasting Tehran Stock Exchange Trend with Time Series Analysis, Fundamental Data, and Sentiment Analysis in News
Mahdi Shamisavi - Amir Jahanshahi
Propagation of Measurement Errors in the Euler Kinematic Equations
Mojtaba Fazelinia - Saeed Ebadollahi - Soheil Ganjefar
Temperature-Sensitive Tunable Nanoantenna Based on Phase Change Material (Ge2Sb2Te5) Substrate
Daniyal Khosh Maram - Seyed Asad Amirhosseini