31st International Conference on Electrical Engineering
Speech Emotion Recognition Using Transfer Learning and Self-Supervised Speech Representation Learning
Authors:
Marziye Azad 1
Babak Nasersharif 2
1- K. N. Toosi University of Technology
2- K. N. Toosi University of Technology
Keywords:
HuBERT, Conformer, self-supervised learning, speech emotion recognition
Abstract:
Self-supervised speech representation learning (S3RL) models such as wav2vec 2.0, Hidden-unit BERT (HuBERT), and WavLM are trained on large amounts of speech data and yield a general-purpose speech representation that must then be fine-tuned for downstream speech processing tasks such as automatic speech recognition (ASR). Despite their good performance, these models have large architectures with a great number of parameters, which makes fine-tuning them impractical for low-resource tasks such as speech emotion recognition. This paper proposes a small model for speech emotion recognition based on HuBERT: the HuBERT convolutional feature encoder is transferred, and all of its transformer layers are replaced with a simple Conformer block. This small model is then trained on emotional speech signals. Experimental results show that the proposed model achieves results comparable to those of other well-performing S3RL models.
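
To give a sense of what the transferred convolutional feature encoder hands to the Conformer block, the following minimal sketch computes how many feature frames a raw waveform produces. It is not taken from the paper: the (kernel, stride) values are assumed to match the publicly released HuBERT Base feature encoder (seven unpadded 1-D convolutions on 16 kHz audio), and the names `CONV_LAYERS`, `num_frames`, `total_stride`, and `receptive_field` are hypothetical helpers introduced only for illustration.

```python
# Assumption: (kernel, stride) pairs of the publicly released HuBERT Base
# convolutional feature encoder; the abstract itself does not list them.
CONV_LAYERS = [(10, 5), (3, 2), (3, 2), (3, 2), (3, 2), (2, 2), (2, 2)]


def num_frames(num_samples, layers=CONV_LAYERS):
    """Number of output frames for a waveform of num_samples samples."""
    length = num_samples
    for kernel, stride in layers:
        if length < kernel:
            return 0  # input too short to produce any frame
        # Output length of an unpadded 1-D convolution.
        length = (length - kernel) // stride + 1
    return length


def total_stride(layers=CONV_LAYERS):
    """Hop between consecutive output frames, in input samples."""
    hop = 1
    for _, stride in layers:
        hop *= stride
    return hop


def receptive_field(layers=CONV_LAYERS):
    """Number of input samples that influence a single output frame."""
    field, jump = 1, 1
    for kernel, stride in layers:
        field += (kernel - 1) * jump
        jump *= stride
    return field


if __name__ == "__main__":
    sample_rate = 16000  # assumed sampling rate in Hz
    print("frames per second of audio:", num_frames(sample_rate))
    print("frame hop in ms:", 1000 * total_stride() / sample_rate)
    print("frame span in ms:", 1000 * receptive_field() / sample_rate)
```

Under these assumptions, the encoder emits one feature vector roughly every 20 ms, each covering about 25 ms of audio, so the sequence that the Conformer block must model is far shorter than the raw waveform.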