0% Complete
صفحه اصلی
/
سی و یکمین کنفرانس بین المللی مهندسی برق
Speech Emotion Recognition Using Transfer Learning and Self-Supervised Speech Representation Learning
نویسندگان :
Marziye Azad
1
Babak Nasersharif
2
1- دانشگاه صنعتی خواجه نصیرالدین طوسی
2- دانشگاه صنعتی خواجه نصیرالدین طوسی
کلمات کلیدی :
HuBERT،Conformer،self-supervised learning،speech emotion recognition
چکیده :
Self-supervised speech representation learning (S3RL) models like wav2vec2.0, Hidden-unit BERT (HuBERT), and WavLM are trained with a great amount of speech data and subsequently give a general purpose speech representation that then needs to be finetuned for different speech processing tasks like ASR. Despite these models’ good performance, they suffer from massive structures and a great number of parameters which makes their finetuning inapplicable for low-resource tasks like speech emotion recognition. In this paper, a small model is proposed for speech emotion recognition based on the Hubert model by transferring the Hubert convolutional feature encoder and substituting all of its transformers with a simple conformer block. Then this simple model is trained with emotional speech signals. The experimental results demonstrate that the proposed model has comparable results with other well-performing S3RL models.
لیست مقالات
لیست مقالات بایگانی شده
کنترل فرآیند سیستم های حرارتی بر اساس مدل دو بعدیFMM و رویکرد یادگیری تکرارشونده تطبیقی
سهیلا عابدی - طاهره بینازاده
Heterogeneous Coverage Path Planning For Multi- Agent systems with ACO and GA
Mohammad Hasan Jalili Bahabadi - ََAmir Mahdavi - Saeed Khankalantary
ZnO-based Acoustofluidics: Droplet-based Particle Manipulation
Sara Abbasi - Behdad Barahimi - Sara Darbari - Mohammad Kazem Moravvej-Farshi - Mohammad Zabetian
Uneven Illumination Correction in Whole Slide Imaging using Pix2Pix
Sama Nemati - Hasti Shabani
Design and Control of a Novel Multi-port Bidirectional Buck-Boost Converter Suitable for Hybrid Electric Vehicle Charging Stations
Amir Safaeinasab - Homayon Soltani Gohari - Karim Abbaszadeh
A Geometry-based Approach to Reduce the Quantization lobe in 1-bit Reconfigurable Intelligent Surfaces
Nima Ahmadi - Forouhar Farzaneh
Ground-based Power Line Sag Measurement by Combining Data from a Smartphone and a Laser Rangefinder
Mohammad Javad Abdollahifard - Reza Bahrami
Counterintuitive Benefits of Time Window Constraints: Enhancing Cost Efficiency in Vehicle Routing Problems
Mehdi Alimohammadi - Saeedeh Rezaee - Nasser Motahari Farimani - Mohammad Reza Akbarzadeh Totonchi
Privacy-Preserving Learning using Autoencoder-based Structure
Mohammad Ali Jamshidi - Hadi Veisi - Mohammad Mahdi Mojahedian - Mohammad Reza Aref
Novel Wideband Dual-Polarized Base-Station Antenna
Farzad Alizadeh - Changiz Ghobadi - Javad Nourinia - Keyhan Hosseini - Bahman Mohammadi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.0.4