The 31st International Conference on Electrical Engineering
Speech Emotion Recognition Using Transfer Learning and Self-Supervised Speech Representation Learning
Authors:
Marziye Azad (1), Babak Nasersharif (2)
1- K. N. Toosi University of Technology
2- K. N. Toosi University of Technology
Keywords:
HuBERT, Conformer, self-supervised learning, speech emotion recognition
Abstract:
Self-supervised speech representation learning (S3RL) models such as wav2vec 2.0, Hidden-unit BERT (HuBERT), and WavLM are trained on large amounts of speech data and yield general-purpose speech representations, which are then fine-tuned for downstream speech processing tasks such as automatic speech recognition (ASR). Despite their good performance, these models have large architectures and a great number of parameters, which makes fine-tuning them impractical for low-resource tasks such as speech emotion recognition. In this paper, a small model for speech emotion recognition is proposed based on HuBERT: the HuBERT convolutional feature encoder is transferred, and all of its transformer layers are replaced with a simple Conformer block. This small model is then trained on emotional speech signals. The experimental results show that the proposed model achieves results comparable to those of other well-performing S3RL models.
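To give a rough sense of what the transferred convolutional feature encoder produces, the following is a minimal sketch, not the authors' implementation. It assumes the layer layout commonly used for HuBERT Base (seven 1-D convolutions with 512 channels, the kernel widths and strides listed below, no padding) and 16 kHz input audio. None of these values is stated in the abstract, and the function names are hypothetical.

```python
# Assumed HuBERT Base feature-encoder layout: (kernel width, stride) per layer.
# These values are an assumption; the abstract does not state them.
CONV_LAYERS = [(10, 5), (3, 2), (3, 2), (3, 2), (3, 2), (2, 2), (2, 2)]
CHANNELS = 512        # assumed channel width of every convolutional layer
SAMPLE_RATE = 16_000  # assumed input sampling rate in Hz


def num_frames(num_samples: int) -> int:
    """Number of feature frames produced for a waveform of num_samples samples."""
    length = num_samples
    for kernel, stride in CONV_LAYERS:
        if length < kernel:
            return 0  # input too short to pass through this layer
        # Output length of an unpadded 1-D convolution.
        length = (length - kernel) // stride + 1
    return length


def total_stride() -> int:
    """Overall downsampling factor: the product of all layer strides."""
    total = 1
    for _, stride in CONV_LAYERS:
        total *= stride
    return total


def conv_weight_count() -> int:
    """Convolution weights only (biases and normalization parameters excluded)."""
    count, in_channels = 0, 1  # the raw waveform has a single channel
    for kernel, _ in CONV_LAYERS:
        count += in_channels * CHANNELS * kernel
        in_channels = CHANNELS
    return count


if __name__ == "__main__":
    print(num_frames(SAMPLE_RATE))  # frames for one second of audio
    print(total_stride())           # input samples per frame hop
    print(conv_weight_count())      # size of the transferred encoder
```

Under these assumptions, one second of audio yields 49 frames with a hop of 320 samples (20 ms), and the encoder holds roughly 4.2 million convolution weights. A sequence of this length is what the Conformer block would operate on in place of the original transformer stack.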