33rd International Conference on Electrical Engineering
Single-Channel Recursive Speech Separation with Unknown Speaker Count by Mask Estimation
Authors:
Hadi Alizadeh (1)
Rahil Mahdian Toroghi (2)
Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Instantaneous speech separation, single microphone, unknown speaker count, recursive operation, mask estimation
Abstract:
This paper presents a novel speech separation method capable of handling an unknown number of speakers using a single, compact model, eliminating the need for prior knowledge of speaker count. The proposed approach employs a unique objective function to train a speaker-independent, single-channel model, enabling effective separation across diverse conditions, even when training and testing datasets differ. Additionally, a robust technique for detecting the number of speakers in a mixture is introduced, ensuring high performance with minimal computational complexity. By employing a recursive separation strategy, the method addresses the limitations of traditional approaches reliant on predefined speaker counts, making it more adaptable to real-world scenarios. Evaluations on the WSJ0 dataset demonstrate the proposed model's superiority in SI-SNR and SDR metrics while achieving a significantly lower parameter count compared to existing methods.
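The recursive strategy and the SI-SNR metric mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the model, the objective function, or the speaker-count detector, so `extract_one` (a stand-in for a mask-estimating model that returns one speaker and the residual), the energy-ratio stopping rule, and the parameters `max_speakers` and `stop_ratio` are all hypothetical. The `si_snr` function follows the commonly used definition of scale-invariant signal-to-noise ratio.

```python
import math
from typing import Callable, List, Sequence, Tuple

Signal = List[float]


def si_snr(estimate: Sequence[float], target: Sequence[float], eps: float = 1e-8) -> float:
    """Scale-invariant SNR in dB (common definition; signals of equal length)."""
    n = len(target)
    # Remove the means so constant offsets do not affect the score.
    e_mean = sum(estimate) / n
    t_mean = sum(target) / n
    e = [x - e_mean for x in estimate]
    t = [x - t_mean for x in target]
    # Project the estimate onto the target to obtain the scaled target part.
    dot = sum(a * b for a, b in zip(e, t))
    t_energy = sum(b * b for b in t) + eps
    s_target = [dot / t_energy * b for b in t]
    e_noise = [a - s for a, s in zip(e, s_target)]
    num = sum(s * s for s in s_target)
    den = sum(x * x for x in e_noise) + eps
    return 10.0 * math.log10(num / den + eps)


def separate_recursively(
    mixture: Sequence[float],
    extract_one: Callable[[Signal], Tuple[Signal, Signal]],
    max_speakers: int = 10,
    stop_ratio: float = 0.01,
) -> List[Signal]:
    """Peel off one speaker per pass until the residual is nearly silent.

    ASSUMPTION: the stopping rule (residual energy relative to the mixture
    energy) is a placeholder for the paper's speaker-count detector.
    """
    speakers: List[Signal] = []
    residual: Signal = list(mixture)
    total_energy = sum(x * x for x in mixture) + 1e-8
    for _ in range(max_speakers):
        residual_energy = sum(x * x for x in residual)
        if residual_energy / total_energy < stop_ratio:
            break  # nothing left that looks like another speaker
        speaker, residual = extract_one(residual)
        speakers.append(speaker)
    return speakers
```

In this sketch the estimated speaker count is simply `len(separate_recursively(...))`, so no count has to be supplied in advance; a real system would replace the energy threshold with a learned or otherwise validated criterion.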