0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Single-Channel Recursive Speech Separation with Unknown Speaker Count by Mask Estimation
نویسندگان :
Hadi Alizadeh
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Instantaneous Speech separation،single microphone،unknown speaker count،recursive operation،Mask estimation
چکیده :
This paper presents a novel speech separation method capable of handling an unknown number of speakers using a single, compact model, eliminating the need for prior knowledge of speaker count. The proposed approach employs a unique objective function to train a speaker-independent, single-channel model, enabling effective separation across diverse conditions, even when training and testing datasets differ. Additionally, a robust technique for detecting the number of speakers in a mixture is introduced, ensuring high performance with minimal computational complexity. By employing a recursive separation strategy, the method addresses the limitations of traditional approaches reliant on predefined speaker counts, making it more adaptable to real-world scenarios. Evaluations on the WSJ0 dataset demonstrate the proposed model's superiority in SI-SNR and SDR metrics while achieving a significantly lower parameter count compared to existing methods.
لیست مقالات
لیست مقالات بایگانی شده
Design and Implementation of a TEM Double-ridge Horn Antenna for Ultra-Wideband Applications
Seyed Navid Seyfossadat - Hassan Zakeri - Ahad Tavakoli - Gholamreza Moradi
A Robust Hysteresis-Feedforward Control Approach with High Flexibility for a Single-Inductor Multi-Port DC-DC Converter
Aran Shoaei - Karim Abbaszadeh - Hesamodin Allahyari
Effect of Passivation on the Structural and Electronic Properties of Armchair MoSe2 Nanoribbons: A First-Principles Investigation
ََAmirreza Ghazi - Arash Yazdanpanah
طراحی روش مبتنی بر آنالیز پوش داده برای ارزیابی عملکرد ایستگاه های فوق توزیع و تعیین سطح مطلوب قابلیت اطمینان سیستم توزیع انرژی الکتریکی
محمد رستگار - زهرا یزدانپناه - محمد جوشکی
Φ-OTDR Event Classification Using Machine Learning and Optical Signal Processing
Amir Babaoughli - Tohid Alizadeh - Seyed Sadra Kashef
A Comparison Between PI-PSO, Fuzzy-PID, and Direct Adaptive Fuzzy Controllers for Controlling a Buck-Boost DC-DC Converter with Semi-Quadratic Voltage Gain and Continuous Input Current
Nasim Moradmatak - Seyed Hamid Shahalami
A Wideband PLL with Programmable LC VCO with 5.1 to 7.9GHz Lock Range
Mohsen Azimikia - Arash Esmaili
Low-Leakage 6T SRAM Cell for In-Memory Computing with High Stability
Deniz Najafi - Behzad Ebrahimi
A High Gain Transformerless DC-DC Boost Converter Using LCD Network: Design and Experimental Verification
Hamed Hokmali - Ebrahim Afjei
پیشبینی بازار سرمایه به کمک دادهکاوی با الگوریتمهای رگرسیونی
شیوا نمایان - محمدشهرام معین
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0