33rd International Conference on Electrical Engineering
Single-Channel Recursive Speech Separation with Unknown Speaker Count by Mask Estimation
Authors:
Hadi Alizadeh (1)
Rahil Mahdian Toroghi (2)
Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Instantaneous speech separation, single microphone, unknown speaker count, recursive operation, mask estimation
Abstract:
This paper presents a speech separation method that handles an unknown number of speakers with a single, compact model, so the speaker count does not need to be known in advance. The approach trains a speaker-independent, single-channel model with a dedicated objective function, which allows effective separation across diverse conditions, even when the training and testing datasets differ. A robust technique for detecting the number of speakers in a mixture is also introduced; it maintains high performance at low computational cost. Because separation is applied recursively, the method avoids the main limitation of traditional approaches, which rely on a predefined speaker count, and is therefore better suited to real-world scenarios. Evaluations on the WSJ0 dataset show that the proposed model outperforms existing methods in SI-SNR and SDR while using significantly fewer parameters.
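The recursive strategy and the SI-SNR metric mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `split_fn` is a hypothetical stand-in for the trained mask-estimation model, `toy_split` is a toy separator used only for demonstration, and the energy-ratio stopping rule is an assumption in place of the paper's speaker-count detector, whose details the abstract does not give. The `si_snr` function follows the standard definition of scale-invariant signal-to-noise ratio.

```python
import math


def energy(signal):
    """Sum of squared samples."""
    return sum(x * x for x in signal)


def si_snr(estimate, target, eps=1e-8):
    """Scale-invariant SNR in dB between two equal-length 1-D signals."""
    if len(estimate) != len(target):
        raise ValueError("signals must have the same length")
    n = len(target)
    # Remove the mean of each signal so a DC offset does not affect the score.
    mean_e, mean_t = sum(estimate) / n, sum(target) / n
    e = [x - mean_e for x in estimate]
    t = [x - mean_t for x in target]
    # Project the estimate onto the target to obtain the scaled target part.
    scale = sum(a * b for a, b in zip(e, t)) / (energy(t) + eps)
    s_target = [scale * b for b in t]
    e_noise = [a - s for a, s in zip(e, s_target)]
    return 10.0 * math.log10(energy(s_target) / (energy(e_noise) + eps) + eps)


def recursive_separate(mixture, split_fn, max_speakers=10, residual_ratio=0.01):
    """Peel off one speaker per step until the residual is nearly silent.

    split_fn (hypothetical) maps a signal to (one_speaker, residual).
    The stopping rule (residual energy below a fraction of the mixture
    energy) is an assumption, not the detector described in the paper.
    """
    total = energy(mixture)
    sources, residual = [], list(mixture)
    while len(sources) < max_speakers and energy(residual) > residual_ratio * total:
        speaker, residual = split_fn(residual)
        sources.append(speaker)
    return sources


def toy_split(signal):
    """Toy separator: treats each half of the signal as a different speaker."""
    half = len(signal) // 2
    first, second = signal[:half], signal[half:]
    silence_first = [0.0] * half
    silence_second = [0.0] * (len(signal) - half)
    if energy(first) >= energy(second):
        return first + silence_second, silence_first + second
    return silence_first + second, first + silence_second
```

With a real model, `split_fn` would wrap one forward pass that estimates a mask for a single speaker and returns the remainder as the residual. The number of loop iterations then serves as the estimated speaker count, which is why no count has to be supplied up front.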