33rd International Conference on Electrical Engineering
Single-Channel Recursive Speech Separation with Unknown Speaker Count by Mask Estimation
Authors:
Hadi Alizadeh (1), Rahil Mahdian Toroghi (2), Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Instantaneous speech separation, single microphone, unknown speaker count, recursive operation, mask estimation
Abstract:
This paper presents a novel speech separation method capable of handling an unknown number of speakers using a single, compact model, eliminating the need for prior knowledge of speaker count. The proposed approach employs a unique objective function to train a speaker-independent, single-channel model, enabling effective separation across diverse conditions, even when training and testing datasets differ. Additionally, a robust technique for detecting the number of speakers in a mixture is introduced, ensuring high performance with minimal computational complexity. By employing a recursive separation strategy, the method addresses the limitations of traditional approaches reliant on predefined speaker counts, making it more adaptable to real-world scenarios. Evaluations on the WSJ0 dataset demonstrate the proposed model's superiority in SI-SNR and SDR metrics while achieving a significantly lower parameter count compared to existing methods.
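As a rough illustration of two ideas named in the abstract, the sketch below shows the standard SI-SNR metric and a generic recursive separation loop. It is not the authors' implementation. `split_fn` and `stop_fn` are hypothetical placeholders for the mask-estimating model and the speaker-count detector, neither of which is specified in the abstract, and `max_speakers` is an assumed safety limit.

```python
import numpy as np

def si_snr(estimate, target, eps=1e-8):
    """Scale-invariant signal-to-noise ratio in dB for two 1-D signals."""
    # Remove the mean so a constant offset does not affect the score.
    estimate = estimate - estimate.mean()
    target = target - target.mean()
    # Project the estimate onto the target to get the scaled reference.
    scale = np.dot(estimate, target) / (np.dot(target, target) + eps)
    projection = scale * target
    noise = estimate - projection
    ratio = (np.dot(projection, projection) + eps) / (np.dot(noise, noise) + eps)
    return 10.0 * np.log10(ratio)

def recursive_separate(mixture, split_fn, stop_fn, max_speakers=10):
    """Peel off one speaker per step until stop_fn says nothing is left.

    split_fn(signal) -> (speaker, residual)  # hypothetical one-vs-rest model
    stop_fn(signal)  -> bool                 # hypothetical stopping test
    """
    sources = []
    residual = mixture
    for _ in range(max_speakers):  # assumed upper bound, guards against endless loops
        if stop_fn(residual):
            break
        speaker, residual = split_fn(residual)
        sources.append(speaker)
    return sources
```

In this sketch the number of speakers is never passed in: it is simply the number of iterations completed before `stop_fn` fires, which is the general appeal of a recursive strategy for mixtures with an unknown speaker count.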