33rd International Conference on Electrical Engineering
Single-Channel Recursive Speech Separation with Unknown Speaker Count by Mask Estimation
Authors:
Hadi Alizadeh (1), Rahil Mahdian Toroghi (2), Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Instantaneous speech separation, single microphone, unknown speaker count, recursive operation, mask estimation
Abstract:
This paper presents a novel speech separation method that handles an unknown number of speakers with a single, compact model and requires no prior knowledge of the speaker count. The approach trains a speaker-independent, single-channel model with a dedicated objective function, so that separation remains effective across diverse conditions, even when the training and test datasets differ. A robust technique for detecting the number of speakers in a mixture is also introduced; it maintains high performance at minimal computational cost. By separating speakers recursively, the method avoids the main limitation of traditional approaches, which rely on a predefined speaker count, and is therefore more adaptable to real-world scenarios. Evaluations on the WSJ0 dataset show that the proposed model outperforms existing methods in SI-SNR and SDR while using significantly fewer parameters.
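The recursive strategy described in the abstract can be illustrated with a minimal sketch. It is not the authors' model: the trained mask-estimation network is replaced by a hypothetical stand-in, `toy_separate_one`, which applies a binary mask to the single strongest frequency bin, and the stopping rule (residual energy below a fixed fraction of the mixture energy) is an assumption made here for illustration. The function names, the `energy_ratio_stop` threshold, and the `max_speakers` cap are all hypothetical. The SI-SNR helper follows the standard definition of the metric.

```python
import numpy as np


def si_snr_db(estimate, target, eps=1e-12):
    """Scale-invariant SNR in dB between an estimate and a reference signal."""
    estimate = estimate - np.mean(estimate)
    target = target - np.mean(target)
    # Project the estimate onto the target so a gain mismatch is not penalised.
    s_target = np.dot(estimate, target) * target / (np.dot(target, target) + eps)
    e_noise = estimate - s_target
    return 10.0 * np.log10(
        (np.dot(s_target, s_target) + eps) / (np.dot(e_noise, e_noise) + eps)
    )


def toy_separate_one(signal):
    """Hypothetical stand-in for a trained one-vs-rest mask estimator.

    Keeps only the strongest frequency bin as the "speaker" and returns
    everything else as the residual mixture.
    """
    spec = np.fft.rfft(signal)
    mask = np.zeros(spec.shape)
    mask[np.argmax(np.abs(spec))] = 1.0
    speaker = np.fft.irfft(spec * mask, n=len(signal))
    residual = np.fft.irfft(spec * (1.0 - mask), n=len(signal))
    return speaker, residual


def recursive_separate(mixture, separate_one, energy_ratio_stop=0.01, max_speakers=10):
    """Peel off one source per pass until the residual is nearly silent.

    The number of returned sources doubles as the speaker-count estimate.
    The energy-ratio stopping rule is an assumption of this sketch.
    """
    residual = np.asarray(mixture, dtype=float)
    mixture_energy = np.sum(residual ** 2) + 1e-12
    sources = []
    for _ in range(max_speakers):
        if np.sum(residual ** 2) / mixture_energy < energy_ratio_stop:
            break  # nothing meaningful left to separate
        speaker, residual = separate_one(residual)
        sources.append(speaker)
    return sources


if __name__ == "__main__":
    # Three synthetic "speakers": sinusoids on distinct FFT bins.
    n = np.arange(1024)
    refs = [a * np.sin(2 * np.pi * k * n / 1024)
            for a, k in [(1.0, 10), (0.7, 50), (0.5, 120)]]
    estimates = recursive_separate(sum(refs), toy_separate_one)
    print("estimated speaker count:", len(estimates))
    for est, ref in zip(estimates, refs):
        print("SI-SNR (dB):", round(si_snr_db(est, ref), 1))
```

The point of the design is that the same one-vs-rest separator is applied to its own residual, so no model needs to be trained per speaker count. In a real system, `separate_one` would be the learned mask estimator and the stopping decision would come from the paper's speaker-count detector rather than a fixed energy threshold.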