0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Single-Channel Recursive Speech Separation with Unknown Speaker Count by Mask Estimation
نویسندگان :
Hadi Alizadeh
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Instantaneous Speech separation،single microphone،unknown speaker count،recursive operation،Mask estimation
چکیده :
This paper presents a novel speech separation method capable of handling an unknown number of speakers using a single, compact model, eliminating the need for prior knowledge of speaker count. The proposed approach employs a unique objective function to train a speaker-independent, single-channel model, enabling effective separation across diverse conditions, even when training and testing datasets differ. Additionally, a robust technique for detecting the number of speakers in a mixture is introduced, ensuring high performance with minimal computational complexity. By employing a recursive separation strategy, the method addresses the limitations of traditional approaches reliant on predefined speaker counts, making it more adaptable to real-world scenarios. Evaluations on the WSJ0 dataset demonstrate the proposed model's superiority in SI-SNR and SDR metrics while achieving a significantly lower parameter count compared to existing methods.
لیست مقالات
لیست مقالات بایگانی شده
A Fast Approach for Deep Neural Network Implementation on FPGA
Maedeh Nobari - Hadi Jahanirad
A New High Step-Up Quasi Z-Source DC-DC Converter Using Buffer and Switched Capacitor Techniques
Erfan Meshkizadeh - Ebrahim Afjei - Morteza Kheradmandi
Melanoma Detection Using Multi-Color LBP-FPl and Optimized VGG16
Vida Esmaeili - Mahmood Mohassel Feghhi
Weighted Fuzzy-Based PSNR for Watermark Visual Quality Evaluation
Maedeh Jamali - Nader Karimi - Shadrokh Samavi
Adaptive synchronous switching of uncompensated open transmission lines Realizing the line’s Parameters, and the pre-arcing interval
Alireza Karimonnafs - Mehdi Vakilian
طراحی کنترل کننده امن سیستمهای غیرخطی با استفاده از یادگیری تقویتی و بهینه سازی مجموع مربعات
حسین قلی زاده - احسان رضوی - سجاد پاک خصال - سعید شمقدری
Photonic Crystal-based Plasmonic Biosensor with Low-cost and High-sensitivity Properties
Mahdieh Ahmadi Motlagh - Mahdieh Bozorgi - Mahmood Rafaei-Booket
Flexible Generation Expansion Planning Considering Representative Days of Load and Renewable Variations
Peyman Amirian - Zeinab Maleki - Mohammad-Amin Pourmoosavi - Turaj Amraee
Multi-agent H-Learning Based Cooperative Spectrum Sensing for Cognitive Radio Networks
Elaheh Karimpour Fard - Mahdi Nouri - Hamid Behroozi - Sima Sobhi-Givi
Multi-Attribute Decision-Making Methods to a Cloud Service Providing Selection
Amirhossein Shahbakhsh razavi - Kiumars Javan - Mehdi Zaferanieh - Somayeh Sobati-Moghadam
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.3.2