33rd International Conference on Electrical Engineering
Single-Channel Recursive Speech Separation with Unknown Speaker Count by Mask Estimation
Authors:
Hadi Alizadeh (1), Rahil Mahdian Toroghi (2), Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Instantaneous speech separation, single microphone, unknown speaker count, recursive operation, mask estimation
Abstract:
This paper presents a novel speech separation method capable of handling an unknown number of speakers using a single, compact model, eliminating the need for prior knowledge of speaker count. The proposed approach employs a unique objective function to train a speaker-independent, single-channel model, enabling effective separation across diverse conditions, even when training and testing datasets differ. Additionally, a robust technique for detecting the number of speakers in a mixture is introduced, ensuring high performance with minimal computational complexity. By employing a recursive separation strategy, the method addresses the limitations of traditional approaches reliant on predefined speaker counts, making it more adaptable to real-world scenarios. Evaluations on the WSJ0 dataset demonstrate the proposed model's superiority in SI-SNR and SDR metrics while achieving a significantly lower parameter count compared to existing methods.
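The recursive strategy and the SI-SNR metric mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not describe the model interface, the objective function, or the speaker-count detector, so everything below is an assumption made for illustration: `separate_one` is a hypothetical callable standing in for a trained mask-estimation model, and the residual-energy threshold `energy_ratio` is a simple placeholder stopping rule, not the authors' detection technique. Only `si_snr` follows the standard definition of scale-invariant SNR.

```python
import numpy as np


def si_snr(estimate, target, eps=1e-8):
    """Scale-invariant SNR in dB between two 1-D signals (standard definition)."""
    estimate = estimate - estimate.mean()
    target = target - target.mean()
    # Project the estimate onto the target to obtain the scaled reference.
    s_target = (estimate @ target) / (target @ target + eps) * target
    e_noise = estimate - s_target
    return 10.0 * np.log10((s_target @ s_target + eps) / (e_noise @ e_noise + eps))


def recursive_separate(mixture, separate_one, energy_ratio=1e-3, max_speakers=10):
    """Peel off one speaker per step until the residual is nearly silent.

    separate_one: hypothetical callable, residual -> (speaker, new_residual).
    energy_ratio: assumed stopping threshold on residual energy relative to
                  the mixture energy (a placeholder, not the paper's detector).
    max_speakers: assumed safety cap on the number of recursion steps.
    """
    mix_energy = float(mixture @ mixture) + 1e-12
    residual = mixture
    speakers = []
    while (len(speakers) < max_speakers
           and float(residual @ residual) / mix_energy > energy_ratio):
        speaker, residual = separate_one(residual)
        speakers.append(speaker)
    return speakers
```

In this sketch the number of speakers is simply the length of the returned list, which is how a recursive scheme can avoid fixing the speaker count in advance. A real system would replace the energy threshold with a learned or more robust stopping criterion.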