33rd International Conference on Electrical Engineering
Single-Channel Recursive Speech Separation with Unknown Speaker Count by Mask Estimation
Authors:
Hadi Alizadeh (1), Rahil Mahdian Toroghi (2), Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Instantaneous speech separation, single microphone, unknown speaker count, recursive operation, mask estimation
Abstract:
This paper presents a speech separation method that handles an unknown number of speakers with a single, compact model and requires no prior knowledge of the speaker count. The proposed approach uses a dedicated objective function to train a speaker-independent, single-channel model that separates effectively across diverse conditions, even when the training and testing datasets differ. A robust technique for detecting the number of speakers in a mixture is also introduced; it maintains high performance at low computational cost. Because the method separates speakers recursively, it avoids the main limitation of traditional approaches, which rely on a predefined speaker count, and is therefore better suited to real-world scenarios. Evaluations on the WSJ0 dataset show that the proposed model outperforms existing methods in SI-SNR and SDR while using significantly fewer parameters.
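The recursive strategy and the SI-SNR metric mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. `si_snr` follows the commonly used definition of scale-invariant SNR. `recursive_separate`, its `separate_one` argument (a hypothetical stand-in for the trained mask-estimation model), and the energy-based stopping rule are assumptions made purely for illustration; the abstract does not describe the actual speaker-count detector.

```python
import numpy as np


def si_snr(estimate, target, eps=1e-8):
    """Scale-invariant signal-to-noise ratio in dB for two 1-D signals."""
    estimate = np.asarray(estimate, dtype=float)
    target = np.asarray(target, dtype=float)
    # Remove the mean so the metric ignores any DC offset.
    estimate = estimate - estimate.mean()
    target = target - target.mean()
    # Project the estimate onto the target to get the scaled reference.
    scale = np.dot(estimate, target) / (np.dot(target, target) + eps)
    s_target = scale * target
    e_noise = estimate - s_target
    ratio = np.dot(s_target, s_target) / (np.dot(e_noise, e_noise) + eps)
    return 10.0 * np.log10(ratio + eps)


def recursive_separate(mixture, separate_one, energy_ratio=1e-3, max_speakers=10):
    """Peel off one speaker per step until the residual is nearly silent.

    `separate_one` is a hypothetical callable standing in for a trained
    model: it takes a signal and returns (speaker_estimate, residual).
    The energy-based stopping rule is an illustrative assumption, not the
    speaker-count detector proposed in the paper.
    """
    mixture = np.asarray(mixture, dtype=float)
    mix_energy = np.sum(mixture ** 2) + 1e-12
    speakers = []
    residual = mixture
    for _ in range(max_speakers):
        # Stop once the residual holds a negligible share of the energy.
        if np.sum(residual ** 2) / mix_energy < energy_ratio:
            break
        speaker, residual = separate_one(residual)
        speakers.append(speaker)
    return speakers, residual
```

In this sketch the number of speakers is simply the length of the returned list, so the speaker count never has to be fixed in advance. A real system would replace the energy threshold with a learned or otherwise validated stopping criterion.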