0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Single-Channel Recursive Speech Separation with Unknown Speaker Count by Mask Estimation
نویسندگان :
Hadi Alizadeh
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Instantaneous Speech separation،single microphone،unknown speaker count،recursive operation،Mask estimation
چکیده :
This paper presents a novel speech separation method capable of handling an unknown number of speakers using a single, compact model, eliminating the need for prior knowledge of speaker count. The proposed approach employs a unique objective function to train a speaker-independent, single-channel model, enabling effective separation across diverse conditions, even when training and testing datasets differ. Additionally, a robust technique for detecting the number of speakers in a mixture is introduced, ensuring high performance with minimal computational complexity. By employing a recursive separation strategy, the method addresses the limitations of traditional approaches reliant on predefined speaker counts, making it more adaptable to real-world scenarios. Evaluations on the WSJ0 dataset demonstrate the proposed model's superiority in SI-SNR and SDR metrics while achieving a significantly lower parameter count compared to existing methods.
لیست مقالات
لیست مقالات بایگانی شده
Chaos-Based Physical Layer Security in NOMA Systems
Alireza Mard shoorijeh - Mahmoud Ahmadian Attari
Energy Allocation Methods in NOMA Modulation Using Machine Learning Algorithms in the Presence of Jamming
Khashayar Saremi - Bahareh Akhbari
تخمین نرختنفس با استفاده از ترکیب ویژگیهای سیگنال فوتوپلتیسموگرافی و مدل FCM-ANFIS
علیرضا باغبانی - سیده فاطمه مولایی زاده
A straightforward approach for measuring blood pressure in an upper arm digital blood pressure monitor
Mohammad Soroush Rezaei - Mahdi Khalilzadeh Shabestari - Seyed Yousef Jazaery Farsany - Danial Katoozian - Hossein Hosseini-Nejad
Investigation of Impact Ionization Variations Versus Electric Field and Temperature in Compound Semiconductors for UV-APD Applications
Mohammad hossein Khoddami - Hassan Kaatuzian - Mohammad hossein Asgari
An Iterative Approach to Enhance the Accuracy of TDOA-Based Localization by Averaging and Reducing Noise
Reza Bahrampour - Mohammad Hossein Madani - Hossein Bahramgiri
Multi-objective Optimization of Peer-to-Peer Transactions in Arizona State University’s Microgrid by NSGA II
Pourya Shirinshahrakfard - Amir Abolfazl Suratgar - Mohammad Bagher Menhaj - Gevork B. Gharehpetian
Power Transformer Vibration Study and its Application in Winding Deformation Detection
Amir Esmaeili Nezhad - Mohammad Hamed Samimi
A New Approach to Solve MDVRP in Lower Computation Time
Reza Rahimi Baghbadorani - Mohammad Amin Zajkani - Mohammad Haeri
جابجایی ایمبرت-فدروف نور عبوری از ساختار چندلایه ای حاوی گرافن و دیاکسید وانادیوم
رباب زادجمال سیفی - رضا عبدی قلعه - کاظم جمشیدی قلعه
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.3.1