33rd International Conference on Electrical Engineering
Single-Channel Recursive Speech Separation with Unknown Speaker Count by Mask Estimation
Authors:
Hadi Alizadeh 1
Rahil Mahdian Toroghi 2
Hassan Zareian 3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Instantaneous speech separation, single microphone, unknown speaker count, recursive operation, mask estimation
Abstract:
This paper presents a novel speech separation method that handles an unknown number of speakers with a single, compact model and requires no prior knowledge of the speaker count. The proposed approach trains a speaker-independent, single-channel model with a dedicated objective function, so that separation remains effective across diverse conditions, even when the training and test datasets differ. A robust technique for detecting the number of speakers in a mixture is also introduced; it maintains high performance at low computational complexity. By separating recursively, the method avoids the main limitation of traditional approaches, which rely on a predefined speaker count, and is therefore better suited to real-world scenarios. Evaluations on the WSJ0 dataset show that the proposed model outperforms existing methods in SI-SNR and SDR while using significantly fewer parameters.
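The abstract does not give implementation details, so the following is only a minimal sketch of the two ideas it names: recursive separation (peeling off one speaker at a time until nothing remains) and the SI-SNR metric. The function names `si_snr`, `recursive_separate`, and `toy_separate_one`, the energy-based stopping rule, and the thresholds are assumptions made for illustration; they are not the authors' model, objective function, or speaker-count detector.

```python
import math
from typing import Callable, List, Sequence, Tuple

Signal = List[float]


def si_snr(estimate: Sequence[float], target: Sequence[float], eps: float = 1e-8) -> float:
    """Scale-invariant signal-to-noise ratio in dB (standard definition)."""
    n = len(target)
    # Remove the mean so the metric ignores DC offsets.
    e_mean = sum(estimate) / n
    t_mean = sum(target) / n
    e = [x - e_mean for x in estimate]
    t = [x - t_mean for x in target]
    # Project the estimate onto the target to obtain the scaled reference.
    dot = sum(a * b for a, b in zip(e, t))
    t_energy = sum(b * b for b in t) + eps
    s_target = [dot / t_energy * b for b in t]
    # Everything in the estimate that is not explained by the target is noise.
    noise = [a - s for a, s in zip(e, s_target)]
    num = sum(s * s for s in s_target)
    den = sum(x * x for x in noise) + eps
    return 10.0 * math.log10(num / den + eps)


def recursive_separate(
    mixture: Sequence[float],
    separate_one: Callable[[Signal], Tuple[Signal, Signal]],
    stop_ratio: float = 0.05,   # assumed threshold, not from the paper
    max_speakers: int = 8,      # assumed safety cap, not from the paper
) -> List[Signal]:
    """Extract one speaker per step until the residual is nearly silent.

    The stopping rule (residual energy relative to the mixture energy) is an
    assumed placeholder for the paper's speaker-count detection technique.
    """
    def energy(x: Sequence[float]) -> float:
        return sum(v * v for v in x)

    speakers: List[Signal] = []
    residual = list(mixture)
    total = energy(residual) + 1e-12
    while len(speakers) < max_speakers and energy(residual) / total > stop_ratio:
        speaker, residual = separate_one(residual)
        speakers.append(speaker)
    return speakers


def toy_separate_one(signal: Signal) -> Tuple[Signal, Signal]:
    """Hypothetical stand-in for a trained mask estimator.

    It builds a binary mask that keeps only the strongest sample and returns
    the masked signal as the "speaker" and the complement as the residual.
    """
    k = max(range(len(signal)), key=lambda i: abs(signal[i]))
    mask = [1.0 if i == k else 0.0 for i in range(len(signal))]
    speaker = [m * v for m, v in zip(mask, signal)]
    residual = [(1.0 - m) * v for m, v in zip(mask, signal)]
    return speaker, residual
```

In a real system `separate_one` would be a trained network applied to the mixture; the toy version here only shows the control flow, in which the number of recovered signals is decided by the loop itself rather than fixed in advance.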
List of papers
List of archived papers
Forged Channel: A Breakthrough Approach for Accurate Parkinson's Disease Classification using Leave-One-Subject-Out Cross-Validation
SeyedAmirReza Hamidi - Kamal Mohamed-Pour - Mohammad Yousefi
Vessel Segmentation in Retinal Images Using an Adaptive Thresholding Method Based on Local and Global Information
زهرا نورانی آتشگاه - محمد آراسته - آیدا فولادی وندا
Optimal Placement of Followers Within the Convex Hull of Leaders: A Distributed Subgradient Approach
Seyedeh Mahsa Zakipour Bahambari - Saeed Khankalantary
Outage Analysis of Distributed Relaying NOMA in Cognitive Radio Networks
Zahra Doorbash - Ali Jamshidi
An Uncertain Optimal Factorization of Cooperative Manipulators for Robust Optimal Control Schemes
Neda Nasiri - Ahmad Fakharian - Mohammad Bagher Menhaj
Revolutionizing Energy Efficiency: A Case Study on Self-supply of Electrical Energy in the Mobarake Steel Industry
Mahdi Shadi - Seyed Mohammad Shobeiry - Mohammad Sadegh Ghazizadeh - Hassan Mardani
Magnetic, Electrical, and Thermal Design Considerations for Dry-Type Resin Series Detuned Reactors
مرتضی اسلامیان
True Random Number Generator Relying on Multiple Entropy Source and Triple Oscillator for Cryptography Purposes
Somayeh Gholam Mehraban - Mohsen Jalali - Mostafa Azadbakht
Establishment of a Virtual Power Plant in Grid for Maximizing Producers' Profits and Minimizing Pollutant Emissions and Investment Costs
Amir Hossein Gholami - Amir Abulfazl Suratgar - Mohammad Bagher Menhaj - Mohammad Reza Hesamzadeh
A 1.2GHz wide bandwidth integer-N type-I PLL
Javad Tavakoli - Hossein Yaghobi - Samad Sheikhaei