31st International Conference on Electrical Engineering
An Iterative Post-processing Method for Speech Source Separation in Realistic Scenarios
Authors:
Iman Shahriari (1), Hossein Zeinali (2)
1- Amirkabir University of Technology (Tehran Polytechnic)
2- Amirkabir University of Technology (Tehran Polytechnic)
Keywords:
Speech Source Separation, Speaker Embedding, Deep Learning
Abstract:
This paper presents a speaker-independent Blind Source Separation (BSS) system that aims to reduce the word error rate (WER) on Persian speech data. The main idea is to improve the output quality of any baseline separation system with an iterative model that removes the residual parts of the interfering speaker from each source. For this purpose, we use embedded representations of the input speech signals. In addition, our system benefits from a convergence metric that aims to purify the output signals. To evaluate the proposed method, we collected a dataset containing about one hour of real phone calls from landline phones. Although most of the energy of some consonant phonemes lies in high-frequency bins, which are filtered out in telephone speech, our method can handle this condition by properly removing the interference. Experimental results on several metrics demonstrate the effectiveness of the proposed method.
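The abstract gives no implementation details, so the following is only a toy illustration of the general pattern it describes: take the two outputs of a baseline separator, repeatedly suppress what remains of the other speaker in each, and stop once a convergence metric is satisfied. It is not the authors' model. Iterative soft (Wiener-like) spectral masking stands in for the learned iterative model, the speaker embeddings are not reproduced, and the names `refine`, `crosstalk`, `max_iters`, and `tol` are hypothetical.

```python
import numpy as np

def crosstalk(spec_a, spec_b):
    """Cosine similarity of two magnitude spectra (assumed convergence metric).

    Values near 0 mean the two estimates no longer share spectral content.
    """
    mag_a, mag_b = np.abs(spec_a), np.abs(spec_b)
    denom = np.linalg.norm(mag_a) * np.linalg.norm(mag_b) + 1e-12
    return float(np.sum(mag_a * mag_b) / denom)

def refine(est1, est2, max_iters=20, tol=1e-3):
    """Iteratively suppress the interferer in each estimate (hypothetical helper).

    est1, est2: equal-length 1-D arrays produced by some baseline separator.
    Returns the refined signals and the metric value after each iteration.
    """
    n = len(est1)
    spec1, spec2 = np.fft.rfft(est1), np.fft.rfft(est2)
    history = [crosstalk(spec1, spec2)]
    for _ in range(max_iters):
        # Soft mask: each bin is weighted by that estimate's share of the power.
        p1, p2 = np.abs(spec1) ** 2, np.abs(spec2) ** 2
        total = p1 + p2 + 1e-12
        spec1, spec2 = spec1 * (p1 / total), spec2 * (p2 / total)
        history.append(crosstalk(spec1, spec2))
        if history[-1] < tol:  # stop once the estimates barely overlap
            break
    return np.fft.irfft(spec1, n), np.fft.irfft(spec2, n), history

# Toy usage: two tones that leak into each other's estimate.
t = np.arange(1024)
a = np.sin(2 * np.pi * 50 * t / 1024)
b = np.sin(2 * np.pi * 120 * t / 1024)
out1, out2, history = refine(a + 0.3 * b, b + 0.3 * a)
```

This toy masks a single whole-signal spectrum, which works only because the two tones occupy different frequency bins. Real speech overlaps in frequency, so a practical version would mask short-time frames, and soft masking also slightly attenuates the target itself.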
List of papers
List of archived papers
Experimental Study of Pick and Place Operation for Packaging Using Delta Parallel Robot with Two-Fingered Gripper
Mona Mohades Mojtahedi - Arvin Mohammadi - Mehdi Tale Masouleh
Modeling, estimation, and model predictive control for Covid-19 pandemic with finite security duration vaccine
Abolfazl Delavar - Reza Rahimi Baghbadorani
HyperSpectral Image Classification using a 3D Convolutional Mixer Block
Sara Dianat - Mehran Yazdi
Utilizing ESSs in the DC Section of a Solid-State Transformer to Improve the Power Quality of the Electrical Grid
یوسف عطائی - رضا قندهاری - مهدی بابائی - بهنام بهارلوئی
Stability Analysis of Singular 2-D Positive systems
Mahmoud Zamani - Masoud Shafiee - Iman Zamani
Forecasting Tehran Stock Exchange Trend with Time Series Analysis, Fundamental Data, and Sentiment Analysis in News
Mahdi Shamisavi - Amir Jahanshahi
Efficient Full Adders for Approximate Arithmetic Units in the Image Processing Applications
Bahram Rashidi
Fabrication, Simulation and Modeling of a T-Shaped Coaxial Stub Resonator
Abolfazl Ebrahimpour - Sepehr Sahab - Javad Shokri Seyyedi - Younes Sahranavard - Gholamreza Moradi
Investigation of the Effect of Optical Feedback on the Dynamic Characteristics of Silicon Mode-Locked Lasers
محمد شکرپور - محمد حسن یاوری
Ultra-broadband and compact beamsplitters using subwavelength-grating-assisted zero gap directional couplers
Kamalodin Arik - Mahmood Akbari - Amin Khavasi