31st International Conference on Electrical Engineering
An Iterative Post-processing Method for Speech Source Separation in Realistic Scenarios
Authors:
Iman Shahriari (1), Hossein Zeinali (2)
1- Amirkabir University of Technology (Tehran Polytechnic)
2- Amirkabir University of Technology (Tehran Polytechnic)
Keywords:
Speech Source Separation, Speaker Embedding, Deep Learning
Abstract:
This paper presents a speaker-independent Blind Source Separation (BSS) system that aims to reduce the word error rate (WER) on Persian speech data. The main idea is to improve the output of any baseline separation system with an iterative model that removes the residual parts of the interfering speaker from each source. For this purpose, we use embedded representations of the input speech signals. In addition, our system benefits from a convergence metric that aims to purify the output signals. To evaluate the proposed method, we collected a dataset containing about one hour of real phone calls from landline phones. Although most of the energy of some consonant phonemes lies in high-frequency bins that are filtered out in telephone speech, our method handles this condition by properly removing the interference. Experimental results on several metrics demonstrate the effectiveness of the proposed method.
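For readers unfamiliar with the WER metric named in the abstract, the following is a minimal sketch of the standard definition: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a recognizer hypothesis, divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration; the abstract does not say which scoring tool or text normalization the authors used.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: word-level edit distance divided by reference length.

    Illustrative only; tokenization by whitespace is an assumption.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions are counted, WER can exceed 1.0 when the hypothesis is much longer than the reference. Residual speech from an interfering speaker can show up as exactly this kind of inserted or substituted word.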