0% Complete
صفحه اصلی
/
سی و یکمین کنفرانس بین المللی مهندسی برق
An Iterative Post-processing Method for Speech Source Separation in Realistic Scenarios
نویسندگان :
Iman Shahriari
1
Hossein Zeinali
2
1- Amirkabir University of Technology (Tehran Polytechnic)
2- Amirkabir University of Technology (Tehran Polytechnic)
کلمات کلیدی :
Speech Source Separation،Speaker Embedding،Deep Learning
چکیده :
The purpose of this paper is to design a speaker-independent Blind Source Separation (BSS) system which aims to reduce the word error rate (WER) metric on Persian speech data. The main idea behind this method is that it tries to improve the quality of the output of any baseline separation system with the help of an iterative model by removing the remained parts of the interferer speaker from each source. For this purpose, we use embedded representations of the input speech signals. In addition, our system benefits from a convergence metric that aims to purify the output signals. To evaluate the proposed method, we have collected a dataset that contains about one hour of real phone calls from landline phones. Although most of the energy of some consonant phonemes appears in high-frequency bins which are filtered in telephony speeches, our method can handle this condition by properly removing the interference. Experimental results based on different metrics have proved the effectiveness of the proposed method.
لیست مقالات
لیست مقالات بایگانی شده
Transfer learning using deep convolutional neural network for predicting dementia severity
Vahid Asayesh - Mehdi Dehghani - Majid Torabi Nikjeh - Sepideh Akhtari khosrowshahi
Study of Multiple Teeth Linear Switched and Hybrid Reluctance Motors
Mohammad Amin Jalali Kondelaji - Ali Ghaffarpour - Mojtaba Mirsalim
Design and Implementation of a Modular ROS-based Mobile Robot With Hierarchical Control
Erfan Riazati - Arian Hajizadeh - Seyed Majid Esmailzadeh
Interval-Based Setting Approach for Distance Relays Considering Uncertainties Using Monte Carlo Simulation
Abolfazl Hadadi - Mohammad Javad Jalilian - Behrooz Vahidi - Gholam Hossein Riahy Dehkordi
A Novel method for power transmission lines Protection Against the Sub-Synchronous Resonance Using thyristor-based reactive power compensation
Mohammadreza Mousavi Khademi - Mehdi Zareian Jahromi
A Novel Analytical Tuning Method for Designing of Composite Nonlinear Feedback Control Law in Continuous-time Dynamical Systems
Ali Vazani - Valiollah Ghaffari
Joint Space Control of a Deployable Cable Driven Parallel Robot with Redundant Actuators
S. Ahmad Khalilpour - Ali Hassani - Rohollah Khorambakht - A.R. Zahedi - Abbas Bataleblu - Hamid D. Taghirad
Error Probability Analysis of Non-Orthogonal Multiple Access
Rozita Shafie - AliAkbar Tadaion - Zolfa Zeinalpour-Yazdi
CT Super-Resolution Using Arbitrary Scale Diffusion Model
Mahsa Nadafi Ghahnavieh - Saeed Masoudnia - Hamid Soltanian-Zadeh
HyperSpectral Image Classification using a 3D Convolutional Mixer Block
Sara Dianat - Mehran Yazdi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.3.2