0% Complete
صفحه اصلی
/
سی و یکمین کنفرانس بین المللی مهندسی برق
An Iterative Post-processing Method for Speech Source Separation in Realistic Scenarios
نویسندگان :
Iman Shahriari
1
Hossein Zeinali
2
1- Amirkabir University of Technology (Tehran Polytechnic)
2- Amirkabir University of Technology (Tehran Polytechnic)
کلمات کلیدی :
Speech Source Separation،Speaker Embedding،Deep Learning
چکیده :
The purpose of this paper is to design a speaker-independent Blind Source Separation (BSS) system which aims to reduce the word error rate (WER) metric on Persian speech data. The main idea behind this method is that it tries to improve the quality of the output of any baseline separation system with the help of an iterative model by removing the remained parts of the interferer speaker from each source. For this purpose, we use embedded representations of the input speech signals. In addition, our system benefits from a convergence metric that aims to purify the output signals. To evaluate the proposed method, we have collected a dataset that contains about one hour of real phone calls from landline phones. Although most of the energy of some consonant phonemes appears in high-frequency bins which are filtered in telephony speeches, our method can handle this condition by properly removing the interference. Experimental results based on different metrics have proved the effectiveness of the proposed method.
لیست مقالات
لیست مقالات بایگانی شده
Propagation of Measurement Errors in the Euler Kinematic Equations
Mojtaba Fazelinia - Saeed Ebadollahi - Soheil Ganjefar
تشخیص و مقیاس بندی شدت افسردگی براساس روشهای یادگیری ماشین و با استفاده از معیارهای خطی، غیرخطی و آماری محاسبه شده در سیگنالهای الکتروانسفالگرام
پریسا رئوف امامزاده هاشمی - وحید شالچیان - رضا رستمی
Field Effect Phototransistor Based on Thin Film Ag2S Nanocrystals
Hossein Roshan - Mohammad Hossein Sheikhi
Age of Information Optimization for Multi-hop VLC/RF IoT Sensor Networks
Hossein Khodi - Paeiz Azmi - Nader Mokari - Mohammadreza Javan - Hamid Saeedi - Murat Uysal
Reactive Power Compensation in Distribution Grids: An Application of Trinary Cascaded H-bridge Multilevel Inverter
Yousef Neyshabouri - Mohammad Farhadi-Kangarlu
کنترل حرارت مبتنی بر روش LQG در پیل سوختی غشاء پلیمری
احمدرضا ولی - محمدعلی علیرضاپوری - محمدمهدی برزگری
Ultra-wideband RCS Reduction Using Checkerboard Configuration of Bed of Nails
Sadegh Sarjoughian - Mohsen Maddahali - Ahmad Bakhtafrouz
Diagnosis of Heart Diseases based on Processing Heart Sound using Machine Learning
Maryam Moulaverdi - Akbar Ranjbar
A novel CMRR Enhancement technique in fully-differential Class-AB OTAs
Amirhossein Sabour - Mahsa Ramezan Pour - Mohammad Yavari
Gearbox Fault Detection Using Continuous Wavelet Transform and Vision Transformer (ViT)
Ali Asadian - Yassin Riyazi - Moosa Ayati
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.8.0