0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
نویسندگان :
Mohammad Hakimkhah
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Speech enhancement،Single microphone،Crossparallel Transformer،Dual branch
چکیده :
In this paper, a dual-branch parallel structure is proposed for single-channel speech enhancement, consisting of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) utilizing a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT effectively captures longterm dependencies along time and frequency axes, extracting time-frequency-related features by integrating their information. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross- Parallel Transformer Neural Network (DB-CPTNN) achieves superior results compared to SOTA models. Specifically, the model attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) benchmarks.
لیست مقالات
لیست مقالات بایگانی شده
Enhancing the Incident Angle Band in Carpet Cloaking using Deep Neural Networks
Amirhossein Fallah - Leila Yousefi - Ahmad Kalhor
Deception Attack Detection and Resilient Control in Platoon of Smart Vehicles
Hassan Mokari - Elnaz Firouzmand - Iman Sharifi - Ali Doustmohammadi
A Band-pass Power Divider Based on Substrate Integrated Plasmonic Waveguide
Salma Mirhadi - Shamsi Soleimani
Virtual power plant participation in day-ahead and futures markets with a deep learning approach
Farzin Ghasemi Olanlari - Mohammad Fazel Dehghanniri - Turaj Amraee
The Effect of Cavity Length on Two-State Quantum Dot Laser Performance
Gholamreza Babaabasi - Mohammad Mohsen Sheikhey - Sara Alaei
Inversion Coefficient as a Key Design Parameter in MOS Device Performance
Gholamreza Khademevatan - Ali Jalali
Bidirectional Isolated DC/DC Dual-Active-Bridge Converters Optimum Soft-Switching Control Method for Electrical Vehicle Applications
Shokoufeh Valadkhani - Mojtaba Mirsalim - Gevork B. Gharehpetian
تولید ریزداپلر راداری بدن انسان با استفاده از آموزش شبکه مولد متقابل کانولوشنال عمیق
مهدی استوان - صادق صمدی - علیرضا کاظمی
Classification of Schizophrenia Patients by Nonlinear Analysis of EEG
Amirhossein Tajik - Hoda Jalalkamali - Hossein Nezamabadipour
Performance Analysis of an UAV-assisted cognitive D2D communication-based Disaster Response Network
Hossein Mohammadi Firozjae - Javad Zeraatkar Moghaddam - Mehrdad Ardebilipour
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.8.0