0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
نویسندگان :
Mohammad Hakimkhah
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Speech enhancement،Single microphone،Crossparallel Transformer،Dual branch
چکیده :
In this paper, a dual-branch parallel structure is proposed for single-channel speech enhancement, consisting of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) utilizing a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT effectively captures longterm dependencies along time and frequency axes, extracting time-frequency-related features by integrating their information. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross- Parallel Transformer Neural Network (DB-CPTNN) achieves superior results compared to SOTA models. Specifically, the model attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) benchmarks.
لیست مقالات
لیست مقالات بایگانی شده
Optimal Path Planning of Mobile Robots using IsoCost-Based Dynamic Programming
Fatemeh Alvankarian - Ahmad Kalhor - Mehdi Tale Masouleh
Impedance Evaluation of Plasmonic Nano Dipole Antennas Based on Guided TE Mode
Daniyal Khosh Maram - Hanieh Talati Aghdam - Hamed Abnavi
Secrecy Sum Rate Analysis and Power Allocation with OSTBC and Artificial Noise for MIMO Systems
Abdolrasoul Sakhaei Gharagezlou - Mahdi Nangir - Nima Imani - Amir Poorfaraj Liqvan
Stability Analysis of a New Switched SEIAR-Vac-Iso Epidemic Model for the COVID-19
Amir Hossein Amiri Mehra - Mohsen Shafieirad - Zohreh Abbasi - Iman Zamani
تحلیل ارتباطات موثر و عملکردی سیگنالهای فیزیولوژیکی راننده جهت بهبود تشخیص حواس پرتی
نیلوفر وثوق - زهرا بهمنی دهکردی - امین محمدیان
Control of a Wheeled Robot in the Presence of Wheels Sliding Using Robust Adaptive Control in Differential Game Format
Alireza Azimi - Roya Amjadifard - Aliakbar Ghasemzadeh
LSTM and Markov-Based Mobility Prediction for Multi-access Edge Computing
Hadi Ghavaminejad - Nasser Yazdani - Golboo Rashidi
تخمین پارامتر سریهای زمانی دو بعدی چند متغیره گسسته
مرضیه بهمنی - محسن شفیعیراد - مهدی زینالی - احسان ناظمالرعایا
Wake-Sleep Learning in R-STDP-Based Spiking Neural Networks to Avoid Catastrophic Forgetting
Mehrdad Baradaran - Katayoon Kobraei - Saeed Reza Kheradpisheh
IRS-aided NOMA in a Cell Free Massive MIMO System
Anahid Rafieifar - Hosein Ahmadinejad - Abolfazl Falahati
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0