0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
نویسندگان :
Mohammad Hakimkhah
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Speech enhancement،Single microphone،Crossparallel Transformer،Dual branch
چکیده :
In this paper, a dual-branch parallel structure is proposed for single-channel speech enhancement, consisting of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) utilizing a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT effectively captures longterm dependencies along time and frequency axes, extracting time-frequency-related features by integrating their information. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross- Parallel Transformer Neural Network (DB-CPTNN) achieves superior results compared to SOTA models. Specifically, the model attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) benchmarks.
لیست مقالات
لیست مقالات بایگانی شده
Novel Low Power Switch-Count Structure for Medium/High Power Multilevel Inverter
Ali Seifi - Seyed-hossein Hosseini - Mehrdad Tarafdar Hagh
بررسی نامتعادلی در مبدل DC به DC تمامپل شیفت فاز با یکسوکنندهی دوبرابرکنندهی جریان
رضا نرئی - یاسر کریمی - محمدهادی زارع
GAN-Driven Image Generation for Metamaterial Absorbers Using Mean and Variance Encoding
Atefe Shahsavaripour - Mohammad Hossein Badiei - Leila Yousefi - Ahmad Kalhor
A Siamese Neural Network for Predicting snoRNA-Disease Association
Milad Besharatifard - Fatemeh Zare-Mirakabad
Multi-Machine Traction Drive Based on Parallel Connected Synchronous Machines
Hassan Mohammadi Pirouz
Outage and Sum-Rate Analysis for mCAP-NOMA in Visible Light Communication Under Users' Mobility
Amir Oshtoudan - Seyed Mohammad Sajad Sadough
مبدل زمان پیوسته سیگما دلتا با پهنای باند 200k-28M مناسب برای گیرنده های باند پایه3G,4G
فائزه جسور قره باغ - مرتضی موسی زاده
Energy Efficiency Evaluation of a Line-Start Permanent Magnet Assisted Synchronous Reluctance Motor for Pump Application
Ali Jamali-Fard - Mojtaba Mirsalim
Super twisting sliding mode incorporated with USDE for tracking control of nonlinear robotic systems
Ahmadreza Fallahinezhad - Maryam Malekzadeh - Alireza Ariaei
CNN-LSTM model for Confusion Classification; using Single-Channel EEG
Amirhossein Aran - Zahra Ghanbari - Mohammad Hassan Moradi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.8.0