33rd International Conference on Electrical Engineering
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
Authors:
Mohammad Hakimkhah (1), Rahil Mahdian Toroghi (2), Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Speech enhancement, Single microphone, Cross-Parallel Transformer, Dual branch
Abstract:
This paper proposes a dual-branch parallel structure for single-channel speech enhancement, consisting of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) that use a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT captures long-term dependencies along the time and frequency axes and integrates this information to extract joint time-frequency features. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross-Parallel Transformer Neural Network (DB-CPTNN) attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) models.
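The abstract does not say how the two branch outputs are merged. The sketch below illustrates one possible fusion: the magnitude mask is applied to the noisy magnitude and paired with the noisy phase, and the complex-branch output is added as a correction. The additive fusion, the function name `combine_branches`, and all argument names are assumptions made for illustration. They are not taken from the paper.

```python
import numpy as np

def combine_branches(noisy_spec, mag_mask, complex_residual):
    """Fuse hypothetical MMB and CMB outputs into one enhanced spectrum.

    noisy_spec       : complex array (frames, bins), STFT of the noisy input
    mag_mask         : real array (frames, bins), assumed MMB output
    complex_residual : complex array (frames, bins), assumed CMB output
    """
    noisy_mag = np.abs(noisy_spec)
    noisy_phase = np.angle(noisy_spec)
    # Coarse estimate: masked magnitude combined with the noisy phase.
    coarse = mag_mask * noisy_mag * np.exp(1j * noisy_phase)
    # Refinement (assumed additive): the complex term restores spectral
    # detail and implicitly adjusts the phase.
    return coarse + complex_residual

# Usage with random stand-ins for the network outputs.
rng = np.random.default_rng(0)
shape = (4, 5)  # (frames, frequency bins), arbitrary toy size
noisy = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
mask = rng.uniform(0.0, 1.0, shape)
residual = 0.1 * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))
enhanced = combine_branches(noisy, mask, residual)
```

With an all-ones mask and a zero residual, this fusion returns the noisy spectrum unchanged. This gives a simple sanity check that the two paths are wired correctly.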