33rd International Conference on Electrical Engineering
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
Authors:
Mohammad Hakimkhah (1), Rahil Mahdian Toroghi (2), Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Speech enhancement, Single microphone, Cross-parallel Transformer, Dual branch
Abstract:
In this paper, a dual-branch parallel structure is proposed for single-channel speech enhancement. The structure consists of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) and utilizes a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT captures long-term dependencies along the time and frequency axes and integrates their information to extract time-frequency features. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross-Parallel Transformer Neural Network (DB-CPTNN) attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) benchmarks.
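The abstract does not say how the outputs of the two branches are merged. As a minimal sketch only, assuming the additive fusion that is common in dual-branch enhancers, the MMB output can be treated as a real-valued gain applied to the noisy magnitude (keeping the noisy phase), and the CMB output as a complex residual added on top. The function name `combine_branches` and all three arguments are hypothetical and are not taken from the paper.

```python
import cmath


def combine_branches(noisy_spec, magnitude_mask, complex_residual):
    """Fuse hypothetical MMB and CMB outputs over a grid of T-F bins.

    noisy_spec:       rows of complex STFT coefficients of the noisy input.
    magnitude_mask:   same shape, real-valued gains (assumed MMB output).
    complex_residual: same shape, complex corrections (assumed CMB output).
    """
    enhanced = []
    for spec_row, mask_row, res_row in zip(noisy_spec, magnitude_mask, complex_residual):
        out_row = []
        for x, m, r in zip(spec_row, mask_row, res_row):
            # Coarse estimate: masked magnitude combined with the noisy phase.
            coarse = cmath.rect(m * abs(x), cmath.phase(x))
            # Assumed additive fusion: the complex residual adjusts both
            # magnitude and phase of the coarse estimate.
            out_row.append(coarse + r)
        enhanced.append(out_row)
    return enhanced
```

Calling `combine_branches(spec, mask, residual)` with three equally shaped nested lists returns the fused spectrum in the same shape. With an all-zero residual, the result reduces to plain magnitude masking with the noisy phase, which is one way to read the abstract's statement that the CMB "compensates for lost spectral details and implicitly extracts phase information".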