33rd International Conference on Electrical Engineering
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
Authors:
Mohammad Hakimkhah (1)
Rahil Mahdian Toroghi (2)
Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Speech enhancement, Single microphone, Cross-Parallel Transformer, Dual branch
Abstract:
In this paper, a dual-branch parallel structure is proposed for single-channel speech enhancement. The structure consists of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) and uses a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT captures long-term dependencies along the time and frequency axes and extracts time-frequency features by integrating the information from both. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross-Parallel Transformer Neural Network (DB-CPTNN) attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) benchmarks.
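The abstract does not state how the outputs of the two branches are merged. As a rough illustration only, the sketch below assumes a fusion rule common in dual-branch enhancement models: the magnitude mask is applied to the noisy magnitude and paired with the noisy phase, and the complex branch output is added as a correction. The function name `combine_branches`, its arguments, and the additive fusion are all assumptions for illustration, not the authors' published method.

```python
import numpy as np

def combine_branches(noisy_spec, magnitude_mask, complex_residual):
    """Fuse a magnitude-mask estimate with a complex-valued correction.

    Assumed (hypothetical) inputs, all of shape (freq_bins, frames):
      noisy_spec       -- complex STFT of the noisy signal
      magnitude_mask   -- real-valued mask, standing in for the MMB output
      complex_residual -- complex correction, standing in for the CMB output
    """
    noisy_mag = np.abs(noisy_spec)
    noisy_phase = np.angle(noisy_spec)
    # Coarse estimate: masked magnitude paired with the unmodified noisy phase.
    coarse = magnitude_mask * noisy_mag * np.exp(1j * noisy_phase)
    # Refinement: the complex correction can change both magnitude and phase.
    return coarse + complex_residual

# Toy usage with random data in place of real network outputs.
rng = np.random.default_rng(0)
shape = (4, 5)
spec = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
mask = rng.uniform(0.0, 1.0, shape)
residual = 0.1 * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))
enhanced = combine_branches(spec, mask, residual)
```

Under this assumed rule, a mask of all ones with a zero correction returns the noisy spectrum unchanged, which makes the role of each branch easy to check in isolation.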
List of papers
List of archived papers
Improved Attention U-Net combined with Conditional Random Field for Ischemic Lesion Segmentation from Magnetic Resonance Images
Ali Rezaei - Asieh Khosravanian - Habibollah Danyali - Kamran Kazemi - Ardalan Aarabi
A Transformerless Single-Switch DC-DC Boost Converter Suitable for Renewable Energy Applications
Saed Mahmoud Alilou - Sasan Ahmadi - Mohammad Maalandish - Seyed Hossein Hosseini
A Novel Approach to Pulmonary Embolism Segmentation: Increasing an Attention-based U-Net
Hanie Arabian - Alireza Karimian - Hosein Arabi - Marjan Mansourian
Impacts of Various Wind Turbine Generators on Transient Recovery Voltage in a Medium Voltage Power Network
Mostafa Heydari - Ali Asghar Razi-Kazemi
Defects Dynamics in Multilayer h-BN Resistive Switching Memories: A Molecular Dynamics Investigation
Omid Babaeinejad - Maryam Keshavarz Afshar - Ebrahim Nadimi
Partitioning-based Graph Signal Denoising via Heat Kernel Smoothing
Mohammadreza Fattahi - Hamid Saeedi-Sourck - Vahid Abootalebi
Sampled-data-based Descriptor Observer Design with Aperiodic Measurements for Lithium-ion Batteries in Hybrid Electric Vehicles
Hamid Reza Ahmadzadeh - Masoud Shafiee
Combining the Particle Swarm Optimization Algorithm and the ResNet Convolutional Neural Network for Modeling and Design of Fractal Frequency Selective Surfaces
امین مزروعی آبکنار - مجتبی مداح علی - مرضیه نصیریان
Design and Implementation of a Camera-Based Visible Light Communication System
شادی خسروی - فروغ السادات طباطباء - شهاب الدین رحمانیان
An Iterative Approach to Enhance the Accuracy of TDOA-Based Localization by Averaging and Reducing Noise
Reza Bahrampour - Mohammad Hossein Madani - Hossein Bahramgiri