0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
نویسندگان :
Mohammad Hakimkhah
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Speech enhancement،Single microphone،Crossparallel Transformer،Dual branch
چکیده :
In this paper, a dual-branch parallel structure is proposed for single-channel speech enhancement, consisting of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) utilizing a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT effectively captures longterm dependencies along time and frequency axes, extracting time-frequency-related features by integrating their information. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross- Parallel Transformer Neural Network (DB-CPTNN) achieves superior results compared to SOTA models. Specifically, the model attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) benchmarks.
لیست مقالات
لیست مقالات بایگانی شده
Evaluation of Different Connectivity Methods for Obsessive Compulsive Disorder Diagnosis
Samandokht Rashidi - Amin Abdipourasl - Fatemeh Jamaloo - Reza Rostami
Joint User Association and UAV Location Optimization for Two-Tired Visible Light Communication Networks
Alireza Qazavi - Foroogh Sadat Tabataba - Mehdi Naderi Soorki
Enhanced the Droop Approach MMC-Based in AC Microgrids
Amirhossein Fallah Bagheri - Hamid Reza Baghaee - Ali Yazdian Varjani - Kourosh Khalaj Monfared - Reza Alizadeh
Smartly, reduce the latency of high-priority vehicles using IoT technology
Mahdi Talebi - Masoud Sabaei
Improved Model Predictive Control for the Three-Phase Grid-Connected Split-Source Inverter
Seyed Hamid Montazeri - Jafar Milimonfared - MohammadReza Zolghadri
Non-pharmacological interventions for Covid-19 new variants with fractional order fuzzy type-2 PID
Hadi Delavari - Amir Veisi - Maryam Ranjbaran
Transfer learning using deep convolutional neural network for predicting dementia severity
Vahid Asayesh - Mehdi Dehghani - Majid Torabi Nikjeh - Sepideh Akhtari khosrowshahi
A new double rotor switched reluctance motor aiming at average torque improvement
Reza Rezaei - Seyed Reza Mousavi Aghdam
بکارگیری یادگیری عمیق در ارزیابی به هنگام پایداری ولتاژ کوتاه مدت با استفاده از داده های اندازه گیری فازوری
امیرحسین باباعلی - محمدتقی عاملی
40Hz Auditory Entrainment Promotes Synchronization Between Frontal and Parietal Regions of the Brain
Mojtaba Lahijanian - Hamid Aghajan
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.8.0