0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
نویسندگان :
Mohammad Hakimkhah
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Speech enhancement،Single microphone،Crossparallel Transformer،Dual branch
چکیده :
In this paper, a dual-branch parallel structure is proposed for single-channel speech enhancement, consisting of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) utilizing a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT effectively captures longterm dependencies along time and frequency axes, extracting time-frequency-related features by integrating their information. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross- Parallel Transformer Neural Network (DB-CPTNN) achieves superior results compared to SOTA models. Specifically, the model attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) benchmarks.
لیست مقالات
لیست مقالات بایگانی شده
Hybrid-Excited, Variable-Flux, and Inter-Modular Biased-Flux Motors: A Comparative Analysis
Mohammad Amirkhani - Ehsan Farmahini Farahani - Alireza Eikani - Mojtaba Mirsalim - Javad Shokrollahi Moghani
Design of an Optical Current Transformer for High-Voltage Gas-Insulated Switchgear-Part I: Focus on Optical Sensor Design
Reza Babaei - Asghar Akbari - Arash Moradi
Impacts of Various Wind Turbine Generators on Transient Recovery Voltage in a Medium Voltage Power Network
Mostafa Heydari - Ali Asghar Razi-Kazemi
Multi-agent H-Learning Based Cooperative Spectrum Sensing for Cognitive Radio Networks
Elaheh Karimpour Fard - Mahdi Nouri - Hamid Behroozi - Sima Sobhi-Givi
بهبود عملکرد یک ( LOC ) Lab – On –Chipپیشرفته مبتنی بر فناوری MEMSبه کمک تقویت میدان الکتریکی ساختار
شیوا عظیمی نام - فهیمه مروی - کیان جعفری
Two-Stage Stochastic Modeling for Energymnagement and Control of Virtual Power Plants: Addressing Renewable Energy Challenges
Mohammadreza Mousavi Khademi - Mehdi Zareian Jahromi
Underwater Image Quality Assessment via Color and Contrast Analysis
Meysam Ghalyani - Maryam Karimi
SWOT Analysis of the Mega Constellation Technology and Satellite Internet
Mohammad Bod - Parvin Sojoodi - Leila Mohammadi
Multi-Agent Systems for Quadcopter under Nonlinear Dynamics and Actuator Modeling with MPC and LQR Controller
Navid Mohammadi - Saeed Khankalantary
Error Probability Analysis of Non-Orthogonal Multiple Access
Rozita Shafie - AliAkbar Tadaion - Zolfa Zeinalpour-Yazdi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0