0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
نویسندگان :
Mohammad Hakimkhah
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Speech enhancement،Single microphone،Crossparallel Transformer،Dual branch
چکیده :
In this paper, a dual-branch parallel structure is proposed for single-channel speech enhancement, consisting of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) utilizing a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT effectively captures longterm dependencies along time and frequency axes, extracting time-frequency-related features by integrating their information. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross- Parallel Transformer Neural Network (DB-CPTNN) achieves superior results compared to SOTA models. Specifically, the model attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) benchmarks.
لیست مقالات
لیست مقالات بایگانی شده
An incentive compatible reward sharing approach for shard-based blockchains
Mojdeh Hemati - Mehdi Shajari
Bit error rate improvement in optical camera communication based on RGB LED
Farzaneh Norouzi - Saeed Olyaee - Mehran Mehraban Rad
System Sectioning to Retain Durability of an Inverter-Based Microgrid
Sara Noorollah
Transformer-Based Unsupervised Image Registration using SSIM and Homography Loss for Steady Camera and Aerial Videos
Golnoosh Abdollahinejad - Matin Hashemi
Kernel-Based Embedded Feature Selection for Motor Imagery Based BCI
Mehdi Kamandar
تدوین استراتژی تعمیرات و نگهداری مبتنی بر قابلیت اطمینان در شبکه ی انتقال قدرت
سید سینا طاهری اطاقسرا - مسعود اصغری قراخیلی
Optimization of Fifth Order Band-Pass Ladder Filter and Statistical Analysis of Reverse Problem
Sayyed Ali Alizadeh - Mahmoud Kamarei
Application of Artificial Neural Network on Diagnosing Location and Extent of Disk Space Variations in Transformer Windings Using Frequency Response Analysis
Reza Behkam - Hossein Karami - Mahdi Salay Naderi - Gevork Gharehpetian
Stable Target Tracking in Wireless Sensor Networks Under Malicious Cyber Attacks
Jafar Akhondali - Mohammad Taheri
A Novel Low Torque Ripple Hexagon Biased Flux Doubly Salient Permanent Magnet Motor
Mohammad Amirkhani - Behnam Mohammadian Mosammam - Mojtaba Mirsalim
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.4.2