33rd International Conference on Electrical Engineering
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
Authors:
Mohammad Hakimkhah (1)
Rahil Mahdian Toroghi (2)
Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Speech enhancement, Single microphone, Cross-Parallel Transformer, Dual branch
Abstract:
This paper proposes a dual-branch parallel structure for single-channel speech enhancement. It consists of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB), both of which use a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT captures long-term dependencies along the time and frequency axes and integrates their information to extract time-frequency features. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross-Parallel Transformer Neural Network (DB-CPTNN) attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) models.
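The abstract does not state how the two branch outputs are merged. As a minimal sketch, assuming the additive fusion commonly used in dual-branch enhancers (an assumption, not the authors' confirmed method), the magnitude mask scales the noisy magnitude while reusing the noisy phase, and the complex branch adds a real/imaginary correction. The function names `fuse_bin` and `fuse_spectrogram` and their parameters are hypothetical.

```python
import cmath


def fuse_bin(noisy, mask, residual):
    """Fuse one time-frequency bin (hypothetical additive fusion rule).

    noisy    -- complex STFT coefficient of the noisy speech
    mask     -- real-valued magnitude mask (stand-in for the MMB output)
    residual -- complex correction (stand-in for the CMB output)
    """
    # Magnitude path: scale the noisy magnitude and keep the noisy phase.
    coarse = cmath.rect(mask * abs(noisy), cmath.phase(noisy))
    # Complex path: the residual restores detail and adjusts the phase.
    return coarse + residual


def fuse_spectrogram(noisy_spec, mag_mask, residual_spec):
    """Apply fuse_bin to every bin of equally shaped nested lists."""
    return [
        [fuse_bin(x, m, r) for x, m, r in zip(x_row, m_row, r_row)]
        for x_row, m_row, r_row in zip(noisy_spec, mag_mask, residual_spec)
    ]
```

With a mask of 1 and a zero residual, the fusion returns the noisy spectrum unchanged, so any enhancement comes entirely from what the two branches predict.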