0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
نویسندگان :
Mohammad Hakimkhah
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Speech enhancement،Single microphone،Crossparallel Transformer،Dual branch
چکیده :
In this paper, a dual-branch parallel structure is proposed for single-channel speech enhancement, consisting of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) utilizing a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT effectively captures longterm dependencies along time and frequency axes, extracting time-frequency-related features by integrating their information. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross- Parallel Transformer Neural Network (DB-CPTNN) achieves superior results compared to SOTA models. Specifically, the model attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) benchmarks.
لیست مقالات
لیست مقالات بایگانی شده
خلاصه سازی ویدیوهای کپسول آندوسکوپی با رویکرد یادگیری انتقالی
محدثه امیریان چایجان - رضا آقائی زاده ظروفی - مسعود رضا سهرابی
An Investigation on Transfer Learning for Classification of COVID-19 Chest X-Ray Images with Pre-trained Convolutional-based Architecture
Mobina Abdoli Nemati - ََAmirreza Baba Ahmadi
تعیین نقشه راه مناسب شرکتهای توزیع کشور در زمینه مدیریت سمت تقاضا
محمدرحیم محمدی
Outage Analysis of Distributed Relaying NOMA in Cognitive Radio Networks
Zahra Doorbash - Ali Jamshidi
Q-Learning-Oriented Distributed Energy Management of Grid-Connected Microgrid
Esmat Samadi - Ali Badri - Reza Ebrahimpour
On the Design of Highly Efficient Harmonic Tuned Wideband Class F-1/F Power Amplifier
Mohammad Reza Zeinali - Amir Hossein Aalipour - Hossein Shamsi
Impact of Loss of Generation (LoG) on Directional Overcurrent Protection in Microgrids
Amir Nedaei - Aref Eskandari
Breast tumor detection using graphene-based terahertz patch antenna
Zahra Yasaghi - Ayaz Ghorbani - Gholamreza Moradi
طبقه بندی سکته مغزی در یک سیستم دو بعدی چند فرکانسی با استفاده از امواج مایکروویو و یادگیری عمیق
محسن مهرانیان - محمدسعید ماجدی - امیررضا عطاری
Flexible Generation Expansion Planning Considering Representative Days of Load and Renewable Variations
Peyman Amirian - Zeinab Maleki - Mohammad-Amin Pourmoosavi - Turaj Amraee
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.0.4