0% Complete
صفحه اصلی
/
سی و سومین کنفرانس بین المللی مهندسی برق
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
نویسندگان :
Mohammad Hakimkhah
1
Rahil Mahdian Toroghi
2
Hassan Zareian
3
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
کلمات کلیدی :
Speech enhancement،Single microphone،Crossparallel Transformer،Dual branch
چکیده :
In this paper, a dual-branch parallel structure is proposed for single-channel speech enhancement, consisting of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) utilizing a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT effectively captures longterm dependencies along time and frequency axes, extracting time-frequency-related features by integrating their information. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross- Parallel Transformer Neural Network (DB-CPTNN) achieves superior results compared to SOTA models. Specifically, the model attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) benchmarks.
لیست مقالات
لیست مقالات بایگانی شده
Object Detection enhancement based on Super-Resolution Mapping
Danial Abyazi - Dadfar Abyazi - Mehran Yazdi
پنل بازیابی: نرم افزار بازیابی سیستمهای قدرت با قیود امنیتی
سجاد نجفی روادانق - رسول اسماعیل زاده - رضا فرتاش
Enhancing Kriging with Inductive Spatio-Temporal GraphODE
Amin Sheykhzadeh - Behzad Moshiri - Ebrahim Ghafar-Zadeh
High-Performance Biosensor Based on SRR for Early Breast Cancer Detection
Hasti Enayattarighehkari - Sina Aramtan - Gholamreza Moradi - Farhad Azadi Namin
Three Improved Boost Topologies with Continuous Input/Output Currents Suitable for High-Voltage Applications
Hossein Gholizadeh - Hesam Ehsan - Alireza Poursalan - Mohammad Hamed Samimi
کنترل دست پروتزی با استفاده از کنترل کننده تطبیقی فازی- PI به کمک سیگنال های EMG
مهسا برفی - حمیدرضا کرمی - سیدمنوچهر حسینی پیلانگرگی
E-RESO: An Enhanced Time Redundancy-based Error Detection Approach for Arithmetic Operations
Sina Shahoveisi - Athena Abdi
Deep SqueezeNet Based Technique for Detection of High Impedance Arcing Faults in Electric Power Distribution Networks
Amin Mohammadi - Mohsen Jannati - Mohammadreza Shams
طراحی خودرمزگذار متغیر جهت تشخیص عیب در بیرینگهای غلتشی
مریم آهنگ - مهدی علیاری شورهدلی
Design and Analysis of A Non-Isolated High gain DC-DC Converter with Single Power Switch
Amirreza Bahadori - Seyed Hossein Hosseini - Ebrahim Babaei - Saeed Danyali
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.3.2