33rd International Conference on Electrical Engineering
Dual-Branch Cross-Parallel Transformer Model for Single-Channel Speech Enhancement
Authors:
Mohammad Hakimkhah (1)
Rahil Mahdian Toroghi (2)
Hassan Zareian (3)
1- Iran Broadcasting University
2- Iran Broadcasting University
3- Iran Broadcasting University
Keywords:
Speech enhancement, Single microphone, Cross-parallel Transformer, Dual branch
Abstract:
This paper proposes a dual-branch parallel structure for single-channel speech enhancement that consists of a Magnitude Mask Branch (MMB) and a Complex Mapping Branch (CMB) and uses a Cross-Parallel Transformer (CPT) in the time-frequency domain. The CPT captures long-term dependencies along the time and frequency axes and integrates the two to extract time-frequency features. The MMB estimates the spectral magnitude, while the CMB compensates for lost spectral details and implicitly extracts phase information. The approach is evaluated on the public VoiceBank+DEMAND dataset. The proposed Dual-Branch Cross-Parallel Transformer Neural Network (DB-CPTNN) attains PESQ, STOI, SSNR, CSIG, CBAK, and COVL scores of 3.37, 95.9%, 10.58, 4.71, 3.89, and 4.15, respectively, outperforming state-of-the-art (SOTA) models.
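The abstract does not say how the outputs of the two branches are fused. One common choice in dual-branch enhancement models is to apply the estimated mask to the noisy magnitude, reuse the noisy phase, and add the complex branch's output as a residual. The sketch below illustrates only that assumed fusion step. The function name, argument names, and toy values are hypothetical and are not taken from the paper.

```python
import cmath


def combine_branches(noisy_spec, magnitude_mask, complex_residual):
    """Fuse hypothetical outputs of a magnitude-mask and a complex-mapping branch.

    noisy_spec       : 2-D list (frequency x time) of complex STFT bins
    magnitude_mask   : 2-D list of real gains, same shape (assumed MMB output)
    complex_residual : 2-D list of complex corrections, same shape (assumed CMB output)
    """
    enhanced = []
    for spec_row, mask_row, res_row in zip(noisy_spec, magnitude_mask, complex_residual):
        out_row = []
        for x, m, r in zip(spec_row, mask_row, res_row):
            # Coarse estimate: masked magnitude recombined with the noisy phase.
            coarse = cmath.rect(m * abs(x), cmath.phase(x))
            # The residual shifts real and imaginary parts, so it can
            # correct the phase as well as the magnitude.
            out_row.append(coarse + r)
        enhanced.append(out_row)
    return enhanced


# Toy 2x2 spectrogram (illustrative values only, not real data).
noisy = [[1 + 1j, 2 + 0j], [0 + 3j, -1 - 1j]]
mask = [[0.5, 1.0], [0.0, 0.8]]
residual = [[0.1 - 0.1j, 0j], [0.2 + 0j, 0j]]
enhanced = combine_branches(noisy, mask, residual)
```

In this toy example, a bin with mask 1.0 and zero residual passes through unchanged, while a bin with mask 0.0 is determined entirely by the complex branch.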