0% Complete
صفحه اصلی
/
بیست و نهمین کنفرانس مهندسی برق ایران
Bilabial Consonants Recognition in CV Persian Syllable Based on Computer Vision
نویسندگان :
Melika Khajeh
1
Azam Bastanfard
2
Dariush Amirkhani
3
1- دانشگاه آزاد اسلامی واحد کرج
2- دانشگاه آزاد اسلامی واحد کرج
3- دانشگاه صداوسیما
کلمات کلیدی :
CLM algorithm, DTW algorithm, viseme, Consonant, feature extraction, Euclidean distance.
چکیده :
According to previous researches, Persian consonants have been divided into seven categories based on viseme. It led to several consonants being placed in one category. Detecting between consonants in one category is so hard because the spots for the production of these consonants are the same.The forms of lips do not change at the time of production; these consonants are hardly distinguishable. The major challenge is to recognize the differences between lip shapes in one category. The purpose of this study is to recognize differences between bilabial consonants such as /p/, /b/, and /m/ in a word that composed of consonant/vowel called CV by computer vision. For the first time, this study attempts to distinguish these consonants. Proper pronunciations of words are required to identify consonants. Therefore, a database has formed based on the videos of the speech therapists. Generally, this kind of process is including 1- lip detection, 2- lip feature extraction, and 3- classification systems for the diagnosis of consonants. In this paper, consonants recognition in a category based on lip shape using the CLM algorithm for lip detection is presented. Geometric algorithms for feature extraction and DTW and equalizer as a classification system are proposed. Although this study is open because we could identify differences among consonants in just one class, we could reach remarkable CV video results for the first time. We could aim for acceptable results with reasonable accuracy for bilabial consonants detection. The principle purpose of this study is to improve lip-reading systems in security issues and help hearing-impaired people in interaction with their surroundings. The results of this paper can have a positive effect on speech systems.
لیست مقالات
لیست مقالات بایگانی شده
A 0.5-V Ultra-Low-Power Low-Pass-filter with Low Noise for ECG detection system
Yasin Heydarzadeh - Mehran Khanehbeygi - Sajad Sohrabian - Ziaddin Daie Koozehkanani
Conversion of Linear Polarized Light-to-Orbital Angular Momentum with Variable Topological Charges, Using the Surface Plasmons of Elliptical Holes Etched in a Gold Layer
Amir Mohammad Ghanei - Abolfazl Aghili - Sara Darbari
A Circularly Polarized Metal-Only Holographic Leaky-Wave Antenna Based on Spoof Surface Plasmon Polaritons
Reza Ashrafi Mohabadi - Sajjad Zohrevand - Mohammad Amin Chaychizadeh - Nader Komjani
مدیریت بهینه توان در یک ساختمان هوشمند حاوی واحدهای ترکیبی برق و حرارت و منابع تولیدپراکنده در حضور ذخیره ساز انرژی
اسماعیل زحمت کشان
A New 10 Watt Power Amplifier for GSM 900 MHz base stations with 44% Bandwidth
Marzieh Chegini - HojjatAllah Nemati - Mahmoud Kamarei
کنترل تشنج در مدل صرع ساز با استفاده از کنترل کننده سطح دینامیکی
مهدی کمالی دولت آبادی - مرضیه کمالی - فرزانه شایق
Development of Iterative Learning Control Method for Trajectory Tracking in Two-Dimensional Discrete-Time Systems
Meysam Azhdari - Tahereh Binazadeh - Soheila Abedi
Evaluating the Impact of Operation Scheduling Methods on Microgrid Reliability Using Monte Carlo Simulation
Mahsa Omri - Mohammad Jooshaki - Ali Abbaspour - Mahmud Fotuhi-Firuzabad
Adaptive dynamic programming for kinematic control of 3 interconnected wheeled mobile robots
Aliakbar Ghasemzadeh - Roya Amjadifard - Ali Keymasi Khalaji
Design of Optimal Iterative Learning Control AutoPilot for Landing Fixed-Wing Aircraft
Ali Raddanipour - Masoud Shafiee
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0