0% Complete
صفحه اصلی
/
بیست و نهمین کنفرانس مهندسی برق ایران
Bilabial Consonants Recognition in CV Persian Syllable Based on Computer Vision
نویسندگان :
Melika Khajeh
1
Azam Bastanfard
2
Dariush Amirkhani
3
1- دانشگاه آزاد اسلامی واحد کرج
2- دانشگاه آزاد اسلامی واحد کرج
3- دانشگاه صداوسیما
کلمات کلیدی :
CLM algorithm, DTW algorithm, viseme, Consonant, feature extraction, Euclidean distance.
چکیده :
According to previous researches, Persian consonants have been divided into seven categories based on viseme. It led to several consonants being placed in one category. Detecting between consonants in one category is so hard because the spots for the production of these consonants are the same.The forms of lips do not change at the time of production; these consonants are hardly distinguishable. The major challenge is to recognize the differences between lip shapes in one category. The purpose of this study is to recognize differences between bilabial consonants such as /p/, /b/, and /m/ in a word that composed of consonant/vowel called CV by computer vision. For the first time, this study attempts to distinguish these consonants. Proper pronunciations of words are required to identify consonants. Therefore, a database has formed based on the videos of the speech therapists. Generally, this kind of process is including 1- lip detection, 2- lip feature extraction, and 3- classification systems for the diagnosis of consonants. In this paper, consonants recognition in a category based on lip shape using the CLM algorithm for lip detection is presented. Geometric algorithms for feature extraction and DTW and equalizer as a classification system are proposed. Although this study is open because we could identify differences among consonants in just one class, we could reach remarkable CV video results for the first time. We could aim for acceptable results with reasonable accuracy for bilabial consonants detection. The principle purpose of this study is to improve lip-reading systems in security issues and help hearing-impaired people in interaction with their surroundings. The results of this paper can have a positive effect on speech systems.
لیست مقالات
لیست مقالات بایگانی شده
A Graphene Terahertz Detector based on the Photo-Thermoelectric Effect with Frequency Selectivity
Faramarz Alihosseini - Zahra Heshmatpanah - Hesam Zandi
Controlling Energy Consumption and Intelligent Manufacturing through an Energy-aware Scheduling Algorithm in Industrial Sector
Negin Shafinezhad - Maryam Mahmoudi - Hamid Abrishami - Vahid Baghishani
Reactive Power Management of PV Systems by Distributed Cooperative Control in Low Voltage Distribution Networks
Saeed Mahdavian Rostami - Mohsen Hamzeh
Joint Space Control of a Deployable Cable Driven Parallel Robot with Redundant Actuators
S. Ahmad Khalilpour - Ali Hassani - Rohollah Khorambakht - A.R. Zahedi - Abbas Bataleblu - Hamid D. Taghirad
Battery Sizing for energy management of islanded Microgrid considering the impact of discharge duration on Lead-Acid Battery effective capacity
Mehrdad Bagheri Sanjareh - Mohammad Hassan Nazari - Narges Sadat Ghiasi - Seyyed Mohammad Sadegh Ghiasi - Seyed Hoseein Hosseinian
Breast Cancer Detection by Time-Reversal Imaging Using Ultra-Wideband Modified Circular Patch Antenna Array
Mohammad Haghpanah - Zahra Ghattan Kashani - Atefeh Khalili Param
Evaluation of Different Connectivity Methods for Obsessive Compulsive Disorder Diagnosis
Samandokht Rashidi - Amin Abdipourasl - Fatemeh Jamaloo - Reza Rostami
Soft Decision Adaptive Deep Learning Detection for Enhanced Massive MIMO Performance
Farnaz Sedaghati - Mojtaba Amiri - Ali Olfat
Improving Quarter-Wavelength Resonator Technique for Parasitic Cancellation of the ESD Protection Diode for High-Frequency Applications
Emadodin Zia Khodadadian - Mojtaba Joodaki
A 400 ps Input Time Range 2× Time Amplifier Using Time-to-Current Compensation Technique
Mohammad Amin Yaldagard - Hossein Shamsi
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.4.2