0% Complete
صفحه اصلی
/
بیست و نهمین کنفرانس مهندسی برق ایران
PAVID-CVs: Persian Audio-Visual Database of CV syllables
نویسندگان :
Mahsa Hedayatipour
1
Yasser Shekofteh
2
Mohsen Ebrahimi Moghaddam
3
1- دانشگاه شهید بهشتی
2- دانشگاه شهید بهشتی
3- دانشگاه شهید بهشتی
کلمات کلیدی :
Visual Speech Recognition, Lip Reading, CV syllables, Visyllable, Audio-Visual Database, Persian/Farsi Language.
چکیده :
Abstract— Lip-reading is a visual speech recognition process. In this process, recognizing the smaller units of speech can be the basis for recognizing the larger units of a language such as words. In this paper, we have introduced a Persian (Farsi) Audio-Visual Database of CV syllables, named PAVID-CVs, as a set of isolated two-phoneme visyllable and isolated words related to the visyllables, which include only Persian CV syllables, for lip-reading or audio-visual speech recognition purposes such as isolated word recognition. This dataset can be used for machine learning-based methods due to its useful tagged information. Here, we explain the steps of preparing the database. It contains about 30 hours data from 40 speakers. Initial experiments are done utilizing hidden Markov models (HMM) as a visyllable classifier. Then, these models have been used for visual recognition of 6 Persian words with different numbers of syllables and an accuracy of 47.37% was obtained in a speaker-independent experiment.
لیست مقالات
لیست مقالات بایگانی شده
A Novel Low Torque Ripple Hexagon Biased Flux Doubly Salient Permanent Magnet Motor
Mohammad Amirkhani - Behnam Mohammadian Mosammam - Mojtaba Mirsalim
Deep SqueezeNet Based Technique for Detection of High Impedance Arcing Faults in Electric Power Distribution Networks
Amin Mohammadi - Mohsen Jannati - Mohammadreza Shams
تجزیه و تحلیل عملکرد سیستم ناوبری اینرسیایی با استفاده از الگوریتم GAME
نرجس احمدیان - بیژن ذاکری گتابی
Devloping a clustering routing algorithm based on the efficient hybrid methodology for WSN performance optimization
Neda Mazloomi - Sajad Haghzad Klidbary
Sum-Rate Maximization for NOMA-Based Networks with D2D Communications using Matching Theory
َAlireza Gholamrezaee - Hamid Farrokhi - Javad Zeraatkar Moqaddam
Social Welfare Maximization with Demand Response Program Using Stackelberg Game Theory
Mahtab Seyyedi - Ebrahim Pirmoradi - Turaj Amraee
طراحی و شبیه سازی یک مولد اعداد تصادفی ترکیبی ارتقا یافته در آتوماتای سلولی نقطهکوانتومی با به کارگیری ساختارهای فراپایدار
سورنا آسیابان جونقانی - نوید یثربی
Reactive Power Management of PV Systems by Distributed Cooperative Control in Low Voltage Distribution Networks
Saeed Mahdavian Rostami - Mohsen Hamzeh
Autoencoders for Input Reduction in Interval Type-2 Hyperbolic Fuzzy System Identification and Control: Experimental Results
Behnaz Mohammadi - Nazanin Ildarabadi - Mohammad-R Akbarzadeh-T
A Novel Generation Shedding Procedure for Power Management System in Industrial Power Plants
Erfan Asadi - Hamid Khoshkhoo - Ali Parizad
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.4.2