0% Complete
صفحه اصلی
/
بیست و نهمین کنفرانس مهندسی برق ایران
PAVID-CVs: Persian Audio-Visual Database of CV syllables
نویسندگان :
Mahsa Hedayatipour
1
Yasser Shekofteh
2
Mohsen Ebrahimi Moghaddam
3
1- دانشگاه شهید بهشتی
2- دانشگاه شهید بهشتی
3- دانشگاه شهید بهشتی
کلمات کلیدی :
Visual Speech Recognition, Lip Reading, CV syllables, Visyllable, Audio-Visual Database, Persian/Farsi Language.
چکیده :
Abstract— Lip-reading is a visual speech recognition process. In this process, recognizing the smaller units of speech can be the basis for recognizing the larger units of a language such as words. In this paper, we have introduced a Persian (Farsi) Audio-Visual Database of CV syllables, named PAVID-CVs, as a set of isolated two-phoneme visyllable and isolated words related to the visyllables, which include only Persian CV syllables, for lip-reading or audio-visual speech recognition purposes such as isolated word recognition. This dataset can be used for machine learning-based methods due to its useful tagged information. Here, we explain the steps of preparing the database. It contains about 30 hours data from 40 speakers. Initial experiments are done utilizing hidden Markov models (HMM) as a visyllable classifier. Then, these models have been used for visual recognition of 6 Persian words with different numbers of syllables and an accuracy of 47.37% was obtained in a speaker-independent experiment.
لیست مقالات
لیست مقالات بایگانی شده
Stable Target Tracking in Wireless Sensor Networks Under Malicious Cyber Attacks
Jafar Akhondali - Mohammad Taheri
Adaptive Attitude Synchronization and Tracking Control of Spacecraft Formation Flying using Reaction Wheel without Angular Velocity Measurement
Amin Mihankhah - Ali Doustmohammadi
Event Related Potentials Extraction using Low-rank Tensor Decomposition
Zahra SohrabiBonab - Mohammad Bagher Shamsollahi
SchEdge: A Dynamic, Multi-agent, and Scalable Scheduling Simulator for IoT Edge
Ali Hamedi - Amirali Ghaedi - Amin Soltan-beigi - Athena Abdi
Robust Laguerre based model predictive control for trajectory tracking of LTV systems
Marzieh Jamalabadi - Mahyar Naraghi - Iman Sharifi - Elnaz Firouzmand
Investigation of Cross-coupling Effects on Grid-connected Inverters with LCL Filter Based on RGA Analysis
Ali Rezaei - Mohsen Hamzeh - Nima Mahdian Dehkordi
Distributed Data Processing for Multi-Agent Systems Via Wave Model
Saeedreza Tofighi - Masoud Shafiee
Multiphysics Analysis of HTS Transformer utilizing Stainless Steel Stabilizer on Short Circuit Condition
Ashkan Mirzaei Rajeooni - Hossein Heydari - Mohammad Khakroei - Mahdi Rahimi Pirbasti
Reliability Evaluation of Distribution System Considering a Modified Electric Bus as a Mobile Energy Storage (Tehran E-Bus as a Case study)
Ali Kamali - Amir Soleimani - Seyed Vahid Nourbakhsh - Hassan Nehzati - Vahid Esfahanian - Mahmoud Oukati Sadegh
Object Detection enhancement based on Super-Resolution Mapping
Danial Abyazi - Dadfar Abyazi - Mehran Yazdi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.8.0