0% Complete
صفحه اصلی
/
بیست و نهمین کنفرانس مهندسی برق ایران
PAVID-CVs: Persian Audio-Visual Database of CV syllables
نویسندگان :
Mahsa Hedayatipour
1
Yasser Shekofteh
2
Mohsen Ebrahimi Moghaddam
3
1- دانشگاه شهید بهشتی
2- دانشگاه شهید بهشتی
3- دانشگاه شهید بهشتی
کلمات کلیدی :
Visual Speech Recognition, Lip Reading, CV syllables, Visyllable, Audio-Visual Database, Persian/Farsi Language.
چکیده :
Abstract— Lip-reading is a visual speech recognition process. In this process, recognizing the smaller units of speech can be the basis for recognizing the larger units of a language such as words. In this paper, we have introduced a Persian (Farsi) Audio-Visual Database of CV syllables, named PAVID-CVs, as a set of isolated two-phoneme visyllable and isolated words related to the visyllables, which include only Persian CV syllables, for lip-reading or audio-visual speech recognition purposes such as isolated word recognition. This dataset can be used for machine learning-based methods due to its useful tagged information. Here, we explain the steps of preparing the database. It contains about 30 hours data from 40 speakers. Initial experiments are done utilizing hidden Markov models (HMM) as a visyllable classifier. Then, these models have been used for visual recognition of 6 Persian words with different numbers of syllables and an accuracy of 47.37% was obtained in a speaker-independent experiment.
لیست مقالات
لیست مقالات بایگانی شده
Design and Parametric Study of Circular Polarized Electrically Small Archimedean Spiral PIFA Antenna for Biomedical Implants in ISM Band
Sina Saeedi - Arezoo Abdi - Farhad Ghorbani - Hadi Aliakbarian - Ramezan Ali Sadeghzadeh
Goodbye Bitcoin: A general framework for migrating to quantum-secure cryptocurrencies
Saeed Banaeian Far - Azadeh Imani Rad - Maryam Rajabzadeh Asaar
Numerical Study of a Microfluidic-Based Motile Sperm Enrichment Using Sperm Rheotactic Behavior
Mohammadjavad Bouloorchi - Saeed Javadizadeh - Aref Valipour - MirBehrad Mousavi - Majid Badieirostami
Electricity Tariff Volatility Mitigation Using Uncertainty-Diminution and Hedge Contracts along with Risk Management Policies
Majid Moazzami - Hossein Shahinzadeh - Majid Najafi - Zohreh Azani - Shohreh Azani - Gevork B. Gharehpetian
استفاده از طیفنگاری مادون قرمز نزدیک کارکردی جهت بررسی اثر پشیمانی بر تصمیمگیری خودکنترلی
جاوید بکرانی - سید کمال الدین ستاره دان - عبدالحسین وهابی
Stability Analysis of Distributed-Order Systems: a Lyapunov Scheme
Vahid Badri
Design and Determing Two Separate Rotor Axial Flux Permanent Magnet Motor Load and Efficinecy
Siamak Omrani - Ahmad Darabi
Design and Control of a Novel Multi-port Bidirectional Buck-Boost Converter Suitable for Hybrid Electric Vehicle Charging Stations
Amir Safaeinasab - Homayon Soltani Gohari - Karim Abbaszadeh
بخشبندی خودکار تصاویر تشدید مغناطیسی ستون فقرات کمری با شبکه سِگیونِت
محمد انصاری فرد - رضا آقایی زاده ظروفی
Interval-Based Setting Approach for Distance Relays Considering Uncertainties Using Monte Carlo Simulation
Abolfazl Hadadi - Mohammad Javad Jalilian - Behrooz Vahidi - Gholam Hossein Riahy Dehkordi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.3