29th Iranian Conference on Electrical Engineering
PAVID-CVs: Persian Audio-Visual Database of CV syllables
Authors:
Mahsa Hedayatipour (1), Yasser Shekofteh (2), Mohsen Ebrahimi Moghaddam (3)
1- Shahid Beheshti University
2- Shahid Beheshti University
3- Shahid Beheshti University
Keywords:
Visual Speech Recognition, Lip Reading, CV syllables, Visyllable, Audio-Visual Database, Persian/Farsi Language.
Abstract:
Lip-reading is a visual speech recognition process in which recognizing the smaller units of speech can serve as the basis for recognizing larger units of a language, such as words. In this paper, we introduce a Persian (Farsi) Audio-Visual Database of CV syllables, named PAVID-CVs. It is a set of isolated two-phoneme visyllables and isolated words related to those visyllables, which include only Persian CV syllables, intended for lip-reading or audio-visual speech recognition tasks such as isolated word recognition. Because of its tagged information, the dataset is suitable for machine learning-based methods. We explain the steps taken to prepare the database, which contains about 30 hours of data from 40 speakers. In initial experiments, hidden Markov models (HMMs) were used as visyllable classifiers. These models were then used for visual recognition of 6 Persian words with different numbers of syllables, and an accuracy of 47.37% was obtained in a speaker-independent experiment.