A review of audio-visual speech recognition

Thum, Wei Seong and M. Z., Ibrahim (2018) A review of audio-visual speech recognition. Journal of Telecommunication, Electronic and Computer Engineering, 10 (1-4). pp. 35-40. ISSN 2289-8131. (Published)

A review of audio-visual speech recognition.pdf
Available under License Creative Commons Attribution.

DOI/Official URL: http://journal.utem.edu.my/index.php/jtec/article/...

Abstract

Speech is the most important tool of interaction among human beings. This has inspired researchers to study speech recognition further and to develop computer systems that are able to integrate and understand human speech. However, acoustically noisy environments can heavily contaminate the audio speech signal and degrade overall recognition performance. Audio-Visual Speech Recognition (AVSR) is designed to overcome this problem by utilising visual images, which are unaffected by acoustic noise. The aim of this paper is to discuss the structure of AVSR systems, including the front-end processes, the audio-visual data corpora used, recent works and accuracy estimation methods.

Item Type:	Article
Additional Information:	Indexed by Scopus
Uncontrolled Keywords:	Audio-visual speech recognition; Audio visual data corpus; Feature extraction; Model validation techniques; Performance evaluation
Subjects:	T Technology > TK Electrical engineering. Electronics. Nuclear engineering
Faculty/Division:	Faculty of Electrical & Electronic Engineering
Depositing User:	Mrs. Neng Sury Sulaiman
Date Deposited:	14 Sep 2018 07:20
Last Modified:	14 Sep 2018 07:20
URI:	http://umpir.ump.edu.my/id/eprint/21637