CENSREC-2-AV: An evaluation framework for bimodal speech recognition in real environments

Naoya Ukai, Takuya Kawasaki, Satoshi Tamura, Satoru Hayamizu, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Citations (Scopus)

Abstract

In this paper, we introduce a bimodal speech recognition corpus recorded in real environments. In recent years, speech recognition technology has increasingly been used in noisy conditions, so it has become necessary to achieve higher recognition accuracy in real environments. As one solution, bimodal speech recognition using both audio and non-audio information is being studied. However, there are few databases that can be used to evaluate bimodal speech recognition in real environments. We therefore introduce CENSREC-2-AV, which we have been building as a new bimodal speech recognition corpus. CENSREC-2-AV is one of the databases of the CENSREC project; we previously provided a similar corpus, CENSREC-1-AV, as a database for bimodal speech recognition under additive noise. Both corpora contain speech data and lip images. Using CENSREC-2-AV, researchers can evaluate in real environments a bimodal speech recognition method built with CENSREC-1-AV, which consists of clean data.

Original language: English
Title of host publication: Proceedings of the 2012 International Conference on Speech Database and Assessments, Oriental COCOSDA 2012
Pages: 88-91
Number of pages: 4
DOIs
Publication status: Published - 2012
Externally published: Yes
Event: 2012 15th International Conference on Speech Database and Assessments, Oriental COCOSDA 2012 - Macau, China
Duration: 2012 Dec 9 → 2012 Dec 12

Publication series

Name: Proceedings of the 2012 International Conference on Speech Database and Assessments, Oriental COCOSDA 2012

Conference

Conference: 2012 15th International Conference on Speech Database and Assessments, Oriental COCOSDA 2012
Country: China
City: Macau
Period: 12/12/9 → 12/12/12

Keywords

  • audio-visual speech corpus
  • bimodal speech recognition
  • CENSREC
  • real environment

ASJC Scopus subject areas

  • Software
  • Speech and Hearing
