抄録
This paper investigates audio-visual interaction, i.e. inter-modal influences, in linear-regressive model adaptation for multi-modal speech recognition. In the multi-modal adaptation, inter-modal information may contribute the performance of speech recognition. Thus the influence and advantage of intermodal elements should be examined. Experiments were conducted to evaluate several transformation matrices including or excluding inter-modal and intra-modal elements, using noisy data in an audio-visual corpus. From the experimental results, the importance of effective use of audio-visual interaction is clarified.
本文言語 | English |
---|---|
ページ | 875-878 |
ページ数 | 4 |
出版ステータス | Published - 2011 |
外部発表 | はい |
イベント | Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, APSIPA ASC 2011 - Xi'an, China 継続期間: 2011 10月 18 → 2011 10月 21 |
Conference
Conference | Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, APSIPA ASC 2011 |
---|---|
国/地域 | China |
City | Xi'an |
Period | 11/10/18 → 11/10/21 |
ASJC Scopus subject areas
- 情報システム
- 信号処理