Understanding three simultaneous speeches

Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata

研究成果: Conference contribution

6 被引用数 (Scopus)

抄録

Understanding three simultaneous speeches is proposed as a challenge problem to foster artificial intelligence, speech and sound understanding or recognition, and computational auditory scene analysis research. Automatic speech recognition under noisy environments is attacked by speech enhancement techniques such as noise reduction and speaker adaptation. However, the signal-to-noise ratio of speech in two simultaneous speeches is too poor to apply these techniques. Therefore, novel techniques need to be developed. One candidate is to use speech stream segregation as a front-end of automatic speech recognition systems. Preliminary experiments on understanding two simultaneous speeches show that the proposed challenge problem will be feasible with speech stream segregation. The detailed plan of the research on and benchmark sounds for the proposed challenge problem is also presented.

本文言語English
ホスト出版物のタイトルIJCAI International Joint Conference on Artificial Intelligence
ページ30-35
ページ数6
1
出版ステータスPublished - 1997
外部発表はい
イベント15th International Joint Conference on Artificial Intelligence, IJCAI 1997 - Nagoya, Aichi, Japan
継続期間: 1997 8 231997 8 29

Other

Other15th International Joint Conference on Artificial Intelligence, IJCAI 1997
CountryJapan
CityNagoya, Aichi
Period97/8/2397/8/29

ASJC Scopus subject areas

  • Artificial Intelligence

フィンガープリント 「Understanding three simultaneous speeches」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル