Automatic sound-imitation word recognition from environmental sounds focusing on ambiguity problem in determining phonemes

Kazushi Ishihara*, Tomohiro Nakatani, Tetsuya Ogata, Hiroshi G. Okuno

*Corresponding author for this work

Research output: Conference article, peer-reviewed

11 citations (Scopus)

Abstract

Sound-imitation words (SIWs), or onomatopoeia, are important for human-computer interaction and the automatic tagging of sound archives. The main problem in automatic SIW recognition is ambiguity in determining phonemes, since different listeners hear the same environmental sound as different SIWs even in the same situation. To solve this problem, we designed a set of new phonemes, called the basic phoneme-group set, to represent environmental sounds, in addition to a set of articulation-based phoneme-groups. Automatic SIW recognition based on a Hidden Markov Model (HMM) with the basic phoneme-groups is allowed to generate multiple SIWs in order to absorb ambiguities caused by listener and situation dependency. Listening experiments with seven subjects showed that automatic SIW recognition based on the basic phoneme-groups outperformed recognition based on the articulation-based phoneme-groups and recognition based on Japanese phonemes. The proposed system proved more adequate for use in human-computer interaction.

Original language: English
Pages (from-to): 909-918
Number of pages: 10
Journal: Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
Volume: 3157
DOI
Publication status: Published - 1 Jan 2004
Externally published: Yes
Event: 8th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2004: Trends in Artificial Intelligence - Auckland, New Zealand
Duration: 9 Aug 2004 to 13 Aug 2004

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
