Disambiguation in determining phonemes of sound-imitation words for environmental sound recognition

Kazushi Ishihara, Yuya Hattori, Tomohiro Nakatani, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

研究成果

4 被引用数 (Scopus)

抄録

Onomatopoeia, or sound-imitation words (SIWs) are important in informing sound events in human-computer communication. One problem is listener-dependency in recognizing environmental sounds by means of SIWs, that is, different listener hears the same environmental sound as a different SIW even under the same condition. Therefore, the use of usual Japanese phonemes is not adequate to express SIWs. To cope with this ambiguity problem of phoneme determination, we designed a set of new phonemes, referred to as the basic phoneme-groups, to represent environmental sounds. The basic phoneme-group consists of one or more Japanese phonemes, and thus the ambiguity problem is resolved based on it by generating one or more SIWs for a sound event. An HMM-based scheme is adopted to recognize SIWs using the phoneme-groups. Listening experiments with seven subjects showed that automatic SIW recognition based on the basic phoneme-groups outperformed ones based on the other types of phonemes. The recall and precision rate were 56.4% and 72.2%, respectively.

本文言語English
ホスト出版物のタイトル8th International Conference on Spoken Language Processing, ICSLP 2004
出版社International Speech Communication Association
ページ1485-1488
ページ数4
出版ステータスPublished - 2004
外部発表はい
イベント8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of
継続期間: 2004 10 42004 10 8

Other

Other8th International Conference on Spoken Language Processing, ICSLP 2004
国/地域Korea, Republic of
CityJeju, Jeju Island
Period04/10/404/10/8

ASJC Scopus subject areas

  • 言語および言語学
  • 言語学および言語

フィンガープリント

「Disambiguation in determining phonemes of sound-imitation words for environmental sound recognition」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル