Speech spotter: On-demand speech recognition in human-human conversation on the telephone or in face-to-face situations

Masataka Goto, Koji Kitayama, Katunobu Itou, Tetsunori Kobayashi

研究成果査読

8 被引用数 (Scopus)

抄録

This paper describes a novel speech-interface function, called "speech spotter", which enablesauserto enter voice commands into a speech recognizer in the midst of natural human-human conversation. In the past, it has been difficult to use automatic speech recognition in human-human conversation since it was not easy to judge, from only microphone input, whether a user was speaking to another person or a speech recognizer. We solve this problem by using two kinds of nonverbal speech information: a filled pause (a vowel-lengthening hesitation like "er⋯") and voice pitch. Only when a user utters a voice command with a high pitch just after a filled pause is the voice command accepted by the speech recognizer. By using this speech-spotter function, we have built two application systems: an on-demand information system for assisting human-human conversation and a music-playback system for enriching telephone conversation. The results from using these systems have shown that the speech-spotter function is robust and convenient enough to be used in face-to-face or cellular-phone conversations.

本文言語English
ページ1533-1536
ページ数4
出版ステータスPublished - 2004 1 1
イベント8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of
継続期間: 2004 10 42004 10 8

Other

Other8th International Conference on Spoken Language Processing, ICSLP 2004
国/地域Korea, Republic of
CityJeju, Jeju Island
Period04/10/404/10/8

ASJC Scopus subject areas

  • 言語および言語学
  • 言語学および言語

フィンガープリント

「Speech spotter: On-demand speech recognition in human-human conversation on the telephone or in face-to-face situations」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル