Speech shift: Direct speech-input-mode switching through intentional control of voice pitch

Masataka Goto, Yukihiro Omoto, Katunobu Itou, Tetsunori Kobayashi

Research output: Contribution to conferencePaper

2 Citations (Scopus)

Abstract

This paper describes a speech-input interface function, called speech shift, that enables a user to specify a speech-input mode by simply changing (shifting) voice pitch. While current speech-input interfaces have used only verbal information, we aimed at building a more user-friendly speech interface by making use of nonverbal information, the voice pitch. By intentionally controlling the pitch, a user can enter the same word with it having different meanings (functions) without explicitly changing the speech-input mode. Our speech-shift function implemented on a voice-enabled word processor, for example, can distinguish an utterance with a high pitch from one with a normal (low) pitch, regard the former as voice-command-mode input (such as file-menu edit-menu commands) the latter as regular dictation-mode text input. Our experimental results from twenty subjects showed that the speech-shift function is effective, easy to use, a labor-saving input method.

Original languageEnglish
Pages1201-1204
Number of pages4
Publication statusPublished - 2003 Jan 1
Event8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
Duration: 2003 Sep 12003 Sep 4

Other

Other8th European Conference on Speech Communication and Technology, EUROSPEECH 2003
CountrySwitzerland
CityGeneva
Period03/9/103/9/4

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Linguistics and Language
  • Communication

Fingerprint Dive into the research topics of 'Speech shift: Direct speech-input-mode switching through intentional control of voice pitch'. Together they form a unique fingerprint.

  • Cite this

    Goto, M., Omoto, Y., Itou, K., & Kobayashi, T. (2003). Speech shift: Direct speech-input-mode switching through intentional control of voice pitch. 1201-1204. Paper presented at 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, Geneva, Switzerland.