Speech shift: Direct speech-input-mode switching through intentional control of voice pitch

Masataka Goto, Yukihiro Omoto, Katunobu Itou, Tetsunori Kobayashi

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    2 Citations (Scopus)

    Abstract

    This paper describes a speech-input interface function, called speech shift, that enables a user to specify a speech-input mode by simply changing (shifting) voice pitch. While current speech-input interfaces have used only verbal information, we aimed at building a more user-friendly speech interface by making use of nonverbal information, the voice pitch. By intentionally controlling the pitch, a user can enter the same word with it having different meanings (functions) without explicitly changing the speech-input mode. Our speech-shift function implemented on a voice-enabled word processor, for example, can distinguish an utterance with a high pitch from one with a normal (low) pitch, regard the former as voice-command-mode input (such as file-menu edit-menu commands) the latter as regular dictation-mode text input. Our experimental results from twenty subjects showed that the speech-shift function is effective, easy to use, a labor-saving input method.

    Original languageEnglish
    Title of host publicationEUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology
    PublisherInternational Speech Communication Association
    Pages1201-1204
    Number of pages4
    Publication statusPublished - 2003
    Event8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
    Duration: 2003 Sep 12003 Sep 4

    Other

    Other8th European Conference on Speech Communication and Technology, EUROSPEECH 2003
    CountrySwitzerland
    CityGeneva
    Period03/9/103/9/4

    Fingerprint

    labor
    Personnel

    ASJC Scopus subject areas

    • Computer Science Applications
    • Software
    • Linguistics and Language
    • Communication

    Cite this

    Goto, M., Omoto, Y., Itou, K., & Kobayashi, T. (2003). Speech shift: Direct speech-input-mode switching through intentional control of voice pitch. In EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (pp. 1201-1204). International Speech Communication Association.

    Speech shift : Direct speech-input-mode switching through intentional control of voice pitch. / Goto, Masataka; Omoto, Yukihiro; Itou, Katunobu; Kobayashi, Tetsunori.

    EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology. International Speech Communication Association, 2003. p. 1201-1204.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Goto, M, Omoto, Y, Itou, K & Kobayashi, T 2003, Speech shift: Direct speech-input-mode switching through intentional control of voice pitch. in EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology. International Speech Communication Association, pp. 1201-1204, 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, Geneva, Switzerland, 03/9/1.
    Goto M, Omoto Y, Itou K, Kobayashi T. Speech shift: Direct speech-input-mode switching through intentional control of voice pitch. In EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology. International Speech Communication Association. 2003. p. 1201-1204
    Goto, Masataka ; Omoto, Yukihiro ; Itou, Katunobu ; Kobayashi, Tetsunori. / Speech shift : Direct speech-input-mode switching through intentional control of voice pitch. EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology. International Speech Communication Association, 2003. pp. 1201-1204
    @inproceedings{959434a214ba4e95a40300b5d9d974b5,
    title = "Speech shift: Direct speech-input-mode switching through intentional control of voice pitch",
    abstract = "This paper describes a speech-input interface function, called speech shift, that enables a user to specify a speech-input mode by simply changing (shifting) voice pitch. While current speech-input interfaces have used only verbal information, we aimed at building a more user-friendly speech interface by making use of nonverbal information, the voice pitch. By intentionally controlling the pitch, a user can enter the same word with it having different meanings (functions) without explicitly changing the speech-input mode. Our speech-shift function implemented on a voice-enabled word processor, for example, can distinguish an utterance with a high pitch from one with a normal (low) pitch, regard the former as voice-command-mode input (such as file-menu edit-menu commands) the latter as regular dictation-mode text input. Our experimental results from twenty subjects showed that the speech-shift function is effective, easy to use, a labor-saving input method.",
    author = "Masataka Goto and Yukihiro Omoto and Katunobu Itou and Tetsunori Kobayashi",
    year = "2003",
    language = "English",
    pages = "1201--1204",
    booktitle = "EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology",
    publisher = "International Speech Communication Association",

    }

    TY - GEN

    T1 - Speech shift

    T2 - Direct speech-input-mode switching through intentional control of voice pitch

    AU - Goto, Masataka

    AU - Omoto, Yukihiro

    AU - Itou, Katunobu

    AU - Kobayashi, Tetsunori

    PY - 2003

    Y1 - 2003

    N2 - This paper describes a speech-input interface function, called speech shift, that enables a user to specify a speech-input mode by simply changing (shifting) voice pitch. While current speech-input interfaces have used only verbal information, we aimed at building a more user-friendly speech interface by making use of nonverbal information, the voice pitch. By intentionally controlling the pitch, a user can enter the same word with it having different meanings (functions) without explicitly changing the speech-input mode. Our speech-shift function implemented on a voice-enabled word processor, for example, can distinguish an utterance with a high pitch from one with a normal (low) pitch, regard the former as voice-command-mode input (such as file-menu edit-menu commands) the latter as regular dictation-mode text input. Our experimental results from twenty subjects showed that the speech-shift function is effective, easy to use, a labor-saving input method.

    AB - This paper describes a speech-input interface function, called speech shift, that enables a user to specify a speech-input mode by simply changing (shifting) voice pitch. While current speech-input interfaces have used only verbal information, we aimed at building a more user-friendly speech interface by making use of nonverbal information, the voice pitch. By intentionally controlling the pitch, a user can enter the same word with it having different meanings (functions) without explicitly changing the speech-input mode. Our speech-shift function implemented on a voice-enabled word processor, for example, can distinguish an utterance with a high pitch from one with a normal (low) pitch, regard the former as voice-command-mode input (such as file-menu edit-menu commands) the latter as regular dictation-mode text input. Our experimental results from twenty subjects showed that the speech-shift function is effective, easy to use, a labor-saving input method.

    UR - http://www.scopus.com/inward/record.url?scp=33745222948&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=33745222948&partnerID=8YFLogxK

    M3 - Conference contribution

    AN - SCOPUS:33745222948

    SP - 1201

    EP - 1204

    BT - EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology

    PB - International Speech Communication Association

    ER -