Estimating articulatory motion from speech wave

Katsuhiko Shirai, Tetsunori Kobayashi

    Research output: Contribution to journalArticle

    25 Citations (Scopus)

    Abstract

    If articulatory movements can be estimated, then the articulatory parameters which represent the motion of the articulatory organs would be useful for speech recognition. This paper discusses an effective method of estimating articulatory movements and its application to speech recognition. Firstly, what is described is a method of estimating articulatory parameters known as the model matching method, and various spectral distance measures are evaluated for this method. The results show that the best in average is the higher order cepstral distance measure, which is one of the peak weighted measure. Secondly, articulatory parameters are utilized for the recognition of vowels uttered by unspecified speakers. It is shown that the adaptation of the model by the estimated mean vocal tract length is effective to normalize speaker difference. Thirdly, the motor commands to move the articulatory organs are estimated considering articulatory dynamics, and the continuous vowels are recognized by means of these estimated commands. It has been found that a considerable part of the coarticulation effects can be compensated for by this command estimated, and the method is useful for continuous speech recognition.

    Original languageEnglish
    Pages (from-to)159-170
    Number of pages12
    JournalSpeech Communication
    Volume5
    Issue number2
    DOIs
    Publication statusPublished - 1986

    Fingerprint

    Speech recognition
    Continuous speech recognition
    Speech Recognition
    Motion
    Distance Measure
    Model Matching
    Normalize
    Spectral Measure
    Speech
    Waves
    Higher Order
    Movement
    Organs

    Keywords

    • articulatory model
    • motor command
    • Speech analysis
    • speech recognition

    ASJC Scopus subject areas

    • Signal Processing
    • Electrical and Electronic Engineering
    • Experimental and Cognitive Psychology
    • Linguistics and Language

    Cite this

    Estimating articulatory motion from speech wave. / Shirai, Katsuhiko; Kobayashi, Tetsunori.

    In: Speech Communication, Vol. 5, No. 2, 1986, p. 159-170.

    Research output: Contribution to journalArticle

    Shirai, Katsuhiko ; Kobayashi, Tetsunori. / Estimating articulatory motion from speech wave. In: Speech Communication. 1986 ; Vol. 5, No. 2. pp. 159-170.
    @article{ae4267c2640243dc95a1ea3a20716216,
    title = "Estimating articulatory motion from speech wave",
    abstract = "If articulatory movements can be estimated, then the articulatory parameters which represent the motion of the articulatory organs would be useful for speech recognition. This paper discusses an effective method of estimating articulatory movements and its application to speech recognition. Firstly, what is described is a method of estimating articulatory parameters known as the model matching method, and various spectral distance measures are evaluated for this method. The results show that the best in average is the higher order cepstral distance measure, which is one of the peak weighted measure. Secondly, articulatory parameters are utilized for the recognition of vowels uttered by unspecified speakers. It is shown that the adaptation of the model by the estimated mean vocal tract length is effective to normalize speaker difference. Thirdly, the motor commands to move the articulatory organs are estimated considering articulatory dynamics, and the continuous vowels are recognized by means of these estimated commands. It has been found that a considerable part of the coarticulation effects can be compensated for by this command estimated, and the method is useful for continuous speech recognition.",
    keywords = "articulatory model, motor command, Speech analysis, speech recognition",
    author = "Katsuhiko Shirai and Tetsunori Kobayashi",
    year = "1986",
    doi = "10.1016/0167-6393(86)90005-1",
    language = "English",
    volume = "5",
    pages = "159--170",
    journal = "Speech Communication",
    issn = "0167-6393",
    publisher = "Elsevier",
    number = "2",

    }

    TY - JOUR

    T1 - Estimating articulatory motion from speech wave

    AU - Shirai, Katsuhiko

    AU - Kobayashi, Tetsunori

    PY - 1986

    Y1 - 1986

    N2 - If articulatory movements can be estimated, then the articulatory parameters which represent the motion of the articulatory organs would be useful for speech recognition. This paper discusses an effective method of estimating articulatory movements and its application to speech recognition. Firstly, what is described is a method of estimating articulatory parameters known as the model matching method, and various spectral distance measures are evaluated for this method. The results show that the best in average is the higher order cepstral distance measure, which is one of the peak weighted measure. Secondly, articulatory parameters are utilized for the recognition of vowels uttered by unspecified speakers. It is shown that the adaptation of the model by the estimated mean vocal tract length is effective to normalize speaker difference. Thirdly, the motor commands to move the articulatory organs are estimated considering articulatory dynamics, and the continuous vowels are recognized by means of these estimated commands. It has been found that a considerable part of the coarticulation effects can be compensated for by this command estimated, and the method is useful for continuous speech recognition.

    AB - If articulatory movements can be estimated, then the articulatory parameters which represent the motion of the articulatory organs would be useful for speech recognition. This paper discusses an effective method of estimating articulatory movements and its application to speech recognition. Firstly, what is described is a method of estimating articulatory parameters known as the model matching method, and various spectral distance measures are evaluated for this method. The results show that the best in average is the higher order cepstral distance measure, which is one of the peak weighted measure. Secondly, articulatory parameters are utilized for the recognition of vowels uttered by unspecified speakers. It is shown that the adaptation of the model by the estimated mean vocal tract length is effective to normalize speaker difference. Thirdly, the motor commands to move the articulatory organs are estimated considering articulatory dynamics, and the continuous vowels are recognized by means of these estimated commands. It has been found that a considerable part of the coarticulation effects can be compensated for by this command estimated, and the method is useful for continuous speech recognition.

    KW - articulatory model

    KW - motor command

    KW - Speech analysis

    KW - speech recognition

    UR - http://www.scopus.com/inward/record.url?scp=0008499181&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=0008499181&partnerID=8YFLogxK

    U2 - 10.1016/0167-6393(86)90005-1

    DO - 10.1016/0167-6393(86)90005-1

    M3 - Article

    VL - 5

    SP - 159

    EP - 170

    JO - Speech Communication

    JF - Speech Communication

    SN - 0167-6393

    IS - 2

    ER -