Estimation of syntactic structure using prosodic parameters and its application to speech recognition

Takashi Endou, Mamoru Watado, Tetsunori Kobayashi, Katsuhiko Shirai

Research output: Chapter in Book/Report/Conference proceeding (Chapter)

Abstract

Prosodic information is useful for estimating syntactic structure. This paper discusses phrase-level segmentation of continuous speech using pitch and power information, with the aim of applying it to word spotting. First, a rule-based approach using pitch and power patterns is applied. Then a statistical approach is applied, in which the vector sequences of pitch and power are matrix quantized and the conditional probabilities of the codes and the labels are calculated. Experimental results for the rule-based approach show a boundary detection accuracy of 72.8% when adjacent phrases are loosely related and 43.4% when they are tightly related. For the statistical method, the boundary detection accuracy is 63.0%.
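The statistical approach can be illustrated with a minimal sketch, under stated assumptions. The abstract does not give the codebook, the block width, the label set, or how the probabilities are estimated, so everything below is hypothetical: the toy (pitch, power) frames, the hand-written three-entry codebook, the two labels "boundary" and "none", and the relative-frequency estimate of P(label | code). It is not the authors' implementation.

```python
from collections import Counter, defaultdict

def blocks(frames, width):
    """Flatten each run of `width` consecutive (pitch, power) frames into one vector."""
    return [
        [value for frame in frames[i:i + width] for value in frame]
        for i in range(len(frames) - width + 1)
    ]

def quantize(block, codebook):
    """Return the index of the codeword nearest to `block` (squared Euclidean distance)."""
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda i: dist(block, codebook[i]))

def conditional_probs(codes, labels):
    """Estimate P(label | code) by relative frequency over aligned codes and labels."""
    counts = defaultdict(Counter)
    for code, label in zip(codes, labels):
        counts[code][label] += 1
    return {
        code: {label: n / sum(counter.values()) for label, n in counter.items()}
        for code, counter in counts.items()
    }

# Hypothetical normalized (pitch, power) frames; dips stand in for phrase-final falls.
frames = [(1.0, 1.0), (1.0, 1.0), (0.2, 0.1), (1.0, 1.0),
          (1.0, 1.0), (0.2, 0.1), (1.0, 1.0)]

# Hypothetical codebook of flattened 2-frame blocks: flat, falling, rising.
codebook = [
    [1.0, 1.0, 1.0, 1.0],
    [1.0, 1.0, 0.2, 0.1],
    [0.2, 0.1, 1.0, 1.0],
]

# Hypothetical hand labels, one per block.
labels = ["none", "boundary", "none", "none", "none", "none"]

codes = [quantize(b, codebook) for b in blocks(frames, width=2)]
probs = conditional_probs(codes, labels)
print(codes)
print(probs)
```

In this toy data the falling pattern (code 1) occurs twice and is labeled as a boundary once, so its estimated boundary probability is 0.5. A real system would train the codebook from data and would likely smooth the counts.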

Original language: English
Title of host publication: Bulletin of Centre for Informatics, Waseda University
Pages: 17-23
Number of pages: 7
Volume: 15
Publication status: Published - 1992 Jun 1
Externally published: Yes

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Endou, T., Watado, M., Kobayashi, T., & Shirai, K. (1992). Estimation of syntactic structure using prosodic parameters and its application to speech recognition. In Bulletin of Centre for Informatics, Waseda University (Vol. 15, pp. 17-23).