TY - CHAP

T1 - Estimation of syntactic structure using prosodic parameters and its application to speech recognition

AU - Endou, Takashi

AU - Watado, Mamoru

AU - Kobayashi, Tetsunori

AU - Shirai, Katsuhiko

PY - 1992/6/1

Y1 - 1992/6/1

N2 - Prosodic information is very useful for estimating syntactic structure. Phrase-level segmentation of continuous speech using pitch and power information is therefore discussed for use in word spotting. First, a rule-based approach using pitch and power patterns is applied. Then a statistical approach is applied, in which the vector sequences of pitch and power are matrix quantized and the conditional probabilities of the codes and labels are calculated. Experimental results for the rule-based approach show a boundary detection accuracy of 72.8% when adjacent phrases are loosely related and 43.4% when they are tightly related. For the statistical method, the boundary detection accuracy is 63.0%.

AB - Prosodic information is very useful for estimating syntactic structure. Phrase-level segmentation of continuous speech using pitch and power information is therefore discussed for use in word spotting. First, a rule-based approach using pitch and power patterns is applied. Then a statistical approach is applied, in which the vector sequences of pitch and power are matrix quantized and the conditional probabilities of the codes and labels are calculated. Experimental results for the rule-based approach show a boundary detection accuracy of 72.8% when adjacent phrases are loosely related and 43.4% when they are tightly related. For the statistical method, the boundary detection accuracy is 63.0%.

UR - http://www.scopus.com/inward/record.url?scp=0026877143&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0026877143&partnerID=8YFLogxK

M3 - Chapter

AN - SCOPUS:0026877143

VL - 15

SP - 17

EP - 23

BT - Bulletin of Centre for Informatics, Waseda University

ER -