TY - JOUR
T1 - Estimation of Articulatory Movements From Speech Acoustics Using an HMM-Based Speech Production Model
AU - Hiroya, Sadao
AU - Honda, Masaaki
N1 - Funding Information:
Manuscript received August 16, 2002; revised August 23, 2003. This work was supported in part by CREST of the Japan Science and Technology Corporation. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. S. S. Narayanan.
PY - 2004/3
Y1 - 2004/3
N2 - We present a method that determines articulatory movements from speech acoustics using a Hidden Markov Model (HMM)-based speech production model. The model statistically generates speech spectrum and articulatory parameters from a given phonemic string. It consists of HMMs of articulatory parameters for each phoneme and an articulatory-to-acoustic mapping for each HMM state. For a given speech spectrum, maximum a posteriori estimation of the articulatory parameters of the statistical model is presented. The performance on sentences was evaluated by comparing the estimated articulatory parameters with the observed parameters. The average RMS errors of the estimated articulatory parameters were 1.50 mm from the speech acoustics and the phonemic information in an utterance and 1.73 mm from the speech acoustics only.
AB - We present a method that determines articulatory movements from speech acoustics using a Hidden Markov Model (HMM)-based speech production model. The model statistically generates speech spectrum and articulatory parameters from a given phonemic string. It consists of HMMs of articulatory parameters for each phoneme and an articulatory-to-acoustic mapping for each HMM state. For a given speech spectrum, maximum a posteriori estimation of the articulatory parameters of the statistical model is presented. The performance on sentences was evaluated by comparing the estimated articulatory parameters with the observed parameters. The average RMS errors of the estimated articulatory parameters were 1.50 mm from the speech acoustics and the phonemic information in an utterance and 1.73 mm from the speech acoustics only.
KW - Articulatory HMM
KW - Articulatory-to-acoustic mapping
KW - HMM-based speech production model
KW - Speech inversion
UR - http://www.scopus.com/inward/record.url?scp=2142659020&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=2142659020&partnerID=8YFLogxK
U2 - 10.1109/TSA.2003.822636
DO - 10.1109/TSA.2003.822636
M3 - Article
AN - SCOPUS:2142659020
SN - 1063-6676
VL - 12
SP - 175
EP - 185
JO - IEEE Transactions on Speech and Audio Processing
JF - IEEE Transactions on Speech and Audio Processing
IS - 2
ER -