TY - GEN
T1 - Application of neural networks to articulatory motion estimation
AU - Kobayashi, Tetsunori
AU - Yagyu, Masayuki
AU - Shirai, Katsuhiko
PY - 1991/12/1
Y1 - 1991/12/1
N2 - The authors discuss an application of neural networks (NNs) to the problem of estimating the motion of articulatory organs from speech waves. A four-layer feedforward network was successfully applied to the articulatory parameter estimation problem. The evaluation test was performed using the vowel data in 5200 tokens in the ATR word database. Results show that the difference in estimated articulatory parameter values between the conventional model matching method (MM) and NN is only 0.1, which is about 3% of the value range, on average. For a few data, large differences arise between MM and NN, but this is due to misestimation in MM rather than NN. The percentage of misestimates in NN is less than 50% of that for MM. As for calculation time, NN is 10 times faster than MM. Thus, a high-speed and stable articulatory parameter estimation technique can be realized using neural networks.
AB - The authors discuss an application of neural networks (NNs) to the problem of estimating the motion of articulatory organs from speech waves. A four-layer feedforward network was successfully applied to the articulatory parameter estimation problem. The evaluation test was performed using the vowel data in 5200 tokens in the ATR word database. Results show that the difference in estimated articulatory parameter values between the conventional model matching method (MM) and NN is only 0.1, which is about 3% of the value range, on average. For a few data, large differences arise between MM and NN, but this is due to misestimation in MM rather than NN. The percentage of misestimates in NN is less than 50% of that for MM. As for calculation time, NN is 10 times faster than MM. Thus, a high-speed and stable articulatory parameter estimation technique can be realized using neural networks.
UR - http://www.scopus.com/inward/record.url?scp=0026396340&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0026396340&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:0026396340
SN - 078030033
T3 - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
SP - 489
EP - 492
BT - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
A2 - Anon, null
PB - Publ by IEEE
T2 - Proceedings of the 1991 International Conference on Acoustics, Speech, and Signal Processing - ICASSP 91
Y2 - 14 May 1991 through 17 May 1991
ER -