TY - JOUR
T1 - Speech planning of an anthropomorphic talking robot for consonant sounds production
AU - Nishikawa, Kazufumi
AU - Imai, Akihiro
AU - Ogawara, Takayuki
AU - Takanobu, Hideaki
AU - Mochida, Takemi
AU - Takanishi, Atsuo
PY - 2002/1/1
Y1 - 2002/1/1
N2 - This paper describes the speech planning of the anthropomorphic talking robot WT-1R (Waseda Talker-No.1 Refined) for the production of consonant sounds. WT-1R has articulators (the tongue, lips, teeth, nasal cavity and soft palate) and vocal organs (the lungs and vocal cords), and can reproduce human vocal movement. Its total DOF (degrees of freedom) is 15. The vocal movement of WT-1R for vowels is steady. We produced Japanese vowels (/a/, /i/, /u/, /e/, /o/) using the first robot WT-1 in 2000. However, the vocal movement for consonant sounds is transient. We must control the 15-DOF talking robot coordinately in the space and time to reproduce the complicated phenomena of the consonant sounds. Therefore, because the Japanese voice generally consists of two phonemes of the first consonant sound and the last vowel, we proposed the speech planning of WT-1R by considering the phenomenon of the voice as three parts (steady consonant sound, transient consonant sound and vowel). WT-1R could produce Japanese vowels (/a/, /i/, /u/, /e/, /o/) and some consonant sounds (/s/, /h/, /m/, /p/ and /waseda/).
AB - This paper describes the speech planning of the anthropomorphic talking robot WT-1R (Waseda Talker-No.1 Refined) for the production of consonant sounds. WT-1R has articulators (the tongue, lips, teeth, nasal cavity and soft palate) and vocal organs (the lungs and vocal cords), and can reproduce human vocal movement. Its total DOF (degrees of freedom) is 15. The vocal movement of WT-1R for vowels is steady. We produced Japanese vowels (/a/, /i/, /u/, /e/, /o/) using the first robot WT-1 in 2000. However, the vocal movement for consonant sounds is transient. We must control the 15-DOF talking robot coordinately in the space and time to reproduce the complicated phenomena of the consonant sounds. Therefore, because the Japanese voice generally consists of two phonemes of the first consonant sound and the last vowel, we proposed the speech planning of WT-1R by considering the phenomenon of the voice as three parts (steady consonant sound, transient consonant sound and vowel). WT-1R could produce Japanese vowels (/a/, /i/, /u/, /e/, /o/) and some consonant sounds (/s/, /h/, /m/, /p/ and /waseda/).
KW - Humanoid robot
KW - Sound
KW - Speech production
KW - Vocal movement
KW - Voice
UR - http://www.scopus.com/inward/record.url?scp=0036059704&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0036059704&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:0036059704
VL - 2
SP - 1830
EP - 1835
JO - Proceedings - IEEE International Conference on Robotics and Automation
JF - Proceedings - IEEE International Conference on Robotics and Automation
SN - 1050-4729
T2 - 2002 IEEE International Conference on Robotics and Automation
Y2 - 11 May 2002 through 15 May 2002
ER -