Speech production of an advanced talking robot based on human acoustic theory

Kazufumi Nishikawa, Hideaki Takanobu, Takemi Mochida, Masaaki Honda, Atsuo Takanishi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

This paper describes the mechanisms and the speech production of a new advanced talking robot WT-3 (Waseda Talker-No.3) that improved on WT-2 (Waseda Talker-No.2) and based on human acoustic theory for the reproduction of human speech. WT-3 consists of 1-DOF lungs and 3-DOF vocal cords and articulators (the 7-DOF tongue, 5-DOF lips, 1-DOF teeth, nasal cavity and 1-DOF soft palate), and could reproduce human-like articulatory motion; the total DOF was 18. The oral cavity was designed based on the MRI images of a human sagittal plane, although the cross section of the vocal tract was rectangular in shape except for the mouth. The width of the vocal tract was 30 [mm]. The average length of the vocal tract was approximately 175 [mm] and the same as that of a human's. Compared to the previous robots, WT-3 could produce vowels more clearly, and produce stops, fricatives and nasal sounds with the new flexible mechanisms that functioned as the human vocal tract area and the other mechanisms. WT-3 could mechanically reproduce human speech.

Original languageEnglish
Title of host publicationProceedings - IEEE International Conference on Robotics and Automation
Pages3213-3219
Number of pages7
Volume2004
Edition4
Publication statusPublished - 2004
EventProceedings- 2004 IEEE International Conference on Robotics and Automation - New Orleans, LA, United States
Duration: 2004 Apr 262004 May 1

Other

OtherProceedings- 2004 IEEE International Conference on Robotics and Automation
CountryUnited States
CityNew Orleans, LA
Period04/4/2604/5/1

Fingerprint

Acoustics
Robots
Magnetic resonance imaging
Acoustic waves

ASJC Scopus subject areas

  • Software
  • Control and Systems Engineering

Cite this

Nishikawa, K., Takanobu, H., Mochida, T., Honda, M., & Takanishi, A. (2004). Speech production of an advanced talking robot based on human acoustic theory. In Proceedings - IEEE International Conference on Robotics and Automation (4 ed., Vol. 2004, pp. 3213-3219)

Speech production of an advanced talking robot based on human acoustic theory. / Nishikawa, Kazufumi; Takanobu, Hideaki; Mochida, Takemi; Honda, Masaaki; Takanishi, Atsuo.

Proceedings - IEEE International Conference on Robotics and Automation. Vol. 2004 4. ed. 2004. p. 3213-3219.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Nishikawa, K, Takanobu, H, Mochida, T, Honda, M & Takanishi, A 2004, Speech production of an advanced talking robot based on human acoustic theory. in Proceedings - IEEE International Conference on Robotics and Automation. 4 edn, vol. 2004, pp. 3213-3219, Proceedings- 2004 IEEE International Conference on Robotics and Automation, New Orleans, LA, United States, 04/4/26.
Nishikawa K, Takanobu H, Mochida T, Honda M, Takanishi A. Speech production of an advanced talking robot based on human acoustic theory. In Proceedings - IEEE International Conference on Robotics and Automation. 4 ed. Vol. 2004. 2004. p. 3213-3219
Nishikawa, Kazufumi ; Takanobu, Hideaki ; Mochida, Takemi ; Honda, Masaaki ; Takanishi, Atsuo. / Speech production of an advanced talking robot based on human acoustic theory. Proceedings - IEEE International Conference on Robotics and Automation. Vol. 2004 4. ed. 2004. pp. 3213-3219
@inproceedings{a4509155b1a7489fbfa82d09c0ceedb5,
title = "Speech production of an advanced talking robot based on human acoustic theory",
abstract = "This paper describes the mechanisms and the speech production of a new advanced talking robot WT-3 (Waseda Talker-No.3) that improved on WT-2 (Waseda Talker-No.2) and based on human acoustic theory for the reproduction of human speech. WT-3 consists of 1-DOF lungs and 3-DOF vocal cords and articulators (the 7-DOF tongue, 5-DOF lips, 1-DOF teeth, nasal cavity and 1-DOF soft palate), and could reproduce human-like articulatory motion; the total DOF was 18. The oral cavity was designed based on the MRI images of a human sagittal plane, although the cross section of the vocal tract was rectangular in shape except for the mouth. The width of the vocal tract was 30 [mm]. The average length of the vocal tract was approximately 175 [mm] and the same as that of a human's. Compared to the previous robots, WT-3 could produce vowels more clearly, and produce stops, fricatives and nasal sounds with the new flexible mechanisms that functioned as the human vocal tract area and the other mechanisms. WT-3 could mechanically reproduce human speech.",
author = "Kazufumi Nishikawa and Hideaki Takanobu and Takemi Mochida and Masaaki Honda and Atsuo Takanishi",
year = "2004",
language = "English",
volume = "2004",
pages = "3213--3219",
booktitle = "Proceedings - IEEE International Conference on Robotics and Automation",
edition = "4",

}

TY - GEN

T1 - Speech production of an advanced talking robot based on human acoustic theory

AU - Nishikawa, Kazufumi

AU - Takanobu, Hideaki

AU - Mochida, Takemi

AU - Honda, Masaaki

AU - Takanishi, Atsuo

PY - 2004

Y1 - 2004

N2 - This paper describes the mechanisms and the speech production of a new advanced talking robot WT-3 (Waseda Talker-No.3) that improved on WT-2 (Waseda Talker-No.2) and based on human acoustic theory for the reproduction of human speech. WT-3 consists of 1-DOF lungs and 3-DOF vocal cords and articulators (the 7-DOF tongue, 5-DOF lips, 1-DOF teeth, nasal cavity and 1-DOF soft palate), and could reproduce human-like articulatory motion; the total DOF was 18. The oral cavity was designed based on the MRI images of a human sagittal plane, although the cross section of the vocal tract was rectangular in shape except for the mouth. The width of the vocal tract was 30 [mm]. The average length of the vocal tract was approximately 175 [mm] and the same as that of a human's. Compared to the previous robots, WT-3 could produce vowels more clearly, and produce stops, fricatives and nasal sounds with the new flexible mechanisms that functioned as the human vocal tract area and the other mechanisms. WT-3 could mechanically reproduce human speech.

AB - This paper describes the mechanisms and the speech production of a new advanced talking robot WT-3 (Waseda Talker-No.3) that improved on WT-2 (Waseda Talker-No.2) and based on human acoustic theory for the reproduction of human speech. WT-3 consists of 1-DOF lungs and 3-DOF vocal cords and articulators (the 7-DOF tongue, 5-DOF lips, 1-DOF teeth, nasal cavity and 1-DOF soft palate), and could reproduce human-like articulatory motion; the total DOF was 18. The oral cavity was designed based on the MRI images of a human sagittal plane, although the cross section of the vocal tract was rectangular in shape except for the mouth. The width of the vocal tract was 30 [mm]. The average length of the vocal tract was approximately 175 [mm] and the same as that of a human's. Compared to the previous robots, WT-3 could produce vowels more clearly, and produce stops, fricatives and nasal sounds with the new flexible mechanisms that functioned as the human vocal tract area and the other mechanisms. WT-3 could mechanically reproduce human speech.

UR - http://www.scopus.com/inward/record.url?scp=3042647369&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=3042647369&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:3042647369

VL - 2004

SP - 3213

EP - 3219

BT - Proceedings - IEEE International Conference on Robotics and Automation

ER -