Mimicry of human speech sounds using an anthropomorphic talking robot by auditory feedback

Kazufumi Nishikawa, Toshiharu Kuwae, Hideaki Takanobu, Takemi Mochida, Masaaki Honda, Atsuo Takanishi

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Citations (Scopus)

Abstract

This paper describes an autonomous control method that enables the anthropomorphic talking robot WT-4 (Waseda Talker No. 4) to mimic continuous human speech sounds by auditory feedback. WT-4 consisted of 1-DOF lungs, 4-DOF vocal cords, and articulators (a 7-DOF tongue, 5-DOF lips, 1-DOF teeth, a nasal cavity, and a 1-DOF soft palate), for a total of 19 DOF, and could reproduce human-like articulatory motion. In this method, the trajectory of each robot parameter was controlled so that the acoustic parameters of the sound generated by the robot were close to those of human speech. The acoustic parameters were pitch, sound power, formant frequencies (the resonant frequencies of the vocal tract, which appear as peaks in the output spectrum), and the timing of the switch between voiced and voiceless sounds. The trajectory of each robot parameter was optimized using the acoustic parameters as input. This method will help to clarify the human speech mechanism and to create a new speech production system.
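The feedback idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not say which optimizer was used, so a simple coordinate search stands in for it, and `toy_robot`, its linear coefficients, and the three control parameters are hypothetical placeholders for the real robot and its acoustic analysis. Only the DOF counts in `WT4_DOF` come from the abstract.

```python
from typing import Callable, List, Sequence, Tuple

# DOF counts as listed in the abstract (the nasal cavity has no listed DOF).
WT4_DOF = {"lungs": 1, "vocal_cords": 4, "tongue": 7, "lips": 5,
           "teeth": 1, "nasal_cavity": 0, "soft_palate": 1}
TOTAL_DOF = sum(WT4_DOF.values())  # 1 + 4 + 7 + 5 + 1 + 0 + 1 = 19


def acoustic_error(produced: Sequence[float], target: Sequence[float]) -> float:
    """Squared distance between produced and target acoustic parameters."""
    return sum((p - t) ** 2 for p, t in zip(produced, target))


def mimic(forward: Callable[[List[float]], List[float]],
          target: Sequence[float],
          params: Sequence[float],
          step: float = 0.5,
          iters: int = 200) -> Tuple[List[float], float]:
    """Coordinate search (an assumed stand-in optimizer).

    Nudge one control parameter at a time and keep the change only if the
    resulting sound is closer to the target; halve the step when stuck.
    """
    current = list(params)
    best = acoustic_error(forward(current), target)
    for _ in range(iters):
        improved = False
        for i in range(len(current)):
            for delta in (step, -step):
                trial = current[:]
                trial[i] += delta
                err = acoustic_error(forward(trial), target)
                if err < best:
                    current, best = trial, err
                    improved = True
                    break
        if not improved:
            step /= 2
    return current, best


def toy_robot(params: List[float]) -> List[float]:
    """Hypothetical forward model: control parameters -> [pitch, power, F1]."""
    lung, cord, tongue = params
    pitch = 100.0 + 20.0 * cord      # made-up coefficients, illustration only
    power = 10.0 * lung
    formant1 = 500.0 + 50.0 * tongue
    return [pitch, power, formant1]


# Target acoustics "heard" from a reference utterance (here: synthetic).
target = toy_robot([2.0, 1.5, -1.0])
found, err = mimic(toy_robot, target, [0.0, 0.0, 0.0])
```

In the real system each control parameter would be a trajectory over time rather than a single number, and the forward model would be the physical robot plus acoustic analysis of its recorded output.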

Original language: English
Title of host publication: 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Pages: 272-278
Number of pages: 7
Volume: 1
Publication status: Published - 2004
Event: 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) - Sendai
Duration: 2004 Sep 28 - 2004 Oct 2

Fingerprint

Anthropomorphic robots
Acoustic waves
Feedback
Robots
Acoustics
Trajectories
Natural frequencies
Switches

Keywords

  • Auditory Feedback
  • Autonomous Control
  • Humanoid Robot
  • Speech Production

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Nishikawa, K., Kuwae, T., Takanobu, H., Mochida, T., Honda, M., & Takanishi, A. (2004). Mimicry of human speech sounds using an anthropomorphic talking robot by auditory feedback. In 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (Vol. 1, pp. 272-278). [TA1-I5]

@inproceedings{201bbd0fde074e3a88b33f36ebfd9dcd,
title = "Mimicry of human speech sounds using an anthropomorphic talking robot by auditory feedback",
keywords = "Auditory Feedback, Autonomous Control, Humanoid Robot, Speech Production",
author = "Kazufumi Nishikawa and Toshiharu Kuwae and Hideaki Takanobu and Takemi Mochida and Masaaki Honda and Atsuo Takanishi",
year = "2004",
language = "English",
isbn = "0780384636",
volume = "1",
pages = "272--278",
booktitle = "2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)",

}

TY - GEN

T1 - Mimicry of human speech sounds using an anthropomorphic talking robot by auditory feedback

AU - Nishikawa, Kazufumi

AU - Kuwae, Toshiharu

AU - Takanobu, Hideaki

AU - Mochida, Takemi

AU - Honda, Masaaki

AU - Takanishi, Atsuo

PY - 2004

Y1 - 2004

KW - Auditory Feedback

KW - Autonomous Control

KW - Humanoid Robot

KW - Speech Production

UR - http://www.scopus.com/inward/record.url?scp=14044268891&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=14044268891&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:14044268891

SN - 0780384636

SN - 9780780384637

VL - 1

SP - 272

EP - 278

BT - 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

ER -