Mechanical reproduction of human-like expressive speech using a talking robot

Mitsuki Kitani, Hideyuki Sawada

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

The authors have developed a talking robot by reconstructing a human vocal system mechanically based on the physical model of human vocal organs. The robotic system consists of motor-controlled vocal organs such as vocal cords, a vocal tract and a nasal cavity to generate a natural voice imitating a human vocalization. By applying the technique of the mechanical construction and its adaptive control, the robot is able to autonomously reproduce a human-like vocal articulation using its vocal organs. In vocalization, the vibration of vocal cords generates a source sound, and then the sound wave is led to a vocal tract, which works as a resonance filter to determine the spectrum envelope. For the autonomous acquisition of the robot's vocalization skills, an adaptive learning using an auditory feedback control is introduced. In this study, a human-like expressive speech production by the talking robot is introduced. A human generates speech by controlling their own vocal organs for changing the speech expressions such as the volume and the intonations. To realize the human-like expressive speech production, a control method for the mechanical organs has to be established. In the first part of the paper, the construction of the talking robot and the acquisition of the vocalization will be described, and then the control method for producing human-like speech with various expressions will be introduced.

Original language: English
Title of host publication: Proceedings - 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013
Pages: 229-234
Number of pages: 6
DOIs: https://doi.org/10.1109/ICBAKE.2013.45
Publication status: Published - 2013
Externally published: Yes
Event: 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013 - Tokyo, Japan
Duration: 2013 Jul 5 to 2013 Jul 7

Other

Other: 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013
Country: Japan
City: Tokyo
Period: 13/7/5 to 13/7/7

Fingerprint

Reproduction
Vocal Cords
Nasal Cavity
Robotics
Vibration
Learning

Keywords

  • Auditory feedback
  • Human vocal system
  • Human-like expressions
  • Self-Organizing Map
  • Talking robot

ASJC Scopus subject areas

  • Biotechnology

Cite this

Kitani, M., & Sawada, H. (2013). Mechanical reproduction of human-like expressive speech using a talking robot. In Proceedings - 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013 (pp. 229-234). [6603507] https://doi.org/10.1109/ICBAKE.2013.45

Mechanical reproduction of human-like expressive speech using a talking robot. / Kitani, Mitsuki; Sawada, Hideyuki.

Proceedings - 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013. 2013. p. 229-234 6603507.

Kitani, M & Sawada, H 2013, Mechanical reproduction of human-like expressive speech using a talking robot. in Proceedings - 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013., 6603507, pp. 229-234, 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013, Tokyo, Japan, 13/7/5. https://doi.org/10.1109/ICBAKE.2013.45
Kitani M, Sawada H. Mechanical reproduction of human-like expressive speech using a talking robot. In Proceedings - 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013. 2013. p. 229-234. 6603507 https://doi.org/10.1109/ICBAKE.2013.45
Kitani, Mitsuki ; Sawada, Hideyuki. / Mechanical reproduction of human-like expressive speech using a talking robot. Proceedings - 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013. 2013. pp. 229-234
@inproceedings{2ce9db1ad6a34b53a2f065962a3f2707,
title = "Mechanical reproduction of human-like expressive speech using a talking robot",
abstract = "The authors have developed a talking robot by reconstructing a human vocal system mechanically based on the physical model of human vocal organs. The robotic system consists of motor-controlled vocal organs such as vocal cords, a vocal tract and a nasal cavity to generate a natural voice imitating a human vocalization. By applying the technique of the mechanical construction and its adaptive control, the robot is able to autonomously reproduce a human-like vocal articulation using its vocal organs. In vocalization, the vibration of vocal cords generates a source sound, and then the sound wave is led to a vocal tract, which works as a resonance filter to determine the spectrum envelope. For the autonomous acquisition of the robot's vocalization skills, an adaptive learning using an auditory feedback control is introduced. In this study, a human-like expressive speech production by the talking robot is introduced. A human generates speech by controlling their own vocal organs for changing the speech expressions such as the volume and the intonations. To realize the human-like expressive speech production, a control method for the mechanical organs has to be established. In the first part of the paper, the construction of the talking robot and the acquisition of the vocalization will be described, and then the control method for producing human-like speech with various expressions will be introduced.",
keywords = "Auditory feedback, Human vocal system, Human-like expressions, Self-Organizing Map, Talking robot",
author = "Mitsuki Kitani and Hideyuki Sawada",
year = "2013",
doi = "10.1109/ICBAKE.2013.45",
language = "English",
isbn = "9780769550190",
pages = "229--234",
booktitle = "Proceedings - 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013",
}

TY  - GEN
T1  - Mechanical reproduction of human-like expressive speech using a talking robot
AU  - Kitani, Mitsuki
AU  - Sawada, Hideyuki
PY  - 2013
Y1  - 2013
AB - The authors have developed a talking robot by reconstructing a human vocal system mechanically based on the physical model of human vocal organs. The robotic system consists of motor-controlled vocal organs such as vocal cords, a vocal tract and a nasal cavity to generate a natural voice imitating a human vocalization. By applying the technique of the mechanical construction and its adaptive control, the robot is able to autonomously reproduce a human-like vocal articulation using its vocal organs. In vocalization, the vibration of vocal cords generates a source sound, and then the sound wave is led to a vocal tract, which works as a resonance filter to determine the spectrum envelope. For the autonomous acquisition of the robot's vocalization skills, an adaptive learning using an auditory feedback control is introduced. In this study, a human-like expressive speech production by the talking robot is introduced. A human generates speech by controlling their own vocal organs for changing the speech expressions such as the volume and the intonations. To realize the human-like expressive speech production, a control method for the mechanical organs has to be established. In the first part of the paper, the construction of the talking robot and the acquisition of the vocalization will be described, and then the control method for producing human-like speech with various expressions will be introduced.

KW  - Auditory feedback
KW  - Human vocal system
KW  - Human-like expressions
KW  - Self-Organizing Map
KW  - Talking robot
UR  - http://www.scopus.com/inward/record.url?scp=84885400345&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=84885400345&partnerID=8YFLogxK
U2  - 10.1109/ICBAKE.2013.45
DO  - 10.1109/ICBAKE.2013.45
M3  - Conference contribution
AN  - SCOPUS:84885400345
SN  - 9780769550190
SP  - 229
EP  - 234
BT  - Proceedings - 2013 International Conference on Biometrics and Kansei Engineering, ICBAKE 2013
ER  -