Production of various vocal cord vibrations using a mechanical model for an anthropomorphic talking robot

Kotaro Fukui, Yuma Ishikawa, Eiji Shintaku, Masaaki Honda, Atsuo Takanishi

Research output: Contribution to journal > Article

Abstract

We developed a three-dimensional mechanical vocal cord model for Waseda Talker No. 7 (WT-7), an anthropomorphic talking robot, to generate speech sounds with various voice qualities. The vocal cord model is a cover model that has two thin folds made of thermoplastic material. The model self-oscillates, driven by airflow exhausted from the lung model, and generates the glottal sound source, which is fed into the vocal tract to generate the speech sound. Using the vocal cord model, breathy and creaky voices, as well as the modal (normal) voice, were produced in a manner similar to human laryngeal control. The breathy voice is characterized by a noisy component mixed with the periodic glottal sound source, and the creaky voice is characterized by an extremely low-pitch vibration. The breathy voice was produced by adjusting the glottal opening and generating turbulence noise with the airflow just above the glottis. The creaky voice was produced by adjusting the vocal cord tension, the sub-glottal pressure and the vibration mass so as to generate a double-pitch vibration with a long pitch interval. The vocal cord model used to produce these voice qualities was evaluated in terms of the vibration pattern measured by a high-speed camera, the glottal airflow and the acoustic characteristics of the glottal sound source, in comparison with human data.

Original language: English
Pages (from-to): 105-120
Number of pages: 16
Journal: Advanced Robotics
Volume: 26
Issue number: 1-2
DOIs: 10.1163/016918611X607392
Publication status: Published - 2012

Fingerprint

Anthropomorphic robots
Acoustic waves
High speed cameras
Thermoplastics
Turbulence

Keywords

  • Humanoid robot
  • speech production
  • talking robot
  • vocal cords
  • voice quality

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Human-Computer Interaction
  • Computer Science Applications
  • Hardware and Architecture
  • Software

Cite this

Production of various vocal cord vibrations using a mechanical model for an anthropomorphic talking robot. / Fukui, Kotaro; Ishikawa, Yuma; Shintaku, Eiji; Honda, Masaaki; Takanishi, Atsuo.

In: Advanced Robotics, Vol. 26, No. 1-2, 2012, p. 105-120.

@article{37a76fe9da3244b0b5a4e532083d5b85,
title = "Production of various vocal cord vibrations using a mechanical model for an anthropomorphic talking robot",
keywords = "Humanoid robot, speech production, talking robot, vocal cords, voice quality",
author = "Kotaro Fukui and Yuma Ishikawa and Eiji Shintaku and Masaaki Honda and Atsuo Takanishi",
year = "2012",
doi = "10.1163/016918611X607392",
language = "English",
volume = "26",
pages = "105--120",
journal = "Advanced Robotics",
issn = "0169-1864",
publisher = "Taylor and Francis Ltd.",
number = "1-2",

}

TY - JOUR

T1 - Production of various vocal cord vibrations using a mechanical model for an anthropomorphic talking robot

AU - Fukui, Kotaro

AU - Ishikawa, Yuma

AU - Shintaku, Eiji

AU - Honda, Masaaki

AU - Takanishi, Atsuo

PY - 2012

Y1 - 2012

KW - Humanoid robot

KW - speech production

KW - talking robot

KW - vocal cords

KW - voice quality

UR - http://www.scopus.com/inward/record.url?scp=84858197654&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84858197654&partnerID=8YFLogxK

U2 - 10.1163/016918611X607392

DO - 10.1163/016918611X607392

M3 - Article

AN - SCOPUS:84858197654

VL - 26

SP - 105

EP - 120

JO - Advanced Robotics

JF - Advanced Robotics

SN - 0169-1864

IS - 1-2

ER -