Vocal imitation using physical vocal tract model

Hisashi Kanda*, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

A vocal imitation system was developed using a computational model that supports the motor theory of speech perception. A critical problem in vocal imitation is how to generate speech sounds produced by adults, whose vocal tracts have physical properties (i.e., articulatory motions) differing from those of infants' vocal tracts. To solve this problem, a model based on the motor theory of speech perception, was constructed. This model suggests that infants simulate the speech generation by estimating their own articulatory motions in order to interpret the speech sounds of adults. Applying this model enables the vocal imitation system to estimate articulatory motions for unexperienced speech sounds that have not actually been generated by the system. The system was implemented by using Recurrent Neural Network with Parametric Bias (RNNPB) and a physical vocal tract model, called the Maeda model. Experimental results demonstrated that the system was sufficiently robust with respect to individual differences in speech sounds and could imitate unexperienced vowel sounds.

Original languageEnglish
Title of host publicationProceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007
Pages1846-1851
Number of pages6
DOIs
Publication statusPublished - 2007 Dec 1
Externally publishedYes
Event2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007 - San Diego, CA, United States
Duration: 2007 Oct 292007 Nov 2

Publication series

NameIEEE International Conference on Intelligent Robots and Systems

Conference

Conference2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007
Country/TerritoryUnited States
CitySan Diego, CA
Period07/10/2907/11/2

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Software
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Vocal imitation using physical vocal tract model'. Together they form a unique fingerprint.

Cite this