Phoneme acquisition model based on vowel imitation using recurrent neural network

Hisashi Kanda*, Tetsuya Ogata, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno

*この研究の対応する著者

研究成果: Conference contribution

5 被引用数 (Scopus)

抄録

A phoneme-acquisition system was developed using a computational model that explains the developmental process of human infants in the early period of acquiring language. There are two important findings in constructing an infant's acquisition of phonemes: (1) an infant's vowel like cooing tends to invoke utterances that are imitated by its caregiver, and (2) maternal imitation effectively reinforces infant vocalization. Therefore, we hypothesized that infants can acquire phonemes to imitate their caregivers' voices by trial and error, i. e., infants use self-vocalization experience to search for imitable and unimitable elements in their caregivers' voices. On the basis of this hypothesis, we constructed a phoneme acquisition process using interaction involving vowel imitation between a human and an infant model. Our infant model had a vocal tract system, called the Maeda model, and an auditory system implemented by using Mel-Frequency Cepstral Coefficients (MFCCs) through STRAIGHT analysis. We applied Recurrent Neural Network with Parametric Bias (RNNPB) to learn the experience of self-vocalization, to recognize the human voice, and to produce the sound imitated by the infant model. To evaluate imitable and unimitable sounds, we used the prediction error of the RNNPB model. The experimental results revealed that as imitation interactions were repeated, the formants of sounds imitated by our system moved closer to those of human voices, and our system could self-organize the same vowels in different continuous sounds. This suggests that our system can reflect the process of phoneme acquisition.

本文言語English
ホスト出版物のタイトル2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2009
ページ5388-5393
ページ数6
DOI
出版ステータスPublished - 2009 12 11
外部発表はい
イベント2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2009 - St. Louis, MO, United States
継続期間: 2009 10 112009 10 15

出版物シリーズ

名前2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2009

Conference

Conference2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2009
国/地域United States
CitySt. Louis, MO
Period09/10/1109/10/15

ASJC Scopus subject areas

  • 人工知能
  • コンピュータ ビジョンおよびパターン認識
  • 人間とコンピュータの相互作用
  • 制御およびシステム工学

フィンガープリント

「Phoneme acquisition model based on vowel imitation using recurrent neural network」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル