Corpus-based modeling of naturalness estimation in timing control for non-native speech

Makiko Muto, Yoshinori Sagisaka, Takuro Naito, Daiju Maeki, Aki Kondo, Katsuhiko Shirai

    Research output: Conference contribution

    5 Citations (Scopus)

    Abstract

    In this paper, aiming at automatic estimation of naturalness in the timing control of non-native speech, we analyzed the timing characteristics of non-native speech and correlated them with subjective naturalness scores given by native speakers. Through statistical analyses of English speech spoken by Japanese speakers, with temporal naturalness scores ranging from one to five given by native listeners, we found a high correlation between these scores and the differences from native speech. These analyses yielded a linear regression model in which naturalness in timing control is estimated from differences from native speech in the durations of whole sentences, individual content and function words, and pauses. The estimation accuracy of the proposed model was tested on open data. The root mean square error of 0.64 between the scores predicted by the model and those given by native listeners turned out to be comparable to the difference of 0.85 in scores among native listeners. The good correlation between model predictions and native judgments confirmed the appropriateness of the proposed model.
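The evaluation described in the abstract (fit a linear regression on duration differences, then report the root mean square error on held-out data) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the feature names, the use of absolute differences, the synthetic data, and the coefficients used to generate it are all assumptions made for the example, and the fit is plain ordinary least squares.

```python
import math
import random

# Hypothetical feature names: the abstract names four kinds of duration
# difference from native speech but does not define them precisely.
FEATURES = ["sentence", "content_words", "function_words", "pauses"]


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit(X, y):
    """Ordinary least squares via the normal equations; returns [intercept, w1, ...]."""
    rows = [[1.0] + list(r) for r in X]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(k)]
    return solve(xtx, xty)


def predict(w, X):
    """Apply the fitted weights to each feature row."""
    return [w[0] + sum(wi * xi for wi, xi in zip(w[1:], r)) for r in X]


def rmse(pred, target):
    """Root mean square error between predicted and reference scores."""
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(pred, target)) / len(target))


def make_synthetic(n, rng):
    """Made-up data: larger duration differences lower the 1-to-5 score."""
    X, y = [], []
    for _ in range(n):
        diffs = [abs(rng.gauss(0.0, 1.0)) for _ in FEATURES]
        score = (5.0 - 0.6 * diffs[0] - 0.5 * diffs[1]
                 - 0.3 * diffs[2] - 0.4 * diffs[3] + rng.gauss(0.0, 0.3))
        X.append(diffs)
        y.append(min(5.0, max(1.0, score)))  # clip to the rating scale
    return X, y


rng = random.Random(0)
X_train, y_train = make_synthetic(200, rng)
X_test, y_test = make_synthetic(50, rng)  # stands in for the "open" test data

weights = fit(X_train, y_train)
test_rmse = rmse(predict(weights, X_test), y_test)
print(f"held-out RMSE: {test_rmse:.2f}")
```

The held-out RMSE printed here reflects only the synthetic noise level and says nothing about the paper's reported 0.64; with real data, the rows of `X_train` would be measured duration differences and `y_train` the listeners' scores.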

    Original language: English
    Title of host publication: EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology
    Publisher: International Speech Communication Association
    Pages: 401-404
    Number of pages: 4
    Publication status: Published - 2003
    Event: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
    Duration: 2003 Sep 1 → 2003 Sep 4

    Other

    Other: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003
    Switzerland
    Geneva
    Period: 03/9/1 → 03/9/4

    ASJC Scopus subject areas

    • Computer Science Applications
    • Software
    • Linguistics and Language
    • Communication


  • Cite this

    Muto, M., Sagisaka, Y., Naito, T., Maeki, D., Kondo, A., & Shirai, K. (2003). Corpus-based modeling of naturalness estimation in timing control for non-native speech. In EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (pp. 401-404). International Speech Communication Association.