Expressing speaker's intentions through sentence-final intonations for Japanese conversational speech synthesis

研究成果: Conference contribution

2 引用 (Scopus)

抜粋

In this study, we investigated speaker's intentions that the listeners perceive through subtly different sentence-final intonations. Approximately 2,000 sentence utterances were recorded and the fundamental frequency (F0) contours at the last vowel of those sentences were classified through one of the standard clustering algorithms. There found various F0 contours, namely, not only simple rising and falling intonations but also rise-fall and fall-rise intonations. In order to reveal the relationship between the intonation and the intentions, 10 representative contours were selected on the basis of the results of the clustering. Using the selected contours, a subjective evaluation was conducted. Six Japanese sentences that could have different meanings according to the sentence-final intonations were synthesized and the F0 contour at the last vowel of each sentence was replaced with the contours. The results of the evaluation by nine listeners showed that, for example, a certain falling intonation could express the intention of the "conviction" and another one that slightly differ in the shape could convey "doubt." It was found that the subtle difference in the sentence-final F0 shape conveyed various nuances and connotations.

元の言語English
ホスト出版物のタイトル13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
ページ442-445
ページ数4
出版物ステータスPublished - 2012 12 1
イベント13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 - Portland, OR, United States
継続期間: 2012 9 92012 9 13

出版物シリーズ

名前13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
1

Conference

Conference13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
United States
Portland, OR
期間12/9/912/9/13

    フィンガープリント

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Communication

これを引用

Iwata, K., & Kobayashi, T. (2012). Expressing speaker's intentions through sentence-final intonations for Japanese conversational speech synthesis. : 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 (pp. 442-445). (13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012; 巻数 1).