Expressing speaker's intentions through sentence-final intonations for Japanese conversational speech synthesis

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

In this study, we investigated speaker's intentions that the listeners perceive through subtly different sentence-final intonations. Approximately 2,000 sentence utterances were recorded and the fundamental frequency (F0) contours at the last vowel of those sentences were classified through one of the standard clustering algorithms. There found various F0 contours, namely, not only simple rising and falling intonations but also rise-fall and fall-rise intonations. In order to reveal the relationship between the intonation and the intentions, 10 representative contours were selected on the basis of the results of the clustering. Using the selected contours, a subjective evaluation was conducted. Six Japanese sentences that could have different meanings according to the sentence-final intonations were synthesized and the F0 contour at the last vowel of each sentence was replaced with the contours. The results of the evaluation by nine listeners showed that, for example, a certain falling intonation could express the intention of the "conviction" and another one that slightly differ in the shape could convey "doubt." It was found that the subtle difference in the sentence-final F0 shape conveyed various nuances and connotations.

Original languageEnglish
Title of host publication13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Pages442-445
Number of pages4
Publication statusPublished - 2012 Dec 1
Event13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 - Portland, OR, United States
Duration: 2012 Sep 92012 Sep 13

Publication series

Name13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Volume1

Conference

Conference13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
CountryUnited States
CityPortland, OR
Period12/9/912/9/13

Keywords

  • Sentence-final intonation
  • Speaker's intention
  • Speech synthesis

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Communication

Fingerprint Dive into the research topics of 'Expressing speaker's intentions through sentence-final intonations for Japanese conversational speech synthesis'. Together they form a unique fingerprint.

  • Cite this

    Iwata, K., & Kobayashi, T. (2012). Expressing speaker's intentions through sentence-final intonations for Japanese conversational speech synthesis. In 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 (pp. 442-445). (13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012; Vol. 1).