Global F0 control parameter prediction based on impressions for communicative prosody generation

Lu Shao, Yoko Greenberg, Yoshinori Sagisaka

研究成果: Conference contribution

5 引用 (Scopus)

抜粋

Aiming at communicative speech synthesis, prosody control using impressions has been proposed by applying the correlation between impressions of input lexicons and prosody. In this paper, as the first step to compute communicative prosody, we attempt to predict the F0 generation model parameters by estimating the impressions of input sentence from its constituent lexicons. To obtain an impression vector consisting of three dimensional factors (positive-negative, confident-doubtful and allowable-unacceptable) for a given input utterance, we proposed a computational scheme to calculate impression vectors using impression scores of constituent words. Using obtained sentence impression vectors, F0 control parameters are predicted by applying three-layered feed-forward neural networks. To evaluate the effectiveness of the proposed computational framework, we experimentally confirmed that F0 parameters of communicative speech could be generated from the impressions of input lexicons.

元の言語English
ホスト出版物のタイトル2013 International Conference Oriental COCOSDA Held Jointly with 2013 Conference on Asian Spoken Language Research and Evaluation, O-COCOSDA/CASLRE 2013
DOI
出版物ステータスPublished - 2013 12 1
イベント2013 International Conference Oriental COCOSDA Held Jointly with 2013 Conference on Asian Spoken Language Research and Evaluation, O-COCOSDA/CASLRE 2013 - Gurgaon, India
継続期間: 2013 11 252013 11 27

出版物シリーズ

名前2013 International Conference Oriental COCOSDA Held Jointly with 2013 Conference on Asian Spoken Language Research and Evaluation, O-COCOSDA/CASLRE 2013

Conference

Conference2013 International Conference Oriental COCOSDA Held Jointly with 2013 Conference on Asian Spoken Language Research and Evaluation, O-COCOSDA/CASLRE 2013
India
Gurgaon
期間13/11/2513/11/27

ASJC Scopus subject areas

  • Software

フィンガープリント Global F0 control parameter prediction based on impressions for communicative prosody generation' の研究トピックを掘り下げます。これらはともに一意のフィンガープリントを構成します。

  • これを引用

    Shao, L., Greenberg, Y., & Sagisaka, Y. (2013). Global F0 control parameter prediction based on impressions for communicative prosody generation. : 2013 International Conference Oriental COCOSDA Held Jointly with 2013 Conference on Asian Spoken Language Research and Evaluation, O-COCOSDA/CASLRE 2013 [6709871] (2013 International Conference Oriental COCOSDA Held Jointly with 2013 Conference on Asian Spoken Language Research and Evaluation, O-COCOSDA/CASLRE 2013). https://doi.org/10.1109/ICSDA.2013.6709871