Generation and perception of F0 markedness for communicative speech synthesis

Yoshinori Sagisaka*, Takumi Yamashita, Yoko Kokenawa

*Corresponding author for this work

Research output: Article, peer-reviewed

21 Citations (Scopus)

Abstract

Aiming at natural F0 control for conversational speech synthesis using attributes of constituent output words, we analyze F0 characteristics from both generation and perception viewpoints. We recorded commonly used two-phrase utterances consisting of Japanese adjective and adverb phrases expressing different degrees of markedness under designed conversational situations, and compared their F0 characteristics. The comparison showed consistent F0 control dependencies not only on the adverbs themselves but also on the attribute of the following adjective phrases. A strong positive or negative correlation is observed between the markedness of an adverb and F0 height when the adverb phrase is followed by an adjective phrase expressing positiveness or negativeness. These consistencies were perceptually confirmed by naturalness evaluation tests using the same two-phrase samples with different F0 heights. Finally, a computational model of conversational F0 control is proposed that uses lexical information: the positiveness or negativeness of adjectives and the markedness of adverbs. F0 estimation experiments quantitatively showed the possibility of F0 control for natural conversational speech synthesis using the attributes of constituent output words.

Original language: English
Pages (from-to): 376-384
Number of pages: 9
Journal: Speech Communication
Volume: 46
Issue number: 3-4
DOI
Publication status: Published - Jul 2005

ASJC Scopus subject areas

  • Software
  • Modeling and Simulation
  • Communication
  • Language and Linguistics
  • Linguistics and Language
  • Computer Vision and Pattern Recognition
  • Computer Science Applications
