F0 control characterization by perceptual impressions on speaking attitudes using Multiple Dimensional Scaling analysis

Yoko Kokenawa*, Minoru Tsuzaki, Hiroaki Kato, Yoshinori Sagisaka

*この研究の対応する著者

研究成果

10 被引用数 (Scopus)

抄録

Aiming at prosody control for speech synthesis expressing speaking attitudes, F0 shapes were characterized by their perceptual impressions. To directly correlate F0 shapes with perceptual impressions, single word utterances "n" extracted from daily conversations were employed. The analysis showed that speaking attitudes were manifested in the global F0 control of "n" as the differences of their average height (high-low) and dynamic patterns (rise, flat, fall and rise&fall). Next, controlled utterances of "n" were perceptually examined through Multiple Dimensional Scaling analysis to confirm F0 control freedoms found in the analysis. The result showed the three-dimensional structure of a perceptual impression space and factor dependent F0 control characteristics. The positive-negative attitude can be controlled by average F0 height while those of confident-doubtful or allowable -unacceptable are manifested through dynamic F0 patterns. These findings provide new possibilities of systematic F0 control for conversational speech synthesis with speaking attitudes using corpus-based approach.

本文言語English
ホスト出版物のタイトル2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing
出版社Institute of Electrical and Electronics Engineers Inc.
ページI273-I276
ISBN(印刷版)0780388747, 9780780388741
DOI
出版ステータスPublished - 2005 1 1
イベント2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Philadelphia, PA, United States
継続期間: 2005 3 182005 3 23

出版物シリーズ

名前ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
I
ISSN(印刷版)1520-6149

Conference

Conference2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
国/地域United States
CityPhiladelphia, PA
Period05/3/1805/3/23

ASJC Scopus subject areas

  • ソフトウェア
  • 信号処理
  • 電子工学および電気工学

フィンガープリント

「F0 control characterization by perceptual impressions on speaking attitudes using Multiple Dimensional Scaling analysis」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル