F0 control characterization by perceptual impressions on speaking attitudes using Multiple Dimensional Scaling analysis

Yoko Kokenawa, Minoru Tsuzaki, Hiroaki Kato, Yoshinori Sagisaka

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

Aiming at prosody control for speech synthesis expressing speaking attitudes, F0 shapes were characterized by their perceptual impressions. To directly correlate F0 shapes with perceptual impressions, single word utterances "n" extracted from daily conversations were employed. The analysis showed that speaking attitudes were manifested in the global F0 control of "n" as the differences of their average height (high-low) and dynamic patterns (rise, flat, fall and rise&fall). Next, controlled utterances of "n" were perceptually examined through Multiple Dimensional Scaling analysis to confirm F0 control freedoms found in the analysis. The result showed the three-dimensional structure of a perceptual impression space and factor dependent F0 control characteristics. The positive-negative attitude can be controlled by average F0 height while those of confident-doubtful or allowable -unacceptable are manifested through dynamic F0 patterns. These findings provide new possibilities of systematic F0 control for conversational speech synthesis with speaking attitudes using corpus-based approach.

Original languageEnglish
Title of host publication2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing
PublisherInstitute of Electrical and Electronics Engineers Inc.
PagesI273-I276
ISBN (Print)0780388747, 9780780388741
DOIs
Publication statusPublished - 2005 Jan 1
Event2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Philadelphia, PA, United States
Duration: 2005 Mar 182005 Mar 23

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
VolumeI
ISSN (Print)1520-6149

Conference

Conference2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
CountryUnited States
CityPhiladelphia, PA
Period05/3/1805/3/23

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'F0 control characterization by perceptual impressions on speaking attitudes using Multiple Dimensional Scaling analysis'. Together they form a unique fingerprint.

  • Cite this

    Kokenawa, Y., Tsuzaki, M., Kato, H., & Sagisaka, Y. (2005). F0 control characterization by perceptual impressions on speaking attitudes using Multiple Dimensional Scaling analysis. In 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing (pp. I273-I276). [1415103] (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. I). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICASSP.2005.1415103