Statistical properties of the fluctuation of pitch intervals are investigated, and pitch generation models considering fluctuation are discussed. Experimental results of natural speech analysis show that the distribution of pitch fluctuation can be approximated by shifted gamma distribution and that the correlation coefficients of 0th-5th and 30th-60th order show strong positive values. Several pitch generation models dealing with fluctuation are tested with the aim of realizing natural synthetic speech. The results of perceptual experiments recommend the fluctuation model using a 15th-order autoregressive filter excited by a uniform random number. The quality of the synthetic speech using the above fluctuation model is comparable to that of speech with the original fluctuation.
|ジャーナル||ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings|
|出版ステータス||Published - 1990 12月 1|
|イベント||1990 International Conference on Acoustics, Speech, and Signal Processing: Speech Processing 2, VLSI, Audio and Electroacoustics Part 2 (of 5) - Albuquerque, New Mexico, USA|
継続期間: 1990 4月 3 → 1990 4月 6
ASJC Scopus subject areas