Statistical properties of fluctuation of pitch intervals and its modeling for natural synthetic speech

Tetsunori Kobayashi, Hidetoshi Sekine

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Statistical properties of the fluctuation of pitch intervals are investigated, and pitch generation models considering fluctuation are discussed. Experimental results of natural speech analysis show that the distribution of pitch fluctuation can be approximated by shifted gamma distribution and that the correlation coefficients of 0th-5th and 30th-60th order show strong positive values. Several pitch generation models dealing with fluctuation are tested with the aim of realizing natural synthetic speech. The results of perceptual experiments recommend the fluctuation model using a 15th-order autoregressive filter excited by a uniform random number. The quality of the synthetic speech using the above fluctuation model is comparable to that of speech with the original fluctuation.

Original languageEnglish
Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
PublisherPubl by IEEE
Pages321-324
Number of pages4
Volume1
Publication statusPublished - 1990
Externally publishedYes
Event1990 International Conference on Acoustics, Speech, and Signal Processing: Speech Processing 2, VLSI, Audio and Electroacoustics Part 2 (of 5) - Albuquerque, New Mexico, USA
Duration: 1990 Apr 31990 Apr 6

Other

Other1990 International Conference on Acoustics, Speech, and Signal Processing: Speech Processing 2, VLSI, Audio and Electroacoustics Part 2 (of 5)
CityAlbuquerque, New Mexico, USA
Period90/4/390/4/6

Fingerprint

intervals
Speech analysis
random numbers
correlation coefficients
Experiments
filters

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Acoustics and Ultrasonics

Cite this

Kobayashi, T., & Sekine, H. (1990). Statistical properties of fluctuation of pitch intervals and its modeling for natural synthetic speech. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 1, pp. 321-324). Publ by IEEE.

Statistical properties of fluctuation of pitch intervals and its modeling for natural synthetic speech. / Kobayashi, Tetsunori; Sekine, Hidetoshi.

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 1 Publ by IEEE, 1990. p. 321-324.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kobayashi, T & Sekine, H 1990, Statistical properties of fluctuation of pitch intervals and its modeling for natural synthetic speech. in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. vol. 1, Publ by IEEE, pp. 321-324, 1990 International Conference on Acoustics, Speech, and Signal Processing: Speech Processing 2, VLSI, Audio and Electroacoustics Part 2 (of 5), Albuquerque, New Mexico, USA, 90/4/3.
Kobayashi T, Sekine H. Statistical properties of fluctuation of pitch intervals and its modeling for natural synthetic speech. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 1. Publ by IEEE. 1990. p. 321-324
Kobayashi, Tetsunori ; Sekine, Hidetoshi. / Statistical properties of fluctuation of pitch intervals and its modeling for natural synthetic speech. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 1 Publ by IEEE, 1990. pp. 321-324
@inproceedings{8b8be1a7070a49ed803f44e680bcc36f,
title = "Statistical properties of fluctuation of pitch intervals and its modeling for natural synthetic speech",
abstract = "Statistical properties of the fluctuation of pitch intervals are investigated, and pitch generation models considering fluctuation are discussed. Experimental results of natural speech analysis show that the distribution of pitch fluctuation can be approximated by shifted gamma distribution and that the correlation coefficients of 0th-5th and 30th-60th order show strong positive values. Several pitch generation models dealing with fluctuation are tested with the aim of realizing natural synthetic speech. The results of perceptual experiments recommend the fluctuation model using a 15th-order autoregressive filter excited by a uniform random number. The quality of the synthetic speech using the above fluctuation model is comparable to that of speech with the original fluctuation.",
author = "Tetsunori Kobayashi and Hidetoshi Sekine",
year = "1990",
language = "English",
volume = "1",
pages = "321--324",
booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",
publisher = "Publ by IEEE",

}

TY - GEN

T1 - Statistical properties of fluctuation of pitch intervals and its modeling for natural synthetic speech

AU - Kobayashi, Tetsunori

AU - Sekine, Hidetoshi

PY - 1990

Y1 - 1990

N2 - Statistical properties of the fluctuation of pitch intervals are investigated, and pitch generation models considering fluctuation are discussed. Experimental results of natural speech analysis show that the distribution of pitch fluctuation can be approximated by shifted gamma distribution and that the correlation coefficients of 0th-5th and 30th-60th order show strong positive values. Several pitch generation models dealing with fluctuation are tested with the aim of realizing natural synthetic speech. The results of perceptual experiments recommend the fluctuation model using a 15th-order autoregressive filter excited by a uniform random number. The quality of the synthetic speech using the above fluctuation model is comparable to that of speech with the original fluctuation.

AB - Statistical properties of the fluctuation of pitch intervals are investigated, and pitch generation models considering fluctuation are discussed. Experimental results of natural speech analysis show that the distribution of pitch fluctuation can be approximated by shifted gamma distribution and that the correlation coefficients of 0th-5th and 30th-60th order show strong positive values. Several pitch generation models dealing with fluctuation are tested with the aim of realizing natural synthetic speech. The results of perceptual experiments recommend the fluctuation model using a 15th-order autoregressive filter excited by a uniform random number. The quality of the synthetic speech using the above fluctuation model is comparable to that of speech with the original fluctuation.

UR - http://www.scopus.com/inward/record.url?scp=0025659614&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0025659614&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0025659614

VL - 1

SP - 321

EP - 324

BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

PB - Publ by IEEE

ER -