Concatenative speech synthesis by minimum distortion criteria

Naoto Iwahashi, Nobuyoshi Kaiki, Yoshinori Sagisaka

研究成果: Conference contribution

11 被引用数 (Scopus)

抄録

A new scheme is proposed for concatenative speech synthesis to improve the segment selection procedure by minimizing acoustic distortions between the selected segment and the desired spectrum for the target. The spectral pro-totypicality of a segment, the spectral difference between the source and target contexts, the degradation resulting from concatenation of phonemes, and the acoustic continuity between the concatenated segments are all considered as measures. A search method for selecting segments from a large speech database is also described. In this method, a three-step optimization is used for distortion minimization. A perceptual test shows that contextual spectral difference and acoustic continuity at the segment boundary are important measures for improving the quality of synthesized speech.

本文言語English
ホスト出版物のタイトルICASSP 1992 - 1992 International Conference on Acoustics, Speech, and Signal Processing
出版社Institute of Electrical and Electronics Engineers Inc.
ページ65-68
ページ数4
2
ISBN(電子版)0780305329
DOI
出版ステータスPublished - 1992
外部発表はい
イベント1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992 - San Francisco, United States
継続期間: 1992 3 231992 3 26

Other

Other1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992
国/地域United States
CitySan Francisco
Period92/3/2392/3/26

ASJC Scopus subject areas

  • ソフトウェア
  • 信号処理
  • 電子工学および電気工学

フィンガープリント

「Concatenative speech synthesis by minimum distortion criteria」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル