TY - GEN
T1 - Concatenative speech synthesis by minimum distortion criteria
AU - Iwahashi, Naoto
AU - Kaiki, Nobuyoshi
AU - Sagisaka, Yoshinori
N1 - Publisher Copyright: © 1992 IEEE.
PY - 1992
Y1 - 1992
N2 - A new scheme is proposed for concatenative speech synthesis to improve the segment selection procedure by minimizing acoustic distortions between the selected segment and the desired spectrum for the target. The spectral prototypicality of a segment, the spectral difference between the source and target contexts, the degradation resulting from concatenation of phonemes, and the acoustic continuity between the concatenated segments are all considered as measures. A search method for selecting segments from a large speech database is also described. In this method, a three-step optimization is used for distortion minimization. A perceptual test shows that contextual spectral difference and acoustic continuity at the segment boundary are important measures for improving the quality of synthesized speech.
AB - A new scheme is proposed for concatenative speech synthesis to improve the segment selection procedure by minimizing acoustic distortions between the selected segment and the desired spectrum for the target. The spectral prototypicality of a segment, the spectral difference between the source and target contexts, the degradation resulting from concatenation of phonemes, and the acoustic continuity between the concatenated segments are all considered as measures. A search method for selecting segments from a large speech database is also described. In this method, a three-step optimization is used for distortion minimization. A perceptual test shows that contextual spectral difference and acoustic continuity at the segment boundary are important measures for improving the quality of synthesized speech.
UR - http://www.scopus.com/inward/record.url?scp=85009071260&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85009071260&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.1992.226119
DO - 10.1109/ICASSP.1992.226119
M3 - Conference contribution
AN - SCOPUS:85009071260
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 65
EP - 68
BT - ICASSP 1992 - 1992 International Conference on Acoustics, Speech, and Signal Processing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992
Y2 - 23 March 1992 through 26 March 1992
ER -