TY - JOUR
T1 - A low-band spectrum envelope modeling for high quality pitch modification
AU - Mochizuki, Ryo
AU - Kobayashi, Tetsunori
PY - 2004
Y1 - 2004
N2 - A low-band spectrum envelope reconstruction method was tested to see if it could improve the sound quality of speech modified by the PSOLA(Pitch Synchronous OverLap Add) method. In the conventional TD(Time Domain)-PSOLA method, the spectrum envelope extracted using a Hanning window with a two-pitch-period length had no reliable information in the band of frequencies lower than original F0. This problem causes the sound degradation of the F0 modified speech. In the proposed method, the low-band spectrum envelope was properly modified according to the F0 modification rate. The amplitude of the F0 harmonic components in the low-band was reproduced based on the spectral tilt of the spectrum envelope. Subjective listening test results suggest this proposed method yields better sound quality than the conventional TD-PSOLA method when the downward modification rate exceeds 0.4 octave.
AB - A low-band spectrum envelope reconstruction method was tested to see if it could improve the sound quality of speech modified by the PSOLA(Pitch Synchronous OverLap Add) method. In the conventional TD(Time Domain)-PSOLA method, the spectrum envelope extracted using a Hanning window with a two-pitch-period length had no reliable information in the band of frequencies lower than original F0. This problem causes the sound degradation of the F0 modified speech. In the proposed method, the low-band spectrum envelope was properly modified according to the F0 modification rate. The amplitude of the F0 harmonic components in the low-band was reproduced based on the spectral tilt of the spectrum envelope. Subjective listening test results suggest this proposed method yields better sound quality than the conventional TD-PSOLA method when the downward modification rate exceeds 0.4 octave.
UR - http://www.scopus.com/inward/record.url?scp=4544232303&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=4544232303&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:4544232303
SN - 0736-7791
VL - 1
SP - I645-I648
JO - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
JF - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
T2 - Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing
Y2 - 17 May 2004 through 21 May 2004
ER -