A low-band spectrum envelope reconstruction method for PSOLA-based F 0 modification

Ryo Mochizuki, Tetsunori Kobayashi

    Research output: Contribution to journalArticle

    Abstract

    A low-band spectrum envelope reconstruction method was tested to see if it could improve the sound quality of F0 modified speech with the PSOLA (Pitch Synchronous Overlap Add) method. In the conventional PSOLA method, the extracted spectrum envelope using a Hanning window with two-pitch-period length had no reliable information in the band of frequencies lower than the original F0. This problem causes sound degradation of the F0 modified speech when the F0 is shifted downward. In the proposed method, the low-band spectrum envelope was properly modified according to the F0 modification rate. The amplitude of the F0 harmonic components in the low-band were reproduced based on the spectral tilt of the spectrum envelope. Subjective listening tests suggest the proposed method yields improved sound quality than the conventional TD-PSOLA method when the downward modification rate exceeds 0.4 octave.

    Original languageEnglish
    Pages (from-to)2426-2429
    Number of pages4
    JournalIEICE Transactions on Information and Systems
    VolumeE87-D
    Issue number10
    Publication statusPublished - 2004 Oct

    Fingerprint

    Acoustic waves
    Degradation

    Keywords

    • F modification
    • PSOLA
    • Spectrum envelope
    • Speech synthesis

    ASJC Scopus subject areas

    • Information Systems
    • Computer Graphics and Computer-Aided Design
    • Software

    Cite this

    A low-band spectrum envelope reconstruction method for PSOLA-based F 0 modification. / Mochizuki, Ryo; Kobayashi, Tetsunori.

    In: IEICE Transactions on Information and Systems, Vol. E87-D, No. 10, 10.2004, p. 2426-2429.

    Research output: Contribution to journalArticle

    @article{70ff952e38ce4a55ae8a22195bd119b8,
    title = "A low-band spectrum envelope reconstruction method for PSOLA-based F 0 modification",
    abstract = "A low-band spectrum envelope reconstruction method was tested to see if it could improve the sound quality of F0 modified speech with the PSOLA (Pitch Synchronous Overlap Add) method. In the conventional PSOLA method, the extracted spectrum envelope using a Hanning window with two-pitch-period length had no reliable information in the band of frequencies lower than the original F0. This problem causes sound degradation of the F0 modified speech when the F0 is shifted downward. In the proposed method, the low-band spectrum envelope was properly modified according to the F0 modification rate. The amplitude of the F0 harmonic components in the low-band were reproduced based on the spectral tilt of the spectrum envelope. Subjective listening tests suggest the proposed method yields improved sound quality than the conventional TD-PSOLA method when the downward modification rate exceeds 0.4 octave.",
    keywords = "F modification, PSOLA, Spectrum envelope, Speech synthesis",
    author = "Ryo Mochizuki and Tetsunori Kobayashi",
    year = "2004",
    month = "10",
    language = "English",
    volume = "E87-D",
    pages = "2426--2429",
    journal = "IEICE Transactions on Information and Systems",
    issn = "0916-8532",
    publisher = "Maruzen Co., Ltd/Maruzen Kabushikikaisha",
    number = "10",

    }

    TY - JOUR

    T1 - A low-band spectrum envelope reconstruction method for PSOLA-based F 0 modification

    AU - Mochizuki, Ryo

    AU - Kobayashi, Tetsunori

    PY - 2004/10

    Y1 - 2004/10

    N2 - A low-band spectrum envelope reconstruction method was tested to see if it could improve the sound quality of F0 modified speech with the PSOLA (Pitch Synchronous Overlap Add) method. In the conventional PSOLA method, the extracted spectrum envelope using a Hanning window with two-pitch-period length had no reliable information in the band of frequencies lower than the original F0. This problem causes sound degradation of the F0 modified speech when the F0 is shifted downward. In the proposed method, the low-band spectrum envelope was properly modified according to the F0 modification rate. The amplitude of the F0 harmonic components in the low-band were reproduced based on the spectral tilt of the spectrum envelope. Subjective listening tests suggest the proposed method yields improved sound quality than the conventional TD-PSOLA method when the downward modification rate exceeds 0.4 octave.

    AB - A low-band spectrum envelope reconstruction method was tested to see if it could improve the sound quality of F0 modified speech with the PSOLA (Pitch Synchronous Overlap Add) method. In the conventional PSOLA method, the extracted spectrum envelope using a Hanning window with two-pitch-period length had no reliable information in the band of frequencies lower than the original F0. This problem causes sound degradation of the F0 modified speech when the F0 is shifted downward. In the proposed method, the low-band spectrum envelope was properly modified according to the F0 modification rate. The amplitude of the F0 harmonic components in the low-band were reproduced based on the spectral tilt of the spectrum envelope. Subjective listening tests suggest the proposed method yields improved sound quality than the conventional TD-PSOLA method when the downward modification rate exceeds 0.4 octave.

    KW - F modification

    KW - PSOLA

    KW - Spectrum envelope

    KW - Speech synthesis

    UR - http://www.scopus.com/inward/record.url?scp=7544251782&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=7544251782&partnerID=8YFLogxK

    M3 - Article

    AN - SCOPUS:7544251782

    VL - E87-D

    SP - 2426

    EP - 2429

    JO - IEICE Transactions on Information and Systems

    JF - IEICE Transactions on Information and Systems

    SN - 0916-8532

    IS - 10

    ER -