Communicative speech synthesis using constituent word attributes

Yoko Greenberg, Minoru Tsuzaki, Hiroaki Kato, Yoshinori Sagisaka

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    8 Citations (Scopus)

    Abstract

    Aiming at F0 control for communicative speech synthesis, the relationship between word attributes and F0 characteristics was analyzed. By analyzing one-phrase utterances in conversational situations, we studied correlations between word attributes defined by their impressions and prosodic control characteristics. For word attribute description, we used three dimensions in perceptual impressions, confident-doubtful, allowable-unacceptable and positive-negative obtained from our previous studies on one syllable utterances of "n". The result showed that F0 height, F0 dynamic patterns and duration could be consistently controlled by the word attributes. The positive-(negative) can be controlled by F0 height, while confident-doubtful, allowable-unacceptable were reflected in F0 dynamic patterns and duration. These results confirmed the usefulness of word attributes in communicative speech synthesis.

    Original languageEnglish
    Title of host publication9th European Conference on Speech Communication and Technology
    Pages517-520
    Number of pages4
    Publication statusPublished - 2005
    Event9th European Conference on Speech Communication and Technology - Lisbon
    Duration: 2005 Sep 42005 Sep 8

    Other

    Other9th European Conference on Speech Communication and Technology
    CityLisbon
    Period05/9/405/9/8

    Fingerprint

    Speech synthesis

    ASJC Scopus subject areas

    • Engineering(all)

    Cite this

    Greenberg, Y., Tsuzaki, M., Kato, H., & Sagisaka, Y. (2005). Communicative speech synthesis using constituent word attributes. In 9th European Conference on Speech Communication and Technology (pp. 517-520)

    Communicative speech synthesis using constituent word attributes. / Greenberg, Yoko; Tsuzaki, Minoru; Kato, Hiroaki; Sagisaka, Yoshinori.

    9th European Conference on Speech Communication and Technology. 2005. p. 517-520.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Greenberg, Y, Tsuzaki, M, Kato, H & Sagisaka, Y 2005, Communicative speech synthesis using constituent word attributes. in 9th European Conference on Speech Communication and Technology. pp. 517-520, 9th European Conference on Speech Communication and Technology, Lisbon, 05/9/4.
    Greenberg Y, Tsuzaki M, Kato H, Sagisaka Y. Communicative speech synthesis using constituent word attributes. In 9th European Conference on Speech Communication and Technology. 2005. p. 517-520
    Greenberg, Yoko ; Tsuzaki, Minoru ; Kato, Hiroaki ; Sagisaka, Yoshinori. / Communicative speech synthesis using constituent word attributes. 9th European Conference on Speech Communication and Technology. 2005. pp. 517-520
    @inproceedings{dd8c98a384a44292bf39c0df39721e42,
    title = "Communicative speech synthesis using constituent word attributes",
    abstract = "Aiming at F0 control for communicative speech synthesis, the relationship between word attributes and F0 characteristics was analyzed. By analyzing one-phrase utterances in conversational situations, we studied correlations between word attributes defined by their impressions and prosodic control characteristics. For word attribute description, we used three dimensions in perceptual impressions, confident-doubtful, allowable-unacceptable and positive-negative obtained from our previous studies on one syllable utterances of {"}n{"}. The result showed that F0 height, F0 dynamic patterns and duration could be consistently controlled by the word attributes. The positive-(negative) can be controlled by F0 height, while confident-doubtful, allowable-unacceptable were reflected in F0 dynamic patterns and duration. These results confirmed the usefulness of word attributes in communicative speech synthesis.",
    author = "Yoko Greenberg and Minoru Tsuzaki and Hiroaki Kato and Yoshinori Sagisaka",
    year = "2005",
    language = "English",
    pages = "517--520",
    booktitle = "9th European Conference on Speech Communication and Technology",

    }

    TY - GEN

    T1 - Communicative speech synthesis using constituent word attributes

    AU - Greenberg, Yoko

    AU - Tsuzaki, Minoru

    AU - Kato, Hiroaki

    AU - Sagisaka, Yoshinori

    PY - 2005

    Y1 - 2005

    N2 - Aiming at F0 control for communicative speech synthesis, the relationship between word attributes and F0 characteristics was analyzed. By analyzing one-phrase utterances in conversational situations, we studied correlations between word attributes defined by their impressions and prosodic control characteristics. For word attribute description, we used three dimensions in perceptual impressions, confident-doubtful, allowable-unacceptable and positive-negative obtained from our previous studies on one syllable utterances of "n". The result showed that F0 height, F0 dynamic patterns and duration could be consistently controlled by the word attributes. The positive-(negative) can be controlled by F0 height, while confident-doubtful, allowable-unacceptable were reflected in F0 dynamic patterns and duration. These results confirmed the usefulness of word attributes in communicative speech synthesis.

    AB - Aiming at F0 control for communicative speech synthesis, the relationship between word attributes and F0 characteristics was analyzed. By analyzing one-phrase utterances in conversational situations, we studied correlations between word attributes defined by their impressions and prosodic control characteristics. For word attribute description, we used three dimensions in perceptual impressions, confident-doubtful, allowable-unacceptable and positive-negative obtained from our previous studies on one syllable utterances of "n". The result showed that F0 height, F0 dynamic patterns and duration could be consistently controlled by the word attributes. The positive-(negative) can be controlled by F0 height, while confident-doubtful, allowable-unacceptable were reflected in F0 dynamic patterns and duration. These results confirmed the usefulness of word attributes in communicative speech synthesis.

    UR - http://www.scopus.com/inward/record.url?scp=33745225462&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=33745225462&partnerID=8YFLogxK

    M3 - Conference contribution

    SP - 517

    EP - 520

    BT - 9th European Conference on Speech Communication and Technology

    ER -