Parallel speech corpora of Japanese dialects

Koichiro Yoshino, Naoki Hirayama, Shinsuke Mori, Fumihiko Takahashi, Katsutoshi Itoyama, Hiroshi G. Okuno

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    2 Citations (Scopus)

    Abstract

    Clean speech data is necessary for spoken language processing, however, there is no public Japanese dialect corpus collected for speech processing. Parallel speech corpora of dialect are also important because real dialect affects each other, however, the existing data only includes noisy speech data of dialects and their translation in common language. In this paper, we collected parallel speech corpora of Japanese dialect, 100 read speeches utterance of 25 dialect speakers and their transcriptions of phoneme. We recorded speeches of 5 common language speakers and 20 dialect speakers from 4 areas, 5 speakers from 1 area, respectively. Each dialect speaker converted the same common language texts to their dialect and read them. Speeches are recorded with closed-talk microphone, using for spoken language processing (recognition, synthesis, pronounce estimation). In the experiments, accuracies of automatic speech recognition (ASR) and Kana Kanji conversion (KKC) system are improved by adapting the system with the data.

    Original languageEnglish
    Title of host publicationProceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016
    PublisherEuropean Language Resources Association (ELRA)
    Pages4652-4657
    Number of pages6
    ISBN (Electronic)9782951740891
    Publication statusPublished - 2016 Jan 1
    Event10th International Conference on Language Resources and Evaluation, LREC 2016 - Portoroz, Slovenia
    Duration: 2016 May 232016 May 28

    Other

    Other10th International Conference on Language Resources and Evaluation, LREC 2016
    CountrySlovenia
    CityPortoroz
    Period16/5/2316/5/28

    Fingerprint

    dialect
    spoken language
    language
    Japanese Dialects
    Common Language
    experiment

    Keywords

    • Dialect
    • Japanese
    • Speech
    • Transcription

    ASJC Scopus subject areas

    • Linguistics and Language
    • Library and Information Sciences
    • Language and Linguistics
    • Education

    Cite this

    Yoshino, K., Hirayama, N., Mori, S., Takahashi, F., Itoyama, K., & Okuno, H. G. (2016). Parallel speech corpora of Japanese dialects. In Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp. 4652-4657). European Language Resources Association (ELRA).

    Parallel speech corpora of Japanese dialects. / Yoshino, Koichiro; Hirayama, Naoki; Mori, Shinsuke; Takahashi, Fumihiko; Itoyama, Katsutoshi; Okuno, Hiroshi G.

    Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. European Language Resources Association (ELRA), 2016. p. 4652-4657.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Yoshino, K, Hirayama, N, Mori, S, Takahashi, F, Itoyama, K & Okuno, HG 2016, Parallel speech corpora of Japanese dialects. in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. European Language Resources Association (ELRA), pp. 4652-4657, 10th International Conference on Language Resources and Evaluation, LREC 2016, Portoroz, Slovenia, 16/5/23.
    Yoshino K, Hirayama N, Mori S, Takahashi F, Itoyama K, Okuno HG. Parallel speech corpora of Japanese dialects. In Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. European Language Resources Association (ELRA). 2016. p. 4652-4657
    Yoshino, Koichiro ; Hirayama, Naoki ; Mori, Shinsuke ; Takahashi, Fumihiko ; Itoyama, Katsutoshi ; Okuno, Hiroshi G. / Parallel speech corpora of Japanese dialects. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. European Language Resources Association (ELRA), 2016. pp. 4652-4657
    @inproceedings{88b99c48b7de49008a975a0bb822fd31,
    title = "Parallel speech corpora of Japanese dialects",
    abstract = "Clean speech data is necessary for spoken language processing, however, there is no public Japanese dialect corpus collected for speech processing. Parallel speech corpora of dialect are also important because real dialect affects each other, however, the existing data only includes noisy speech data of dialects and their translation in common language. In this paper, we collected parallel speech corpora of Japanese dialect, 100 read speeches utterance of 25 dialect speakers and their transcriptions of phoneme. We recorded speeches of 5 common language speakers and 20 dialect speakers from 4 areas, 5 speakers from 1 area, respectively. Each dialect speaker converted the same common language texts to their dialect and read them. Speeches are recorded with closed-talk microphone, using for spoken language processing (recognition, synthesis, pronounce estimation). In the experiments, accuracies of automatic speech recognition (ASR) and Kana Kanji conversion (KKC) system are improved by adapting the system with the data.",
    keywords = "Dialect, Japanese, Speech, Transcription",
    author = "Koichiro Yoshino and Naoki Hirayama and Shinsuke Mori and Fumihiko Takahashi and Katsutoshi Itoyama and Okuno, {Hiroshi G.}",
    year = "2016",
    month = "1",
    day = "1",
    language = "English",
    pages = "4652--4657",
    booktitle = "Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016",
    publisher = "European Language Resources Association (ELRA)",

    }

    TY - GEN

    T1 - Parallel speech corpora of Japanese dialects

    AU - Yoshino, Koichiro

    AU - Hirayama, Naoki

    AU - Mori, Shinsuke

    AU - Takahashi, Fumihiko

    AU - Itoyama, Katsutoshi

    AU - Okuno, Hiroshi G.

    PY - 2016/1/1

    Y1 - 2016/1/1

    N2 - Clean speech data is necessary for spoken language processing, however, there is no public Japanese dialect corpus collected for speech processing. Parallel speech corpora of dialect are also important because real dialect affects each other, however, the existing data only includes noisy speech data of dialects and their translation in common language. In this paper, we collected parallel speech corpora of Japanese dialect, 100 read speeches utterance of 25 dialect speakers and their transcriptions of phoneme. We recorded speeches of 5 common language speakers and 20 dialect speakers from 4 areas, 5 speakers from 1 area, respectively. Each dialect speaker converted the same common language texts to their dialect and read them. Speeches are recorded with closed-talk microphone, using for spoken language processing (recognition, synthesis, pronounce estimation). In the experiments, accuracies of automatic speech recognition (ASR) and Kana Kanji conversion (KKC) system are improved by adapting the system with the data.

    AB - Clean speech data is necessary for spoken language processing, however, there is no public Japanese dialect corpus collected for speech processing. Parallel speech corpora of dialect are also important because real dialect affects each other, however, the existing data only includes noisy speech data of dialects and their translation in common language. In this paper, we collected parallel speech corpora of Japanese dialect, 100 read speeches utterance of 25 dialect speakers and their transcriptions of phoneme. We recorded speeches of 5 common language speakers and 20 dialect speakers from 4 areas, 5 speakers from 1 area, respectively. Each dialect speaker converted the same common language texts to their dialect and read them. Speeches are recorded with closed-talk microphone, using for spoken language processing (recognition, synthesis, pronounce estimation). In the experiments, accuracies of automatic speech recognition (ASR) and Kana Kanji conversion (KKC) system are improved by adapting the system with the data.

    KW - Dialect

    KW - Japanese

    KW - Speech

    KW - Transcription

    UR - http://www.scopus.com/inward/record.url?scp=85037160047&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=85037160047&partnerID=8YFLogxK

    M3 - Conference contribution

    AN - SCOPUS:85037160047

    SP - 4652

    EP - 4657

    BT - Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016

    PB - European Language Resources Association (ELRA)

    ER -