Japanese speech databases for robust speech recognition

Atsushi Nakamura, Shoichi Matsunaga, Tohru Shimizu, Masahiro Tonomura, Yoshinori Sagisaka

Research output: Contribution to conference › Paper

39 Citations (Scopus)

Abstract

At ATR, a next-generation speech translation system is under development, aimed at natural trans-language communication. To meet the varied demands that the new system places on speech recognition technology, further research should emphasize robustness to large vocabularies, to the speaking variations often found in fast spontaneous speech, and to speaker variability. These are key problems to be solved not only for speech translation but also for the general use of speech recognition in real environments. This paper describes the design of three large speech databases intended to address these problems and reports the current status of data collection.

Original language: English
Pages: 2199-2202
Number of pages: 4
Publication status: Published - 1996 Dec 1
Event: Proceedings of the 1996 International Conference on Spoken Language Processing, ICSLP. Part 1 (of 4) - Philadelphia, PA, USA
Duration: 1996 Oct 3 to 1996 Oct 6


ASJC Scopus subject areas

  • Computer Science (all)

Cite this

Nakamura, A., Matsunaga, S., Shimizu, T., Tonomura, M., & Sagisaka, Y. (1996). Japanese speech databases for robust speech recognition. 2199-2202. Paper presented at Proceedings of the 1996 International Conference on Spoken Language Processing, ICSLP. Part 1 (of 4), Philadelphia, PA, USA.