THE DESIGN OF THE NEWSPAPER-BASED JAPANESE LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION CORPUS

Katunobu Itou, Mikio Yamamoto, Kazuya Takeda, Toshiyuki Takezawa, Tatsuo Matsuoka, Tetsunori Kobayashi, Kiyohiro Shikano, Shuichi Itahashi

Research output: Paper, peer-reviewed

60 Citations (Scopus)

Abstract

In this paper we present the first public Japanese speech corpus for large vocabulary continuous speech recognition (LVCSR) technology, which we have titled JNAS (Japanese Newspaper Article Sentences). We designed it to be comparable to the corpora used in the American and European LVCSR projects. The corpus contains speech recordings (60 hours) and their orthographic transcriptions for 306 speakers (153 males and 153 females) reading excerpts from newspaper articles and phonetically balanced (PB) sentences. In total, the corpus contains utterances of about 45,000 sentences, with each speaker reading about 150 sentences. JNAS is being distributed on 16 CD-ROMs.

Original language: English
Publication status: Published - 1998
Event: 5th International Conference on Spoken Language Processing, ICSLP 1998 - Sydney, Australia
Duration: 30 Nov 1998 to 4 Dec 1998

Conference

Conference: 5th International Conference on Spoken Language Processing, ICSLP 1998
Country/Territory: Australia
City: Sydney
Period: 98/11/30 to 98/12/4

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
