ATR Japanese speech database as a tool of speech recognition and synthesis

Akira Kurematsu, Kazuya Takeda, Yoshinori Sagisaka, Shigeru Katagiri, Hisao Kuwabara, Kiyohiro Shikano

Research output: Contribution to journalArticle

153 Citations (Scopus)

Abstract

A large-scale Japanese speech database has been described. The database basically consists of (1) a word speech database, (2) a continuous speech database, (3) a database for a large number of speakers, and (4) a database for speech synthesis. Multiple transcriptions have been made in five different layers from simple phonemic descriptions to fine acoustic-phonetic transcriptions. The database has been used to develop algorithms in speech recognition and synthesis studies and to find acoustic, phonetic and linguistic evidence that will serve as basic data for speech technologies.

Original languageEnglish
Pages (from-to)357-363
Number of pages7
JournalSpeech Communication
Volume9
Issue number4
DOIs
Publication statusPublished - 1990
Externally publishedYes

Fingerprint

Speech Synthesis
Speech synthesis
Speech Recognition
Speech recognition
Databases
Phonetics
Speech analysis
phonetics
acoustics
Transcription
Acoustics
Linguistics
Speech
Data Base
linguistics
Technology
evidence

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Experimental and Cognitive Psychology
  • Linguistics and Language

Cite this

ATR Japanese speech database as a tool of speech recognition and synthesis. / Kurematsu, Akira; Takeda, Kazuya; Sagisaka, Yoshinori; Katagiri, Shigeru; Kuwabara, Hisao; Shikano, Kiyohiro.

In: Speech Communication, Vol. 9, No. 4, 1990, p. 357-363.

Research output: Contribution to journalArticle

Kurematsu, A, Takeda, K, Sagisaka, Y, Katagiri, S, Kuwabara, H & Shikano, K 1990, 'ATR Japanese speech database as a tool of speech recognition and synthesis', Speech Communication, vol. 9, no. 4, pp. 357-363. https://doi.org/10.1016/0167-6393(90)90011-W
Kurematsu, Akira ; Takeda, Kazuya ; Sagisaka, Yoshinori ; Katagiri, Shigeru ; Kuwabara, Hisao ; Shikano, Kiyohiro. / ATR Japanese speech database as a tool of speech recognition and synthesis. In: Speech Communication. 1990 ; Vol. 9, No. 4. pp. 357-363.
@article{a647151f1ead47c097e90accd4c709c5,
title = "ATR Japanese speech database as a tool of speech recognition and synthesis",
abstract = "A large-scale Japanese speech database has been described. The database basically consists of (1) a word speech database, (2) a continuous speech database, (3) a database for a large number of speakers, and (4) a database for speech synthesis. Multiple transcriptions have been made in five different layers from simple phonemic descriptions to fine acoustic-phonetic transcriptions. The database has been used to develop algorithms in speech recognition and synthesis studies and to find acoustic, phonetic and linguistic evidence that will serve as basic data for speech technologies.",
author = "Akira Kurematsu and Kazuya Takeda and Yoshinori Sagisaka and Shigeru Katagiri and Hisao Kuwabara and Kiyohiro Shikano",
year = "1990",
doi = "10.1016/0167-6393(90)90011-W",
language = "English",
volume = "9",
pages = "357--363",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",
number = "4",

}

TY - JOUR

T1 - ATR Japanese speech database as a tool of speech recognition and synthesis

AU - Kurematsu, Akira

AU - Takeda, Kazuya

AU - Sagisaka, Yoshinori

AU - Katagiri, Shigeru

AU - Kuwabara, Hisao

AU - Shikano, Kiyohiro

PY - 1990

Y1 - 1990

N2 - A large-scale Japanese speech database has been described. The database basically consists of (1) a word speech database, (2) a continuous speech database, (3) a database for a large number of speakers, and (4) a database for speech synthesis. Multiple transcriptions have been made in five different layers from simple phonemic descriptions to fine acoustic-phonetic transcriptions. The database has been used to develop algorithms in speech recognition and synthesis studies and to find acoustic, phonetic and linguistic evidence that will serve as basic data for speech technologies.

AB - A large-scale Japanese speech database has been described. The database basically consists of (1) a word speech database, (2) a continuous speech database, (3) a database for a large number of speakers, and (4) a database for speech synthesis. Multiple transcriptions have been made in five different layers from simple phonemic descriptions to fine acoustic-phonetic transcriptions. The database has been used to develop algorithms in speech recognition and synthesis studies and to find acoustic, phonetic and linguistic evidence that will serve as basic data for speech technologies.

UR - http://www.scopus.com/inward/record.url?scp=0025475528&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0025475528&partnerID=8YFLogxK

U2 - 10.1016/0167-6393(90)90011-W

DO - 10.1016/0167-6393(90)90011-W

M3 - Article

AN - SCOPUS:0025475528

VL - 9

SP - 357

EP - 363

JO - Speech Communication

JF - Speech Communication

SN - 0167-6393

IS - 4

ER -