Analysis of acoustic models trained on a large-scale Japanese speech database

Tomoko Matsui, Masaki Naito, Yoshinori Sagisaka, Kozo Okuda, Satoshi Nakamura

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper investigates the performance of speaker-independent (SI) acoustic hidden Markov models (HMMs) trained on a very large Japanese speech database, and discusses the efficiency and task independence involved. The database consists of read and spontaneous speech uttered by 3,771 speakers. The speakers are widely distributed with respect to region and age, so that the database captures the characteristics of Japanese speech as fully as possible. Recognition experiments on the spontaneous speech show that task-independent acoustic models can be created when training data from a very large number of speakers is available.

Original language: English
Title of host publication: 6th International Conference on Spoken Language Processing, ICSLP 2000
Publisher: International Speech Communication Association
ISBN (Electronic): 7801501144, 9787801501141
Publication status: Published - 2000
Externally published: Yes
Event: 6th International Conference on Spoken Language Processing, ICSLP 2000 - Beijing, China
Duration: 2000 Oct 16 to 2000 Oct 20

Other

Other: 6th International Conference on Spoken Language Processing, ICSLP 2000
Country: China
City: Beijing
Period: 2000/10/16 to 2000/10/20

Fingerprint

Acoustics
Database
Efficiency
Experiment
Spontaneous speech
Performance
Hidden Markov model

ASJC Scopus subject areas

  • Linguistics and Language
  • Language and Linguistics

Cite this

Matsui, T., Naito, M., Sagisaka, Y., Okuda, K., & Nakamura, S. (2000). Analysis of acoustic models trained on a large-scale Japanese speech database. In 6th International Conference on Spoken Language Processing, ICSLP 2000. International Speech Communication Association.

@inproceedings{614aa3a6ed934bf8ae11b750e4aa6efe,
title = "Analysis of acoustic models trained on a large-scale Japanese speech database",
author = "Tomoko Matsui and Masaki Naito and Yoshinori Sagisaka and Kozo Okuda and Satoshi Nakamura",
year = "2000",
language = "English",
booktitle = "6th International Conference on Spoken Language Processing, ICSLP 2000",
publisher = "International Speech Communication Association",
}

TY - GEN
T1 - Analysis of acoustic models trained on a large-scale Japanese speech database
AU - Matsui, Tomoko
AU - Naito, Masaki
AU - Sagisaka, Yoshinori
AU - Okuda, Kozo
AU - Nakamura, Satoshi
PY - 2000
Y1 - 2000
UR - http://www.scopus.com/inward/record.url?scp=85009080621&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85009080621&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85009080621
BT - 6th International Conference on Spoken Language Processing, ICSLP 2000
PB - International Speech Communication Association
ER -