Application of variational Bayesian approach to speech recognition

Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda

研究成果: Conference contribution

17 引用 (Scopus)

抄録

In this paper, we propose a Bayesian framework, which constructs shared-state triphone HMMs based on a variational Bayesian approach, and recognizes speech based on the Bayesian prediction classification; variational Bayesian estimation and clustering for speech recognition (VBEC). An appropriate model structure with high recognition performance can be found within a VBEC framework. Unlike conventional methods, including BIC or MDL criterion based on the maximum likelihood approach, the proposed model selection is valid in principle, even when there are insufficient amounts of data, because it does not use an asymptotic assumption. In isolated word recognition experiments, we show the advantage of VBEC over conventional methods, especially when dealing with small amounts of data.

元の言語English
ホスト出版物のタイトルAdvances in Neural Information Processing Systems 15 - Proceedings of the 2002 Conference, NIPS 2002
出版者Neural information processing systems foundation
ISBN(印刷物)0262025507, 9780262025508
出版物ステータスPublished - 2003
外部発表Yes
イベント16th Annual Neural Information Processing Systems Conference, NIPS 2002 - Vancouver, BC, Canada
継続期間: 2002 12 92002 12 14

Other

Other16th Annual Neural Information Processing Systems Conference, NIPS 2002
Canada
Vancouver, BC
期間02/12/902/12/14

Fingerprint

Model structures
Speech recognition
Maximum likelihood
Experiments

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Signal Processing

これを引用

Watanabe, S., Minami, Y., Nakamura, A., & Ueda, N. (2003). Application of variational Bayesian approach to speech recognition. : Advances in Neural Information Processing Systems 15 - Proceedings of the 2002 Conference, NIPS 2002 Neural information processing systems foundation.

Application of variational Bayesian approach to speech recognition. / Watanabe, Shinji; Minami, Yasuhiro; Nakamura, Atsushi; Ueda, Naonori.

Advances in Neural Information Processing Systems 15 - Proceedings of the 2002 Conference, NIPS 2002. Neural information processing systems foundation, 2003.

研究成果: Conference contribution

Watanabe, S, Minami, Y, Nakamura, A & Ueda, N 2003, Application of variational Bayesian approach to speech recognition. : Advances in Neural Information Processing Systems 15 - Proceedings of the 2002 Conference, NIPS 2002. Neural information processing systems foundation, 16th Annual Neural Information Processing Systems Conference, NIPS 2002, Vancouver, BC, Canada, 02/12/9.
Watanabe S, Minami Y, Nakamura A, Ueda N. Application of variational Bayesian approach to speech recognition. : Advances in Neural Information Processing Systems 15 - Proceedings of the 2002 Conference, NIPS 2002. Neural information processing systems foundation. 2003
Watanabe, Shinji ; Minami, Yasuhiro ; Nakamura, Atsushi ; Ueda, Naonori. / Application of variational Bayesian approach to speech recognition. Advances in Neural Information Processing Systems 15 - Proceedings of the 2002 Conference, NIPS 2002. Neural information processing systems foundation, 2003.
@inproceedings{68995be87b83481787fe0adceb02bd15,
title = "Application of variational Bayesian approach to speech recognition",
abstract = "In this paper, we propose a Bayesian framework, which constructs shared-state triphone HMMs based on a variational Bayesian approach, and recognizes speech based on the Bayesian prediction classification; variational Bayesian estimation and clustering for speech recognition (VBEC). An appropriate model structure with high recognition performance can be found within a VBEC framework. Unlike conventional methods, including BIC or MDL criterion based on the maximum likelihood approach, the proposed model selection is valid in principle, even when there are insufficient amounts of data, because it does not use an asymptotic assumption. In isolated word recognition experiments, we show the advantage of VBEC over conventional methods, especially when dealing with small amounts of data.",
author = "Shinji Watanabe and Yasuhiro Minami and Atsushi Nakamura and Naonori Ueda",
year = "2003",
language = "English",
isbn = "0262025507",
booktitle = "Advances in Neural Information Processing Systems 15 - Proceedings of the 2002 Conference, NIPS 2002",
publisher = "Neural information processing systems foundation",

}

TY - GEN

T1 - Application of variational Bayesian approach to speech recognition

AU - Watanabe, Shinji

AU - Minami, Yasuhiro

AU - Nakamura, Atsushi

AU - Ueda, Naonori

PY - 2003

Y1 - 2003

N2 - In this paper, we propose a Bayesian framework, which constructs shared-state triphone HMMs based on a variational Bayesian approach, and recognizes speech based on the Bayesian prediction classification; variational Bayesian estimation and clustering for speech recognition (VBEC). An appropriate model structure with high recognition performance can be found within a VBEC framework. Unlike conventional methods, including BIC or MDL criterion based on the maximum likelihood approach, the proposed model selection is valid in principle, even when there are insufficient amounts of data, because it does not use an asymptotic assumption. In isolated word recognition experiments, we show the advantage of VBEC over conventional methods, especially when dealing with small amounts of data.

AB - In this paper, we propose a Bayesian framework, which constructs shared-state triphone HMMs based on a variational Bayesian approach, and recognizes speech based on the Bayesian prediction classification; variational Bayesian estimation and clustering for speech recognition (VBEC). An appropriate model structure with high recognition performance can be found within a VBEC framework. Unlike conventional methods, including BIC or MDL criterion based on the maximum likelihood approach, the proposed model selection is valid in principle, even when there are insufficient amounts of data, because it does not use an asymptotic assumption. In isolated word recognition experiments, we show the advantage of VBEC over conventional methods, especially when dealing with small amounts of data.

UR - http://www.scopus.com/inward/record.url?scp=79957689964&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79957689964&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:79957689964

SN - 0262025507

SN - 9780262025508

BT - Advances in Neural Information Processing Systems 15 - Proceedings of the 2002 Conference, NIPS 2002

PB - Neural information processing systems foundation

ER -