A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches

Shinji Watanabe, Atsushi Nakamura

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

Incremental adaptation techniques for speech recognition are aimed at adjusting acoustic models quickly and stably to time-variant acoustic characteristics due to temporal changes of speaker, speaking style, noise source, etc. Recently we proposed a novel incremental adaptation framework based on a macroscopic time evolution system, which models the time-variant characteristics by successively updating posterior distributions of acoustic model parameters. In this paper, we provide a unified interpretation of the proposal and the two major conventional approaches of indirect adaptation via transformation parameters (e.g. Maximum Likelihood Linear Regression (MLLR)) and direct adaptation of classifier parameters (e.g. Maximum A Posteriori (MAP)). We reveal analytically and experimentally that the proposed incremental adaptation involves both the conventional and their combinatorial approaches, and simultaneously possesses their quick and stable adaptation characteristics.

Original languageEnglish
Title of host publication2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
Pages4285-4288
Number of pages4
DOIs
Publication statusPublished - 2008
Externally publishedYes
Event2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP - Las Vegas, NV
Duration: 2008 Mar 312008 Apr 4

Other

Other2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
CityLas Vegas, NV
Period08/3/3108/4/4

Fingerprint

Acoustics
Speech recognition
Linear regression
Maximum likelihood
acoustics
Classifiers
speech recognition
classifiers
proposals
regression analysis
adjusting

Keywords

  • Acoustic model
  • Incremental adaptation
  • Indirect/direct adaptation
  • Macroscopic time evolution
  • Speech recognition

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Signal Processing
  • Acoustics and Ultrasonics

Cite this

Watanabe, S., & Nakamura, A. (2008). A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches. In 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP (pp. 4285-4288). [4518602] https://doi.org/10.1109/ICASSP.2008.4518602

A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches. / Watanabe, Shinji; Nakamura, Atsushi.

2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. 2008. p. 4285-4288 4518602.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Watanabe, S & Nakamura, A 2008, A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches. in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP., 4518602, pp. 4285-4288, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Las Vegas, NV, 08/3/31. https://doi.org/10.1109/ICASSP.2008.4518602
Watanabe S, Nakamura A. A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches. In 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. 2008. p. 4285-4288. 4518602 https://doi.org/10.1109/ICASSP.2008.4518602
Watanabe, Shinji ; Nakamura, Atsushi. / A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. 2008. pp. 4285-4288
@inproceedings{c735146b252544e7902af9a09d681a1a,
title = "A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches",
abstract = "Incremental adaptation techniques for speech recognition are aimed at adjusting acoustic models quickly and stably to time-variant acoustic characteristics due to temporal changes of speaker, speaking style, noise source, etc. Recently we proposed a novel incremental adaptation framework based on a macroscopic time evolution system, which models the time-variant characteristics by successively updating posterior distributions of acoustic model parameters. In this paper, we provide a unified interpretation of the proposal and the two major conventional approaches of indirect adaptation via transformation parameters (e.g. Maximum Likelihood Linear Regression (MLLR)) and direct adaptation of classifier parameters (e.g. Maximum A Posteriori (MAP)). We reveal analytically and experimentally that the proposed incremental adaptation involves both the conventional and their combinatorial approaches, and simultaneously possesses their quick and stable adaptation characteristics.",
keywords = "Acoustic model, Incremental adaptation, Indirect/direct adaptation, Macroscopic time evolution, Speech recognition",
author = "Shinji Watanabe and Atsushi Nakamura",
year = "2008",
doi = "10.1109/ICASSP.2008.4518602",
language = "English",
isbn = "1424414849",
pages = "4285--4288",
booktitle = "2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP",

}

TY - GEN

T1 - A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches

AU - Watanabe, Shinji

AU - Nakamura, Atsushi

PY - 2008

Y1 - 2008

N2 - Incremental adaptation techniques for speech recognition are aimed at adjusting acoustic models quickly and stably to time-variant acoustic characteristics due to temporal changes of speaker, speaking style, noise source, etc. Recently we proposed a novel incremental adaptation framework based on a macroscopic time evolution system, which models the time-variant characteristics by successively updating posterior distributions of acoustic model parameters. In this paper, we provide a unified interpretation of the proposal and the two major conventional approaches of indirect adaptation via transformation parameters (e.g. Maximum Likelihood Linear Regression (MLLR)) and direct adaptation of classifier parameters (e.g. Maximum A Posteriori (MAP)). We reveal analytically and experimentally that the proposed incremental adaptation involves both the conventional and their combinatorial approaches, and simultaneously possesses their quick and stable adaptation characteristics.

AB - Incremental adaptation techniques for speech recognition are aimed at adjusting acoustic models quickly and stably to time-variant acoustic characteristics due to temporal changes of speaker, speaking style, noise source, etc. Recently we proposed a novel incremental adaptation framework based on a macroscopic time evolution system, which models the time-variant characteristics by successively updating posterior distributions of acoustic model parameters. In this paper, we provide a unified interpretation of the proposal and the two major conventional approaches of indirect adaptation via transformation parameters (e.g. Maximum Likelihood Linear Regression (MLLR)) and direct adaptation of classifier parameters (e.g. Maximum A Posteriori (MAP)). We reveal analytically and experimentally that the proposed incremental adaptation involves both the conventional and their combinatorial approaches, and simultaneously possesses their quick and stable adaptation characteristics.

KW - Acoustic model

KW - Incremental adaptation

KW - Indirect/direct adaptation

KW - Macroscopic time evolution

KW - Speech recognition

UR - http://www.scopus.com/inward/record.url?scp=51449104599&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=51449104599&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2008.4518602

DO - 10.1109/ICASSP.2008.4518602

M3 - Conference contribution

AN - SCOPUS:51449104599

SN - 1424414849

SN - 9781424414840

SP - 4285

EP - 4288

BT - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP

ER -