A generalized discriminative training framework for system combination

Yuuki Tachioka, Shinji Watanabe, Jonathan Le Roux, John R. Hershey

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

This paper proposes a generalized discriminative training framework for system combination, which encompasses acoustic modeling (Gaussian mixture models and deep neural networks) and discriminative feature transformation. To improve the performance by combining base systems with complementary systems, complementary systems should have reasonably good performance while tending to have different outputs compared with the base system. Although it is difficult to balance these two somewhat opposite targets in conventional heuristic combination approaches, our framework provides a new objective function that enables to adjust the balance within a sequential discriminative training criterion. We also describe how the proposed method relates to boosting methods. Experiments on highly noisy middle vocabulary speech recognition task (2nd CHiME challenge track 2) and LVCSR task (Corpus of Spontaneous Japanese) show the effectiveness of the proposed method, compared with a conventional system combination approach.

Original languageEnglish
Title of host publication2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Proceedings
Pages43-48
Number of pages6
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Olomouc, Czech Republic
Duration: 2013 Dec 82013 Dec 13

Publication series

Name2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Proceedings

Other

Other2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013
Country/TerritoryCzech Republic
CityOlomouc
Period13/12/813/12/13

Keywords

  • boosting
  • discriminative training
  • margin training
  • system combination

ASJC Scopus subject areas

  • Speech and Hearing

Fingerprint

Dive into the research topics of 'A generalized discriminative training framework for system combination'. Together they form a unique fingerprint.

Cite this