Discriminative approach to dynamic variance adaptation for noisy speech recognition

Marc Delcroix, Shinji Watanabe, Tomohiro Nakatani, Atsushi Nakamura

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

The performance of automatic speech recognition suffers from severe degradation in the presence of noise or reverberation. One conventional approach for handling such acoustic distortions is to use a speech enhancement technique prior to recognition. However, most speech enhancement techniques introduce artifacts that create a mismatch between the enhanced speech features and the acoustic model used for recognition, therefore limiting the improvement in recognition performance. Recently, there has been increased interest in methods capable of compensating for such a mismatch by accounting for the feature variance during decoding. In this paper, we propose to estimate the feature variance using an adaptation technique based on a discriminative criterion. In an experiment using the Aurora2 database, the proposed method could achieve significant digit error rate reduction compared with a spectral subtraction pre-processor, and using a discriminative criterion for adaptation provided further improvement compared with maximum likelihood estimation.

Original languageEnglish
Title of host publication2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, HSCMA'11
Pages7-12
Number of pages6
DOIs
Publication statusPublished - 2011 Aug 10
Externally publishedYes
Event2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, HSCMA'11 - Edinburgh, United Kingdom
Duration: 2011 May 302011 Jun 1

Publication series

Name2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, HSCMA'11

Other

Other2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, HSCMA'11
CountryUnited Kingdom
CityEdinburgh
Period11/5/3011/6/1

Keywords

  • MMI
  • Model Adaptation
  • Noise reduction
  • Robust ASR
  • Variance Compensation

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Signal Processing
  • Communication

Fingerprint Dive into the research topics of 'Discriminative approach to dynamic variance adaptation for noisy speech recognition'. Together they form a unique fingerprint.

  • Cite this

    Delcroix, M., Watanabe, S., Nakatani, T., & Nakamura, A. (2011). Discriminative approach to dynamic variance adaptation for noisy speech recognition. In 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, HSCMA'11 (pp. 7-12). [5942414] (2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, HSCMA'11). https://doi.org/10.1109/HSCMA.2011.5942414