In this paper, mechanisms to recognize speech in time varying noise are proposed. In these methods, the noise source and the speech source are modeled independently. The probabilities that these two models emit observed data sequence are calculated. We tested two methods. The first one uses HMMs to model both of noise and speech. The second one uses a normal Morkov model for noise representation. Adopting normal Markov model, the noise model itself become rather complex, however, the dynamical features such as delta cepstrum can be easily and precisely considered. Using these methods, 100 word recognition tests in car noise environment are performed. As the results, the performances are improved by 23 % and 26 % using the first and the second method, respectively, as compared with normal spectral subtraction.
|出版ステータス||Published - 1993|
|イベント||3rd European Conference on Speech Communication and Technology, EUROSPEECH 1993 - Berlin, Germany|
継続期間: 1993 9月 22 → 1993 9月 25
|Conference||3rd European Conference on Speech Communication and Technology, EUROSPEECH 1993|
|Period||93/9/22 → 93/9/25|
ASJC Scopus subject areas
- コンピュータ サイエンスの応用