抄録
This paper discusses a statistical-model-based approach to speech dereverberation. With this approach, we first define parametric statistical models of probability density functions (pdfs) for a clean speech signal and a room transmission channel, then estimate the model parameters, and finally recover the clean speech signal by using the pdfs with the estimated parameter values. The key to the success of this approach lies in the definition of the models of the clean speech signal and room transmission channel pdfs. This paper presents several statistical models (including newly proposed ones) and compares them in a large-scale experiment. As regards the room transmission channel pdf, an autoregressive (AR) model, an autoregressive power spectral density (ARPSD) model, and a moving-average power spectral density (MAPSD) model are considered. A clean speech signal pdf model is selected according to the room transmission channel pdf model. The AR model exhibited the highest dereverberation accuracy when a reverberant speech signal of 2 sec or longer was available while the other two models outperformed the AR model when only a l-sec reverberant speech signal was available.
本文言語 | English |
---|---|
ホスト出版物のタイトル | IEEE Workshop on Applications of Signal Processing to Audio and Acoustics |
ページ | 145-148 |
ページ数 | 4 |
DOI | |
出版ステータス | Published - 2009 |
外部発表 | はい |
イベント | 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009 - New Paltz, NY 継続期間: 2009 10月 18 → 2009 10月 21 |
Other
Other | 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009 |
---|---|
City | New Paltz, NY |
Period | 09/10/18 → 09/10/21 |
ASJC Scopus subject areas
- 電子工学および電気工学
- コンピュータ サイエンスの応用