Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure

Takuya Yoshioka*, Tomohiro Nakatani, Hiroshi G. Okuno

*この研究の対応する著者

研究成果: Conference contribution

3 被引用数 (Scopus)

抄録

This paper considers the enhancement of noisy speech. Earlier studies have revealed that an approach that enhances spectral envelopes by using prior knowledge about the all-pole (AP) model parameters of clean speech learnt from speech corpora is advantageous in terms of the amount of musical noise and speech distortion. This paper proposes a new speech enhancement method, in which harmonic structure enhancement is incorporated in learning-based spectral envelope enhancement to further improve performance. The harmonic structure is represented by using a harmonic Gaussian mixture model (GMM), which is parameterized by a voicing indicator and a fundamental frequency. The parameters of the AP model and the harmonic GMM are jointly estimated by maximum a posteriori estimation, thus enabling the enhancement of spectral envelopes and harmonic structures in a unified framework. The proposed method outperforms the spectral envelope enhancement approach by 0.85 dB in cepstral distance.

本文言語English
ホスト出版物のタイトルICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ページ4270-4273
ページ数4
DOI
出版ステータスPublished - 2010
外部発表はい
イベント2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Dallas, TX
継続期間: 2010 3月 142010 3月 19

Other

Other2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010
CityDallas, TX
Period10/3/1410/3/19

ASJC Scopus subject areas

  • 信号処理
  • ソフトウェア
  • 電子工学および電気工学

フィンガープリント

「Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル