Nonlinear speech enhancement by virtual increase of channels and maximum SNR beamformer

Hiroki Katahira, Nobutaka Ono, Shigeki Miyabe, Takeshi Yamada, Shoji Makino*

*Corresponding author for this work

Research output: Article, peer-reviewed

12 citations (Scopus)

Abstract

In this paper, we propose a new microphone array signal processing technique, which increases the number of microphones virtually by generating extra signal channels from real microphone signals. Microphone array signal processing methods such as speech enhancement are effective for improving the quality of various speech applications such as speech recognition and voice communication systems. However, the performance of speech enhancement and other signal processing methods depends on the number of microphones. Thus, special equipment such as a multichannel A/D converter or a microphone array is needed to achieve high processing performance. Therefore, our aim was to establish a technique for improving the performance of array signal processing with a small number of microphones and, in particular, to increase the number of channels virtually by synthesizing virtual microphone signals, or extra signal channels, from two channels of microphone signals. Each virtual microphone signal is generated by interpolating a short-time Fourier transform (STFT) representation of the microphone signals. The phase and amplitude of the signal are interpolated individually. The phase is linearly interpolated on the basis of a sound propagation model, and the amplitude is nonlinearly interpolated on the basis of β divergence. We also performed speech enhancement experiments using a maximum signal-to-noise ratio (SNR) beamformer equipped with virtual microphones and evaluated the improvement in performance upon introducing virtual microphones.
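The interpolation step described in the abstract can be illustrated for a single STFT time-frequency bin. This is a minimal sketch, not the authors' implementation: the abstract does not state the formulas, so the amplitude rule below is an assumed power-mean form (geometric interpolation at β = 1, linear interpolation at β = 2), and the phase rule assumes the inter-microphone phase difference is wrapped to [-π, π] before being scaled. The names `wrap_phase`, `interpolate_amplitude`, `virtual_mic_bin`, `alpha`, and `eps` are hypothetical.

```python
import cmath
import math


def wrap_phase(angle: float) -> float:
    """Wrap an angle in radians to the interval [-pi, pi]."""
    return math.atan2(math.sin(angle), math.cos(angle))


def interpolate_amplitude(a1: float, a2: float, alpha: float,
                          beta: float, eps: float = 1e-12) -> float:
    """Assumed beta-dependent amplitude interpolation (power-mean form).

    alpha is the virtual position: 0 gives microphone 1, 1 gives microphone 2.
    eps is a hypothetical floor that keeps log() and negative powers finite.
    """
    a1, a2 = max(a1, eps), max(a2, eps)
    p = beta - 1.0
    if abs(p) < 1e-12:
        # beta = 1: interpolate log-amplitudes (weighted geometric mean).
        return math.exp((1.0 - alpha) * math.log(a1) + alpha * math.log(a2))
    # beta != 1: weighted power mean with exponent beta - 1.
    return ((1.0 - alpha) * a1 ** p + alpha * a2 ** p) ** (1.0 / p)


def virtual_mic_bin(x1: complex, x2: complex, alpha: float,
                    beta: float = 1.0) -> complex:
    """Synthesize one STFT bin of a virtual microphone from two real ones."""
    phi1 = cmath.phase(x1)
    # Linear phase interpolation along the wrapped phase difference.
    dphi = wrap_phase(cmath.phase(x2) - phi1)
    phase = phi1 + alpha * dphi
    amplitude = interpolate_amplitude(abs(x1), abs(x2), alpha, beta)
    return cmath.rect(amplitude, phase)


if __name__ == "__main__":
    # Two observed bins and a virtual microphone halfway between them.
    x1, x2 = 1.0 + 0.0j, 0.0 + 4.0j
    for beta in (0.0, 1.0, 2.0):
        print(beta, virtual_mic_bin(x1, x2, alpha=0.5, beta=beta))
```

In a full pipeline this function would be applied to every time-frequency bin of the two STFTs, and the resulting virtual channels would be stacked with the real ones before estimating the maximum SNR beamformer. Because the amplitude rule is nonlinear for β ≠ 2, the virtual channel is not a linear combination of the real channels, which is what lets it contribute extra information to the beamformer.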

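Original language: English
Article number: 11
Pages (from-to): 1-8
Number of pages: 8
Journal: EURASIP Journal on Advances in Signal Processing
Volume: 2016
Issue number: 1
Publication status: Published - 1 Dec 2016
Externally published: Yes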

ASJC Scopus subject areas

  • Signal Processing
  • Hardware and Architecture
  • Electrical and Electronic Engineering
