CNN-based virtual microphone signal estimation for MPDR beamforming in underdetermined situations

Kouei Yamaoka, Li L. Li, Nobutaka Ono, Shoji Makino, Takeshi Yamada

研究成果: Conference contribution

4 被引用数 (Scopus)

抄録

In this paper, we propose a novel approach to virtually increasing the number of microphone elements between two real microphones to improve speech enhancement performance in underdetermined situations. The virtual microphone technique, with which virtual signals in the audio signal domain are estimated by linearly interpolating the phase and nonlinearly interpolating the amplitude independently on the basis of βdivergence, has been recently proposed and experimentally shown to be effective in improving speech enhancement performance. Furthermore, it has been reported that the performance tends to improve as the nonlinearity is improved. However, one drawback of this method is that the interpolation is employed in each time-frequency bin independently, in which the spectral and temporal structures of speech signals are ignored. To address this problem and improve the nonlinearity, motivated by the high capability of neural networks to model nonlinear functions and speech spectrograms, in this paper, we propose an alternative method of amplitude interpolation. In this method, we employ a convolutional neural network as an amplitude estimator that minimizes the mean squared error between the outputs of a minimum power distortionless response (MPDR) beamformer and the target speech signals. The experimental results revealed that the proposed method showed high potential for improving speech enhancement performance, which was not only superior to that of the conventional virtual microphone technique but also the performance in the corresponding determined situation.

本文言語English
ホスト出版物のタイトルEUSIPCO 2019 - 27th European Signal Processing Conference
出版社European Signal Processing Conference, EUSIPCO
ISBN(電子版)9789082797039
DOI
出版ステータスPublished - 2019 9
外部発表はい
イベント27th European Signal Processing Conference, EUSIPCO 2019 - A Coruna, Spain
継続期間: 2019 9 22019 9 6

出版物シリーズ

名前European Signal Processing Conference
2019-September
ISSN(印刷版)2219-5491

Conference

Conference27th European Signal Processing Conference, EUSIPCO 2019
国/地域Spain
CityA Coruna
Period19/9/219/9/6

ASJC Scopus subject areas

  • 信号処理
  • 電子工学および電気工学

フィンガープリント

「CNN-based virtual microphone signal estimation for MPDR beamforming in underdetermined situations」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル