Mel-Generalized cepstral regularization for discriminative non-negative matrix factorization

Li Li, Hirokazu Kameoka, Shoji Makino

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The non-negative matrix factorization (NMF) approach has shown to work reasonably well for monaural speech enhancement tasks. This paper proposes addressing two shortcomings of the original NMF approach: (1) the objective functions for the basis training and separation (Wiener filtering) are inconsistent (the basis spectra are not trained so that the separated signal becomes optimal); (2) minimizing spectral divergence measures does not necessarily lead to an enhancement in the feature domain (e.g., cepstral domain) or in terms of perceived quality. To address the first shortcoming, we have previously proposed an algorithm for Discriminative NMF (DNMF), which optimizes the same objective for basis training and separation. To address the second shortcoming, we have previously introduced novel frameworks called the cepstral distance regularized NMF (CDRNMF) and mel-generalized cepstral distance regularized NMF (MGCRNMF), which aim to enhance speech both in the spectral domain and feature domain. This paper proposes combining the goals of DNMF and MGCRNMF by incorporating the MGC regularizer into the DNMF objective function and proposes an algorithm for parameter estimation. The experimental results revealed that the proposed method outperformed the baseline approaches.

Original languageEnglish
Title of host publication2017 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2017 - Proceedings
EditorsNaonori Ueda, Jen-Tzung Chien, Tomoko Matsui, Jan Larsen, Shinji Watanabe
PublisherIEEE Computer Society
Pages1-6
Number of pages6
ISBN (Electronic)9781509063413
DOIs
Publication statusPublished - 2017 Dec 5
Externally publishedYes
Event2017 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2017 - Tokyo, Japan
Duration: 2017 Sept 252017 Sept 28

Publication series

NameIEEE International Workshop on Machine Learning for Signal Processing, MLSP
Volume2017-September
ISSN (Print)2161-0363
ISSN (Electronic)2161-0371

Other

Other2017 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2017
Country/TerritoryJapan
CityTokyo
Period17/9/2517/9/28

Keywords

  • Discriminative non-negative matrix factorization
  • Mel-generalized cepstral representation
  • Single-channel
  • Speech enhancement

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Signal Processing

Fingerprint

Dive into the research topics of 'Mel-Generalized cepstral regularization for discriminative non-negative matrix factorization'. Together they form a unique fingerprint.

Cite this