Bayesian nonnegative harmonic-temporal factorization and its application to multipitch analysis

Daichi Sakaue, Takuma Otsuka, Katsutoshi Itoyama, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

Since important musical features are mutually dependent, their relations should be analyzed simultaneously. Their Bayesian analysis is particularly important to reveal their statistical relation. As the first step for a unified music content analyzer, we focus on the harmonic and temporal structures of the wavelet spectrogram obtained from harmonic sounds. In this paper, we present a new Bayesian multipitch analyzer, called Bayesian non-negative harmonic-temporal factorization (BNHTF). BN-HTF models the harmonic and temporal structures separately based on Gaussian mixture model. The input signal is assumed to contain a finite number of harmonic sounds. Each harmonic sound is assumed to emit a large number of sound quanta over the time-log-frequency domain. The observation probability is expressed as the product of two Gaussian mixtures. The number of quanta is calculated in the e-neighborhood of each grid point on the spectrogram. BNHTF integrates latent harmonic allocation (LHA) and nonnegative matrix factorization (NMF) to estimate both the observation probability and the number of quanta. The model is optimized by newly designed deterministic procedures with several approximations for the variational Bayesian inference. Results of experiments on multipitch estimation with 40 musical pieces showed that BNHTF outperforms the conventional method by 0.018 in terms of F-measure on average.

Original languageEnglish
Title of host publicationProceedings of the 13th International Society for Music Information Retrieval Conference, ISMIR 2012
Pages91-96
Number of pages6
Publication statusPublished - 2012
Externally publishedYes
Event13th International Society for Music Information Retrieval Conference, ISMIR 2012 - Porto
Duration: 2012 Oct 82012 Oct 12

Other

Other13th International Society for Music Information Retrieval Conference, ISMIR 2012
CityPorto
Period12/10/812/10/12

    Fingerprint

ASJC Scopus subject areas

  • Music
  • Information Systems

Cite this

Sakaue, D., Otsuka, T., Itoyama, K., & Okuno, H. G. (2012). Bayesian nonnegative harmonic-temporal factorization and its application to multipitch analysis. In Proceedings of the 13th International Society for Music Information Retrieval Conference, ISMIR 2012 (pp. 91-96)