Bayesian nonnegative harmonic-temporal factorization and its application to multipitch analysis

Daichi Sakaue, Takuma Otsuka, Katsutoshi Itoyama, Hiroshi G. Okuno

Research output: Conference contribution

4 Citations (Scopus)

Abstract

Since important musical features are mutually dependent, their relations should be analyzed simultaneously. Their Bayesian analysis is particularly important to reveal their statistical relation. As a first step toward a unified music content analyzer, we focus on the harmonic and temporal structures of the wavelet spectrogram obtained from harmonic sounds. In this paper, we present a new Bayesian multipitch analyzer, called Bayesian nonnegative harmonic-temporal factorization (BNHTF). BNHTF models the harmonic and temporal structures separately, each with a Gaussian mixture model. The input signal is assumed to contain a finite number of harmonic sounds. Each harmonic sound is assumed to emit a large number of sound quanta over the time-log-frequency domain. The observation probability is expressed as the product of two Gaussian mixtures. The number of quanta is calculated in the ε-neighborhood of each grid point on the spectrogram. BNHTF integrates latent harmonic allocation (LHA) and nonnegative matrix factorization (NMF) to estimate both the observation probability and the number of quanta. The model is optimized by newly designed deterministic procedures with several approximations for the variational Bayesian inference. Results of experiments on multipitch estimation with 40 musical pieces showed that BNHTF outperforms the conventional method by 0.018 in terms of F-measure on average.
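
The statement that the observation probability is a product of two Gaussian mixtures can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes, for illustration only, that one mixture runs over time and the other over log2-frequency with components centered at log2(f0) + log2(n) for the n-th partial. All function names, dictionary keys, and numeric values below are hypothetical, and the variational Bayesian inference and the NMF part of the model are not shown.

```python
import math


def gaussian_pdf(x, mean, std):
    """Density of a univariate normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def mixture_pdf(x, weights, means, stds):
    """Density of a univariate Gaussian mixture at x."""
    return sum(w * gaussian_pdf(x, m, s) for w, m, s in zip(weights, means, stds))


def harmonic_means(log2_f0, n_partials):
    """Assumed partial centers on a log2-frequency axis: log2(f0) + log2(n)."""
    return [log2_f0 + math.log2(n) for n in range(1, n_partials + 1)]


def observation_density(t, log2_f, source):
    """Product of a temporal mixture over t and a harmonic mixture over log2_f."""
    n_partials = len(source["partial_weights"])
    temporal = mixture_pdf(
        t, source["time_weights"], source["time_means"], source["time_stds"]
    )
    harmonic = mixture_pdf(
        log2_f,
        source["partial_weights"],
        harmonic_means(source["log2_f0"], n_partials),
        [source["partial_std"]] * n_partials,
    )
    return temporal * harmonic


# Hypothetical harmonic sound: f0 = 440 Hz, three partials, active near t = 1.0 s.
example_source = {
    "log2_f0": math.log2(440.0),
    "partial_weights": [0.6, 0.3, 0.1],
    "partial_std": 0.02,
    "time_weights": [0.5, 0.5],
    "time_means": [0.9, 1.1],
    "time_stds": [0.1, 0.1],
}

# The density at the second partial (880 Hz) exceeds the density between partials (600 Hz).
on_partial = observation_density(1.0, math.log2(880.0), example_source)
off_partial = observation_density(1.0, math.log2(600.0), example_source)
print(on_partial > off_partial)
```

Because the density factorizes, the temporal and harmonic mixtures can be evaluated and updated separately in this sketch, which matches the abstract's description of the two structures being modeled separately.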

Original language: English
Title of host publication: Proceedings of the 13th International Society for Music Information Retrieval Conference, ISMIR 2012
Pages: 91-96
Number of pages: 6
Publication status: Published - 2012
Externally published: Yes
Event: 13th International Society for Music Information Retrieval Conference, ISMIR 2012 - Porto
Duration: 8 Oct 2012 to 12 Oct 2012

ASJC Scopus subject areas

  • Music
  • Information Systems

Cite this

Sakaue, D., Otsuka, T., Itoyama, K., & Okuno, H. G. (2012). Bayesian nonnegative harmonic-temporal factorization and its application to multipitch analysis. In Proceedings of the 13th International Society for Music Information Retrieval Conference, ISMIR 2012 (pp. 91-96).