Bayesian nonnegative harmonic-temporal factorization and its application to multipitch analysis

Daichi Sakaue*, Takuma Otsuka, Katsutoshi Itoyama, Hiroshi G. Okuno

*Corresponding author for this work

Research output: Conference contribution

6 Citations (Scopus)

Abstract

Since important musical features are mutually dependent, their relations should be analyzed simultaneously. Bayesian analysis is particularly important for revealing the statistical relations among them. As the first step toward a unified music content analyzer, we focus on the harmonic and temporal structures of the wavelet spectrogram obtained from harmonic sounds. In this paper, we present a new Bayesian multipitch analyzer, called Bayesian nonnegative harmonic-temporal factorization (BNHTF). BNHTF models the harmonic and temporal structures separately, each with a Gaussian mixture model. The input signal is assumed to contain a finite number of harmonic sounds. Each harmonic sound is assumed to emit a large number of sound quanta over the time-log-frequency domain. The observation probability is expressed as the product of two Gaussian mixtures. The number of quanta is calculated in the ε-neighborhood of each grid point on the spectrogram. BNHTF integrates latent harmonic allocation (LHA) and nonnegative matrix factorization (NMF) to estimate both the observation probability and the number of quanta. The model is optimized by newly designed deterministic procedures with several approximations for the variational Bayesian inference. Experiments on multipitch estimation with 40 musical pieces showed that BNHTF outperforms the conventional method by 0.018 in F-measure on average.
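The statement that the observation probability is a product of two Gaussian mixtures can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the parameter values, the shared standard deviation per mixture, and the placement of the m-th partial at log f0 + log m on a natural-log frequency axis are all assumptions made for illustration, and no inference (variational Bayesian or otherwise) is performed.

```python
import numpy as np


def gaussian_pdf(x, mean, std):
    """Univariate normal density, broadcast over x and mean."""
    z = (x - mean) / std
    return np.exp(-0.5 * z ** 2) / (std * np.sqrt(2.0 * np.pi))


def harmonic_mixture(log_freq, log_f0, weights, std):
    """Gaussian mixture over log-frequency (hypothetical parameterization).

    Assumption: the m-th partial is centered at log_f0 + log(m).
    """
    weights = np.asarray(weights, dtype=float)
    partials = np.arange(1, len(weights) + 1)
    means = log_f0 + np.log(partials)
    components = gaussian_pdf(log_freq[:, None], means[None, :], std)
    return components @ weights


def temporal_mixture(time, centers, weights, std):
    """Gaussian mixture over time (hypothetical parameterization)."""
    centers = np.asarray(centers, dtype=float)
    weights = np.asarray(weights, dtype=float)
    components = gaussian_pdf(time[:, None], centers[None, :], std)
    return components @ weights


def observation_density(log_freq, time, log_f0, harm_w, harm_std,
                        centers, temp_w, temp_std):
    """Density over the time-log-frequency grid for one harmonic sound:
    the outer product of the harmonic and temporal mixtures."""
    harmonic = harmonic_mixture(log_freq, log_f0, harm_w, harm_std)
    temporal = temporal_mixture(time, centers, temp_w, temp_std)
    return np.outer(harmonic, temporal)


# Illustrative values only; none of them come from the paper.
log_freq = np.linspace(np.log(50.0), np.log(8000.0), 2000)
time = np.linspace(-2.0, 6.0, 800)
density = observation_density(
    log_freq, time,
    log_f0=np.log(220.0), harm_w=[0.4, 0.3, 0.2, 0.1], harm_std=0.02,
    centers=[1.0, 2.0, 3.0], temp_w=[0.5, 0.3, 0.2], temp_std=0.3,
)
```

Because each mixture is normalized on its own axis when its weights sum to one, the product is a valid joint density over the grid; a full model would combine several such sources, one per harmonic sound.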

Original language: English
Title of host publication: Proceedings of the 13th International Society for Music Information Retrieval Conference, ISMIR 2012
Pages: 91-96
Number of pages: 6
Publication status: Published - 2012
Externally published: Yes
Event: 13th International Society for Music Information Retrieval Conference, ISMIR 2012 - Porto
Duration: 8 Oct 2012 to 12 Oct 2012

Other

Other: 13th International Society for Music Information Retrieval Conference, ISMIR 2012
City: Porto
Period: 8 Oct 2012 to 12 Oct 2012

ASJC Scopus subject areas

  • Music
  • Information Systems
