Abstract
Since important musical features are mutually dependent, their relations should be analyzed simultaneously. Their Bayesian analysis is particularly important to reveal their statistical relation. As the first step for a unified music content analyzer, we focus on the harmonic and temporal structures of the wavelet spectrogram obtained from harmonic sounds. In this paper, we present a new Bayesian multipitch analyzer, called Bayesian non-negative harmonic-temporal factorization (BNHTF). BN-HTF models the harmonic and temporal structures separately based on Gaussian mixture model. The input signal is assumed to contain a finite number of harmonic sounds. Each harmonic sound is assumed to emit a large number of sound quanta over the time-log-frequency domain. The observation probability is expressed as the product of two Gaussian mixtures. The number of quanta is calculated in the e-neighborhood of each grid point on the spectrogram. BNHTF integrates latent harmonic allocation (LHA) and nonnegative matrix factorization (NMF) to estimate both the observation probability and the number of quanta. The model is optimized by newly designed deterministic procedures with several approximations for the variational Bayesian inference. Results of experiments on multipitch estimation with 40 musical pieces showed that BNHTF outperforms the conventional method by 0.018 in terms of F-measure on average.
Original language | English |
---|---|
Title of host publication | Proceedings of the 13th International Society for Music Information Retrieval Conference, ISMIR 2012 |
Pages | 91-96 |
Number of pages | 6 |
Publication status | Published - 2012 |
Externally published | Yes |
Event | 13th International Society for Music Information Retrieval Conference, ISMIR 2012 - Porto Duration: 2012 Oct 8 → 2012 Oct 12 |
Other
Other | 13th International Society for Music Information Retrieval Conference, ISMIR 2012 |
---|---|
City | Porto |
Period | 12/10/8 → 12/10/12 |
ASJC Scopus subject areas
- Music
- Information Systems