TY - GEN
T1 - Incremental Bayesian audio-to-score alignment with flexible harmonic structure models
AU - Otsuka, Takuma
AU - Nakadai, Kazuhiro
AU - Ogata, Tetsuya
AU - Okuno, Hiroshi G.
PY - 2011/12/1
Y1 - 2011/12/1
N2 - Music information retrieval, especially the audio-to-score alignment problem, often involves a matching problem between the audio and symbolic representations. We must cope with uncertainty in the audio signal generated from the score in a symbolic representation, such as variation in the timbre or temporal fluctuations. Existing audio-to-score alignment methods are sometimes vulnerable to the uncertainty that arises when multiple notes are played simultaneously with a variety of timbres, because these methods rely on static observation models. For example, a chroma vector or a fixed harmonic structure template is used under the assumption that musical notes in a chord all have the same volume and timbre. This paper presents a particle filter-based audio-to-score alignment method with a flexible observation model based on latent harmonic allocation. Our method adapts the harmonic structure used for audio-to-score matching to the observed audio signal through Bayesian inference. Experimental results with 20 polyphonic songs reveal that our method is effective when a larger number of instruments are involved in the ensemble.
AB - Music information retrieval, especially the audio-to-score alignment problem, often involves a matching problem between the audio and symbolic representations. We must cope with uncertainty in the audio signal generated from the score in a symbolic representation, such as variation in the timbre or temporal fluctuations. Existing audio-to-score alignment methods are sometimes vulnerable to the uncertainty that arises when multiple notes are played simultaneously with a variety of timbres, because these methods rely on static observation models. For example, a chroma vector or a fixed harmonic structure template is used under the assumption that musical notes in a chord all have the same volume and timbre. This paper presents a particle filter-based audio-to-score alignment method with a flexible observation model based on latent harmonic allocation. Our method adapts the harmonic structure used for audio-to-score matching to the observed audio signal through Bayesian inference. Experimental results with 20 polyphonic songs reveal that our method is effective when a larger number of instruments are involved in the ensemble.
UR - http://www.scopus.com/inward/record.url?scp=84873592130&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84873592130&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84873592130
SN - 9780615548654
T3 - Proceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011
SP - 525
EP - 530
BT - Proceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011
T2 - 12th International Society for Music Information Retrieval Conference, ISMIR 2011
Y2 - 24 October 2011 through 28 October 2011
ER -