Incremental bayesian audio-to-score alignment with flexible harmonic structure models

Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Citations (Scopus)

Abstract

Music information retrieval, especially the audio-to-score alignment problem, often involves a matching problem between the audio and symbolic representations. We must cope with uncertainty in the audio signal generated from the score in a symbolic representation such as the variation in the timbre or temporal fluctuations. Existing audio-to-score alignment methods are sometimes vulnerable to the uncertainty in which multiple notes are simultaneously played with a variety of timbres because these methods rely on static observation models. For example, a chroma vector or a fixed harmonic structure template is used under the assumption that musical notes in a chord are all in the same volume and timbre. This paper presents a particle filterbased audio-to-score alignment method with a flexible observation model based on latent harmonic allocation. Our method adapts to the harmonic structure for the audio-toscore matching based on the observation of the audio signal through Bayesian inference. Experimental results with 20 polyphonic songs reveal that our method is effective when more number of instruments are involved in the ensemble.

Original languageEnglish
Title of host publicationProceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011
Pages525-530
Number of pages6
Publication statusPublished - 2011 Dec 1
Event12th International Society for Music Information Retrieval Conference, ISMIR 2011 - Miami, FL, United States
Duration: 2011 Oct 242011 Oct 28

Publication series

NameProceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011

Conference

Conference12th International Society for Music Information Retrieval Conference, ISMIR 2011
CountryUnited States
CityMiami, FL
Period11/10/2411/10/28

    Fingerprint

ASJC Scopus subject areas

  • Music
  • Information Systems

Cite this

Otsuka, T., Nakadai, K., Ogata, T., & Okuno, H. G. (2011). Incremental bayesian audio-to-score alignment with flexible harmonic structure models. In Proceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011 (pp. 525-530). (Proceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011).