TY - JOUR
T1 - Bayesian audio-to-score alignment based on joint inference of timbre, volume, tempo, and note onset timings
AU - Maezawa, Akira
AU - Okuno, Hiroshi G.
PY - 2015/3/27
Y1 - 2015/3/27
N2 - This article presents an offline method for aligning an audio signal to individual instrumental parts constituting a musical score. The proposed method is based on fitting multiple hidden semi-Markov models (HSMMs) to the observed audio signal. The emission probability of each state of the HSMM is described using latent harmonic allocation (LHA), a Bayesian model of a harmonic sound mixture. Each HSMM corresponds to one musical instrument's part, and the state duration probability is conditioned on a linear dynamics system (LDS) tempo model. Variational Bayesian inference is used to jointly infer LHA, HSMM, and the LDS. We evaluate the capability of the method to align musical audio to its score, under reverberation, structural variations, and fluctuations in onset timing among different parts.
AB - This article presents an offline method for aligning an audio signal to individual instrumental parts constituting a musical score. The proposed method is based on fitting multiple hidden semi-Markov models (HSMMs) to the observed audio signal. The emission probability of each state of the HSMM is described using latent harmonic allocation (LHA), a Bayesian model of a harmonic sound mixture. Each HSMM corresponds to one musical instrument's part, and the state duration probability is conditioned on a linear dynamics system (LDS) tempo model. Variational Bayesian inference is used to jointly infer LHA, HSMM, and the LDS. We evaluate the capability of the method to align musical audio to its score, under reverberation, structural variations, and fluctuations in onset timing among different parts.
UR - http://www.scopus.com/inward/record.url?scp=84925601978&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84925601978&partnerID=8YFLogxK
U2 - 10.1162/COMJ_a_00286
DO - 10.1162/COMJ_a_00286
M3 - Article
AN - SCOPUS:84925601978
VL - 39
SP - 74
EP - 87
JO - Computer Music Journal
JF - Computer Music Journal
SN - 0148-9267
IS - 1
ER -