Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus

Daichi Sakaue, Katsutoshi Itoyama, Tetsuya Ogata, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

We present a new method for modeling the overtone structures of musical instruments that uses an overtone corpus generated using a MIDI synthesizer. Since multipitch estimation requires a joint estimation of F0's and their overtone structures, one of the most important problems is the overtone structure modeling. Latent harmonic allocation (LHA), a promising multipitch estimation method, is difficult to use for various applications because it requires appropriate prior distributions of the overtone structures, which cannot be determined from statistical evidence. Our method uses an overtone corpus to avoid the problem of setting prior distributions and instead restricts the lower and upper bounds of each overtone weight. The bounds are determined from reference signals generated by a MIDI synthesizer. Experimental results demonstrated that the overtone structures were stably and accurately estimated for a wide variety of initial settings.

Original languageEnglish
Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Pages425-428
Number of pages4
DOIs
Publication statusPublished - 2012
Externally publishedYes
Event2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Kyoto
Duration: 2012 Mar 252012 Mar 30

Other

Other2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012
CityKyoto
Period12/3/2512/3/30

Fingerprint

Musical instruments

Keywords

  • harmonic clustering
  • Multipitch estimation
  • musical instrument sounds
  • overtone estimation

ASJC Scopus subject areas

  • Signal Processing
  • Software
  • Electrical and Electronic Engineering

Cite this

Sakaue, D., Itoyama, K., Ogata, T., & Okuno, H. G. (2012). Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp. 425-428). [6287907] https://doi.org/10.1109/ICASSP.2012.6287907

Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus. / Sakaue, Daichi; Itoyama, Katsutoshi; Ogata, Tetsuya; Okuno, Hiroshi G.

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2012. p. 425-428 6287907.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sakaue, D, Itoyama, K, Ogata, T & Okuno, HG 2012, Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus. in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings., 6287907, pp. 425-428, 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012, Kyoto, 12/3/25. https://doi.org/10.1109/ICASSP.2012.6287907
Sakaue D, Itoyama K, Ogata T, Okuno HG. Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2012. p. 425-428. 6287907 https://doi.org/10.1109/ICASSP.2012.6287907
Sakaue, Daichi ; Itoyama, Katsutoshi ; Ogata, Tetsuya ; Okuno, Hiroshi G. / Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2012. pp. 425-428
@inproceedings{c4ace4956e344c969e17e41f127f8a4c,
title = "Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus",
abstract = "We present a new method for modeling the overtone structures of musical instruments that uses an overtone corpus generated using a MIDI synthesizer. Since multipitch estimation requires a joint estimation of F0's and their overtone structures, one of the most important problems is the overtone structure modeling. Latent harmonic allocation (LHA), a promising multipitch estimation method, is difficult to use for various applications because it requires appropriate prior distributions of the overtone structures, which cannot be determined from statistical evidence. Our method uses an overtone corpus to avoid the problem of setting prior distributions and instead restricts the lower and upper bounds of each overtone weight. The bounds are determined from reference signals generated by a MIDI synthesizer. Experimental results demonstrated that the overtone structures were stably and accurately estimated for a wide variety of initial settings.",
keywords = "harmonic clustering, Multipitch estimation, musical instrument sounds, overtone estimation",
author = "Daichi Sakaue and Katsutoshi Itoyama and Tetsuya Ogata and Okuno, {Hiroshi G.}",
year = "2012",
doi = "10.1109/ICASSP.2012.6287907",
language = "English",
isbn = "9781467300469",
pages = "425--428",
booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

}

TY - GEN

T1 - Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus

AU - Sakaue, Daichi

AU - Itoyama, Katsutoshi

AU - Ogata, Tetsuya

AU - Okuno, Hiroshi G.

PY - 2012

Y1 - 2012

N2 - We present a new method for modeling the overtone structures of musical instruments that uses an overtone corpus generated using a MIDI synthesizer. Since multipitch estimation requires a joint estimation of F0's and their overtone structures, one of the most important problems is the overtone structure modeling. Latent harmonic allocation (LHA), a promising multipitch estimation method, is difficult to use for various applications because it requires appropriate prior distributions of the overtone structures, which cannot be determined from statistical evidence. Our method uses an overtone corpus to avoid the problem of setting prior distributions and instead restricts the lower and upper bounds of each overtone weight. The bounds are determined from reference signals generated by a MIDI synthesizer. Experimental results demonstrated that the overtone structures were stably and accurately estimated for a wide variety of initial settings.

AB - We present a new method for modeling the overtone structures of musical instruments that uses an overtone corpus generated using a MIDI synthesizer. Since multipitch estimation requires a joint estimation of F0's and their overtone structures, one of the most important problems is the overtone structure modeling. Latent harmonic allocation (LHA), a promising multipitch estimation method, is difficult to use for various applications because it requires appropriate prior distributions of the overtone structures, which cannot be determined from statistical evidence. Our method uses an overtone corpus to avoid the problem of setting prior distributions and instead restricts the lower and upper bounds of each overtone weight. The bounds are determined from reference signals generated by a MIDI synthesizer. Experimental results demonstrated that the overtone structures were stably and accurately estimated for a wide variety of initial settings.

KW - harmonic clustering

KW - Multipitch estimation

KW - musical instrument sounds

KW - overtone estimation

UR - http://www.scopus.com/inward/record.url?scp=84867585341&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84867585341&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2012.6287907

DO - 10.1109/ICASSP.2012.6287907

M3 - Conference contribution

AN - SCOPUS:84867585341

SN - 9781467300469

SP - 425

EP - 428

BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

ER -