Comparing features for forming music streams in automatic music transcription

Yohei Sakuraba*, Tetsuro Kitahara, Hiroshi G. Okuno

*Corresponding author for this work

Research output: Conference contribution

11 Citations (Scopus)

Abstract

In forming temporal sequences of notes played by the same instrument (referred to as 'music streams'), the timbre of musical instruments may be a predominant feature. In polyphonic music, the performance of timbre extraction based on power-related features deteriorates, because such features are blurred when two or more frequency components are superimposed at the same frequency. To cope with this problem, we previously integrated timbre similarity and direction proximity with success, but left the use of other features as future work. In this paper, we investigate four features (timbre similarity, direction proximity, pitch transition, and pitch relation consistency) to clarify the precedence among them in music stream formation. Experimental results with quartet music show that direction proximity is the most dominant feature and pitch transition is the second most dominant. In addition, the performance of music stream formation improved from 63.3% with timbre similarity alone to 84.9% when all four features were integrated.

Original language: English
Title of host publication: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
4
Publication status: Published - 2004
Externally published: Yes
Event: Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing - Montreal, Que, Canada
Duration: 17 May 2004 to 21 May 2004

Other

Other: Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing
Country/Territory: Canada
City: Montreal, Que
Period: 17 May 2004 to 21 May 2004

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Signal Processing
  • Acoustics and Ultrasonics
