Singer identification based on accompaniment sound reduction and reliable frame selection

Hiromasa Fujihara*, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

*この研究の対応する著者

研究成果: Conference contribution

48 被引用数 (Scopus)

抄録

This paper describes a method for automatic singer identification from polyphonic musical audio signals including sounds of various instruments. Because singing voices play an important role in musical pieces with a vocal part, the identification of singer names is useful for music information retrieval systems. The main problem in automatically identifying singers is the negative influences caused by accompaniment sounds. To solve this problem, we developed two methods, accompaniment sound reduction and reliable frame selection. The former method makes it possible to identify the singer of a singing voice after reducing accompaniment sounds. It first extracts harmonic components of the predominant melody from sound mixtures and then resynthesizes the melody by using a sinusoidal model driven by those components. The latter method then judges whether each frame of the obtained melody is reliable (i.e. little influenced by accompaniment sound) or not by using two Gaussian mixture models for vocal and non-vocal frames. It enables the singer identification using only reliable vocal portions of musical pieces. Experimental results with forty popular-music songs by ten singers showed that our method was able to reduce the influences of accompaniment sounds and achieved an accuracy of 95%, while the accuracy for a conventional method was 53%.

本文言語English
ホスト出版物のタイトルISMIR 2005 - 6th International Conference on Music Information Retrieval
ページ329-336
ページ数8
出版ステータスPublished - 2005 12 1
外部発表はい
イベント6th International Conference on Music Information Retrieval, ISMIR 2005 - London, United Kingdom
継続期間: 2005 9 112005 9 15

出版物シリーズ

名前ISMIR 2005 - 6th International Conference on Music Information Retrieval

Conference

Conference6th International Conference on Music Information Retrieval, ISMIR 2005
国/地域United Kingdom
CityLondon
Period05/9/1105/9/15

ASJC Scopus subject areas

  • 音楽
  • 情報システム

フィンガープリント

「Singer identification based on accompaniment sound reduction and reliable frame selection」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル