Singer identification based on accompaniment sound reduction and reliable frame selection

Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceedingConference contribution

45 Citations (Scopus)

Abstract

This paper describes a method for automatic singer identification from polyphonic musical audio signals including sounds of various instruments. Because singing voices play an important role in musical pieces with a vocal part, the identification of singer names is useful for music information retrieval systems. The main problem in automatically identifying singers is the negative influences caused by accompaniment sounds. To solve this problem, we developed two methods, accompaniment sound reduction and reliable frame selection. The former method makes it possible to identify the singer of a singing voice after reducing accompaniment sounds. It first extracts harmonic components of the predominant melody from sound mixtures and then resynthesizes the melody by using a sinusoidal model driven by those components. The latter method then judges whether each frame of the obtained melody is reliable (i.e. little influenced by accompaniment sound) or not by using two Gaussian mixture models for vocal and non-vocal frames. It enables the singer identification using only reliable vocal portions of musical pieces. Experimental results with forty popular-music songs by ten singers showed that our method was able to reduce the influences of accompaniment sounds and achieved an accuracy of 95%, while the accuracy for a conventional method was 53%.

Original languageEnglish
Title of host publicationISMIR 2005 - 6th International Conference on Music Information Retrieval
Pages329-336
Number of pages8
Publication statusPublished - 2005 Dec 1
Externally publishedYes
Event6th International Conference on Music Information Retrieval, ISMIR 2005 - London, United Kingdom
Duration: 2005 Sep 112005 Sep 15

Publication series

NameISMIR 2005 - 6th International Conference on Music Information Retrieval

Conference

Conference6th International Conference on Music Information Retrieval, ISMIR 2005
CountryUnited Kingdom
CityLondon
Period05/9/1105/9/15

Keywords

  • Artist identification
  • Melody extraction
  • Similarity-based MIR
  • Singer identification
  • Singing detection

ASJC Scopus subject areas

  • Music
  • Information Systems

Fingerprint Dive into the research topics of 'Singer identification based on accompaniment sound reduction and reliable frame selection'. Together they form a unique fingerprint.

  • Cite this

    Fujihara, H., Kitahara, T., Goto, M., Komatani, K., Ogata, T., & Okuno, H. G. (2005). Singer identification based on accompaniment sound reduction and reliable frame selection. In ISMIR 2005 - 6th International Conference on Music Information Retrieval (pp. 329-336). (ISMIR 2005 - 6th International Conference on Music Information Retrieval).