An automatic singing impression estimation method using factor analysis and multiple regression

Ai Kanato, Tomoyasu Nakano, Masataka Goto, Hideaki Kikuchi

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Citations (Scopus)

Abstract

This paper describes a method for estimating the impression of a singing voice from acoustic features. While much research has been conducted on singing impression, no method has yet been developed for determining appropriate words to represent the impressions created by a person's singing, primarily because a comprehensive evaluation scale has been lacking. We followed two steps: construction of such an impression scale, and development of models for estimating the impression score of each word. In the scale construction, two experiments were carried out. First, 44 words were selected as relevant words based on subjective evaluation. Second, 12 words were selected as an impression scale, and three factors ("powerful", "cautious", and "cheerful") were extracted by factor analysis. To estimate impression scores, a multiple regression model using acoustic features was constructed for each impression word. The models were tested by cross-validation. The average R² value for the 12 words of the complete scale was 0.567, and the R² values for the three factors were 0.863 (powerful), 0.381 (cautious), and 0.603 (cheerful).
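As an illustration of the evaluation step mentioned in the abstract, the following is a minimal sketch of how a cross-validated R² for a multiple regression model might be computed. It is not the authors' implementation: the function name `cross_validated_r2`, the five-fold split, the synthetic data, and the definition of R² as one minus the ratio of residual to total sum of squares over out-of-fold predictions are all assumptions, since the abstract does not specify the acoustic features, the fold scheme, or the exact R² definition.

```python
import numpy as np


def cross_validated_r2(X, y, n_folds=5, seed=0):
    """Return R^2 of out-of-fold predictions from least-squares regression.

    Hypothetical helper for illustration; the fold count and the R^2
    definition are assumptions, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(y))
    folds = np.array_split(order, n_folds)
    predictions = np.empty(len(y), dtype=float)

    for test_idx in folds:
        train_idx = np.setdiff1d(order, test_idx)
        # Prepend an intercept column and fit on the training samples only.
        A_train = np.column_stack([np.ones(len(train_idx)), X[train_idx]])
        coef, *_ = np.linalg.lstsq(A_train, y[train_idx], rcond=None)
        # Predict the held-out samples with the fitted coefficients.
        A_test = np.column_stack([np.ones(len(test_idx)), X[test_idx]])
        predictions[test_idx] = A_test @ coef

    ss_res = np.sum((y - predictions) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


# Synthetic stand-in data: 120 "recordings" with 4 made-up acoustic features
# and one impression score. None of these values come from the paper.
data_rng = np.random.default_rng(42)
X = data_rng.normal(size=(120, 4))
true_weights = np.array([1.5, -0.8, 0.0, 0.4])
y = X @ true_weights + data_rng.normal(scale=0.5, size=120)

r2 = cross_validated_r2(X, y)
print(f"cross-validated R^2: {r2:.3f}")
```

In a setting like the one described, such a function would be called once per impression word (or per factor score), with `y` holding the subjective ratings for that word.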

Original language: English
Title of host publication: Proceedings - 40th International Computer Music Conference, ICMC 2014 and 11th Sound and Music Computing Conference, SMC 2014 - Music Technology Meets Philosophy
Subtitle of host publication: From Digital Echos to Virtual Ethos
Editors: Georgios Kouroupetroglou, Anastasia Georgaki
Publisher: National and Kapodistrian University of Athens
Pages: 1244-1251
Number of pages: 8
ISBN (Electronic): 9789604661374
Publication status: Published - 2014 Jan 1
Event: 40th International Computer Music Conference, ICMC 2014, Joint with the 11th Sound and Music Computing Conference, SMC 2014 - Music Technology Meets Philosophy: From Digital Echos to Virtual Ethos - Athens, Greece
Duration: 2014 Sep 14 to 2014 Sep 20

Publication series

Name: Proceedings - 40th International Computer Music Conference, ICMC 2014 and 11th Sound and Music Computing Conference, SMC 2014 - Music Technology Meets Philosophy: From Digital Echos to Virtual Ethos

Conference

Conference: 40th International Computer Music Conference, ICMC 2014, Joint with the 11th Sound and Music Computing Conference, SMC 2014 - Music Technology Meets Philosophy: From Digital Echos to Virtual Ethos
Country: Greece
City: Athens
Period: 14/9/14 to 14/9/20

ASJC Scopus subject areas

  • Music
  • Media Technology
  • Computer Science Applications

Cite this

Kanato, A., Nakano, T., Goto, M., & Kikuchi, H. (2014). An automatic singing impression estimation method using factor analysis and multiple regression. In G. Kouroupetroglou, & A. Georgaki (Eds.), Proceedings - 40th International Computer Music Conference, ICMC 2014 and 11th Sound and Music Computing Conference, SMC 2014 - Music Technology Meets Philosophy: From Digital Echos to Virtual Ethos (pp. 1244-1251). (Proceedings - 40th International Computer Music Conference, ICMC 2014 and 11th Sound and Music Computing Conference, SMC 2014 - Music Technology Meets Philosophy: From Digital Echos to Virtual Ethos). National and Kapodistrian University of Athens.