Automatic singing voice to music video generation via mashup of singing video clips

Tatsunori Hirai, Yukara Ikemiya, Kazuyoshi Yoshii, Tomoyasu Nakano, Masataka Goto, Shigeo Morishima

研究成果: Conference contribution

抄録

This paper presents a system that takes audio signals of any song sung by a singer as the input and automatically generates a music video clip in which the singer appears to be actually singing the song. Although music video clips have gained the popularity in video streaming services, not all existing songs have corresponding video clips. Given a song sung by a singer, our system generates a singing video clip by reusing existing singing video clips featuring the singer. More specifically, the system retrieves short fragments of singing video clips that include singing voices similar to that in target song, and then concatenates these fragments using a technique of dynamic programming (DP). To achieve this, we propose a method to extract singing scenes from music video clips by combining vocal activity detection (VAD) with mouth aperture detection (MAD). The subjective experimental results demonstrate the effectiveness of our system.

本文言語English
ホスト出版物のタイトルProceedings of the 12th International Conference in Sound and Music Computing, SMC 2015
出版社Music Technology Research Group, Department of Computer Science, Maynooth University
ページ153-159
ページ数7
ISBN(電子版)9780992746629
出版ステータスPublished - 2015
イベント12th International Conference on Sound and Music Computing, SMC 2015 - Maynooth, Ireland
継続期間: 2015 7 302015 8 1

出版物シリーズ

名前Proceedings of the 12th International Conference in Sound and Music Computing, SMC 2015

Other

Other12th International Conference on Sound and Music Computing, SMC 2015
CountryIreland
CityMaynooth
Period15/7/3015/8/1

ASJC Scopus subject areas

  • Music
  • Computer Science Applications
  • Media Technology

フィンガープリント 「Automatic singing voice to music video generation via mashup of singing video clips」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル