Semi-automatic synthesis of videos of performers appearing to play user-specified music

Tomohiro Yamamoto, Makoto Okabe, Yusuke Hijikata, Rikio Onai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

We propose a method to synthesize the video of a user-specified band music, in which the performers appear to play it nicely. Given the music and videos of the band members as inputs, our system synthesizes the resulting video by semi-automatically cutting and concatenating the videos temporarily so that these synchronize to the music. To compute the synchronization between music and video, we analyze the timings of the musical notes of them, which we estimate from the audio signals by applying techniques including short-time Fourier transform (STFT), image processing, and sound source separation. Our video retrieval technique then uses the estimated timings of musical notes as the feature vector. To efficiently retrieve a part of the video that matches to a part of the music, we develop a novel feature matching technique more suitable for our feature vector than dynamic-time warping (DTW) algorithm. The output of our system is the project file of Adobe After Effects, on which the user can further refine the result interactively. In our experiment, we recorded videos of performances of playing the violin, piano, guitar, bass and drums. Each video is recorded independently for each instrument. We demonstrate that our system helps the non-expert performers who cannot play the music well to synthesize its performance videos. We also present that, given an arbitrary music as input, our system can synthesize its performance video by semi-automatically cutting and pasting existing videos.

Original languageEnglish
Title of host publication21st International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision, WSCG 2013 - Full Papers Proceedings
Pages179-186
Number of pages8
Publication statusPublished - 2013
Externally publishedYes
Event21st International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision, WSCG 2013 - Plzen, Czech Republic
Duration: 2013 Jun 242013 Jun 27

Other

Other21st International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision, WSCG 2013
CountryCzech Republic
CityPlzen
Period13/6/2413/6/27

    Fingerprint

Keywords

  • Video synthesis music analysis multimedia

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Computer Vision and Pattern Recognition

Cite this

Yamamoto, T., Okabe, M., Hijikata, Y., & Onai, R. (2013). Semi-automatic synthesis of videos of performers appearing to play user-specified music. In 21st International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision, WSCG 2013 - Full Papers Proceedings (pp. 179-186)