Unified inter- and intra-recording duration model for multiple music audio alignment

Akira Maezawa, Katsutoshi Itoyama, Kazuyoshi Yoshii, Hiroshi G. Okuno

    Research output: Conference contribution

    2 Citations (Scopus)

    Abstract

    This paper presents a probabilistic audio-to-audio alignment method that focuses on the relationship among the note durations of different performances of a piece of music. A key issue in probabilistic audio alignment methods is expressing how the durations of notes in the underlying piece of music are interrelated. Existing studies focus on either the durations of adjacent notes within a recording (intra-recording duration model) or the duration of a given note across different recordings (inter-recording duration model). This paper unifies the two approaches through a simple modification. Furthermore, the paper extends the unified model, allowing the dynamics of the note duration to change sporadically. Experimental evaluation demonstrated that the proposed models decrease the alignment error.

    Original language: English
    Title of host publication: 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2015
    Publisher: Institute of Electrical and Electronics Engineers Inc.
    ISBN (Print): 9781479974504
    DOI
    Publication status: Published - 24 Nov 2015
    Event: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2015 - New Paltz, United States
    Duration: 18 Oct 2015 to 21 Oct 2015

    Other

    Other: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2015
    Country/Territory: United States
    City: New Paltz
    Period: 15/10/18 to 15/10/21

    ASJC Scopus subject areas

    • Computer Science Applications
    • Signal Processing
    • Media Technology
