Feature analysis and normalization approach for robust content-based music retrieval to encoded audio with different bit rates

Shuhei Hamawaki, Shintaro Funasawa, Jiro Katto, Hiromi Ishizaki, Keiichiro Hoashi, Yasuhiro Takishima

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    5 Citations (Scopus)

    Abstract

    In order to achieve highly accurate content-based music information retrieval (MIR), it is necessary to compensate the various bit rates of encoded songs which are stored in the music collection, since the bit rate differences are expected to apply a negative effect to content-based MIR results. In this paper, we examine how the bit rate differences affect MIR results, propose methods to normalize MFCC features extracted from encoded files with various bit rates, and show their effects to stabilize MIR results.

    Original languageEnglish
    Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Pages298-309
    Number of pages12
    Volume5371 LNCS
    DOIs
    Publication statusPublished - 2009
    Event15th International Multimedia Modeling Conference, MMM 2009 - Sophia-Antipolis
    Duration: 2009 Jan 72009 Jan 9

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume5371 LNCS
    ISSN (Print)03029743
    ISSN (Electronic)16113349

    Other

    Other15th International Multimedia Modeling Conference, MMM 2009
    CitySophia-Antipolis
    Period09/1/709/1/9

    Fingerprint

    Music Information Retrieval
    Information retrieval
    Music
    Normalization
    Retrieval
    Normalize
    Necessary

    Keywords

    • Content-based MIR Normalization
    • Mel-Frequency Cepstral Coefficient (MFCC)

    ASJC Scopus subject areas

    • Computer Science(all)
    • Theoretical Computer Science

    Cite this

    Hamawaki, S., Funasawa, S., Katto, J., Ishizaki, H., Hoashi, K., & Takishima, Y. (2009). Feature analysis and normalization approach for robust content-based music retrieval to encoded audio with different bit rates. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5371 LNCS, pp. 298-309). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5371 LNCS). https://doi.org/10.1007/978-3-540-92892-8_32

    Feature analysis and normalization approach for robust content-based music retrieval to encoded audio with different bit rates. / Hamawaki, Shuhei; Funasawa, Shintaro; Katto, Jiro; Ishizaki, Hiromi; Hoashi, Keiichiro; Takishima, Yasuhiro.

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5371 LNCS 2009. p. 298-309 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5371 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Hamawaki, S, Funasawa, S, Katto, J, Ishizaki, H, Hoashi, K & Takishima, Y 2009, Feature analysis and normalization approach for robust content-based music retrieval to encoded audio with different bit rates. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 5371 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5371 LNCS, pp. 298-309, 15th International Multimedia Modeling Conference, MMM 2009, Sophia-Antipolis, 09/1/7. https://doi.org/10.1007/978-3-540-92892-8_32
    Hamawaki S, Funasawa S, Katto J, Ishizaki H, Hoashi K, Takishima Y. Feature analysis and normalization approach for robust content-based music retrieval to encoded audio with different bit rates. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5371 LNCS. 2009. p. 298-309. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-540-92892-8_32
    Hamawaki, Shuhei ; Funasawa, Shintaro ; Katto, Jiro ; Ishizaki, Hiromi ; Hoashi, Keiichiro ; Takishima, Yasuhiro. / Feature analysis and normalization approach for robust content-based music retrieval to encoded audio with different bit rates. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5371 LNCS 2009. pp. 298-309 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
    @inproceedings{091b767d9d004c0fbbfb098d2ad40a9f,
    title = "Feature analysis and normalization approach for robust content-based music retrieval to encoded audio with different bit rates",
    abstract = "In order to achieve highly accurate content-based music information retrieval (MIR), it is necessary to compensate the various bit rates of encoded songs which are stored in the music collection, since the bit rate differences are expected to apply a negative effect to content-based MIR results. In this paper, we examine how the bit rate differences affect MIR results, propose methods to normalize MFCC features extracted from encoded files with various bit rates, and show their effects to stabilize MIR results.",
    keywords = "Content-based MIR Normalization, Mel-Frequency Cepstral Coefficient (MFCC)",
    author = "Shuhei Hamawaki and Shintaro Funasawa and Jiro Katto and Hiromi Ishizaki and Keiichiro Hoashi and Yasuhiro Takishima",
    year = "2009",
    doi = "10.1007/978-3-540-92892-8_32",
    language = "English",
    isbn = "354092891X",
    volume = "5371 LNCS",
    series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
    pages = "298--309",
    booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

    }

    TY - GEN

    T1 - Feature analysis and normalization approach for robust content-based music retrieval to encoded audio with different bit rates

    AU - Hamawaki, Shuhei

    AU - Funasawa, Shintaro

    AU - Katto, Jiro

    AU - Ishizaki, Hiromi

    AU - Hoashi, Keiichiro

    AU - Takishima, Yasuhiro

    PY - 2009

    Y1 - 2009

    N2 - In order to achieve highly accurate content-based music information retrieval (MIR), it is necessary to compensate the various bit rates of encoded songs which are stored in the music collection, since the bit rate differences are expected to apply a negative effect to content-based MIR results. In this paper, we examine how the bit rate differences affect MIR results, propose methods to normalize MFCC features extracted from encoded files with various bit rates, and show their effects to stabilize MIR results.

    AB - In order to achieve highly accurate content-based music information retrieval (MIR), it is necessary to compensate the various bit rates of encoded songs which are stored in the music collection, since the bit rate differences are expected to apply a negative effect to content-based MIR results. In this paper, we examine how the bit rate differences affect MIR results, propose methods to normalize MFCC features extracted from encoded files with various bit rates, and show their effects to stabilize MIR results.

    KW - Content-based MIR Normalization

    KW - Mel-Frequency Cepstral Coefficient (MFCC)

    UR - http://www.scopus.com/inward/record.url?scp=59149097902&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=59149097902&partnerID=8YFLogxK

    U2 - 10.1007/978-3-540-92892-8_32

    DO - 10.1007/978-3-540-92892-8_32

    M3 - Conference contribution

    AN - SCOPUS:59149097902

    SN - 354092891X

    SN - 9783540928911

    VL - 5371 LNCS

    T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

    SP - 298

    EP - 309

    BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

    ER -