Sound annotation tool for multidirectional sounds based on spatial information extracted by HARK robot audition software

Osamu Sugiyama, Katsutoshi Itoyama, Kazuhiro Nakada, Hiroshi G. Okuno

    研究成果: Conference contribution

    1 引用 (Scopus)

    抜粋

    With the rise of inexpensive microphone array products and the robot audition software called HARK, we can record and analyze multidirectional sound sources easily. The combination of microphone array and the software enables us to separate, localize, and track multidirectional sound sources. Most of the solutions for accessing these separated sound source information provide clients for interpreting simplified information about the separated sources, but not to directly execute the semantic annotations. Since the multidirectional sound annotation requires simultaneous labeling of separated sound sources and a multidirectional overview of the sources, it is essential to have an efficient way of annotation and an intuitive view of multidirectional sounds. Our proposed sound annotation tool provides drag & drop operation of annotation with a 3D sound source view and also provides annotation autocompletion with a SVM trained with the user's annotation history. The proposed features enable users to do the annotation task intuitively and confirm its result. We also conducted an evaluation demonstrating the efficiency of annotation done using the tool.

    元の言語English
    ホスト出版物のタイトルConference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
    出版者Institute of Electrical and Electronics Engineers Inc.
    ページ2335-2340
    ページ数6
    2014-January
    エディションJanuary
    DOI
    出版物ステータスPublished - 2014
    イベント2014 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2014 - San Diego, United States
    継続期間: 2014 10 52014 10 8

    Other

    Other2014 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2014
    United States
    San Diego
    期間14/10/514/10/8

      フィンガープリント

    ASJC Scopus subject areas

    • Electrical and Electronic Engineering
    • Control and Systems Engineering
    • Human-Computer Interaction

    これを引用

    Sugiyama, O., Itoyama, K., Nakada, K., & Okuno, H. G. (2014). Sound annotation tool for multidirectional sounds based on spatial information extracted by HARK robot audition software. : Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics (January 版, 巻 2014-January, pp. 2335-2340). [6974275] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/smc.2014.6974275