3D Auditory Scene Visualizer with face tracking: Design and implementation for auditory awareness compensation

Yuji Kubota, Shun Shiramatsu, Masatoshi Yoshida, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution

3 Citations (Scopus)

Abstract

This paper presents the design and implementation of a 3D Auditory Scene Visualizer based on the visual information seeking mantra, "overview first, zoom and filter, then details on demand". The machine audition system called HARK captures 3D sounds with a microphone array. The natural language processing system called SalienceGraph visualizes topic transition by using discourse salience. The 3D visualizer, implemented in Java 3D, displays topic transition and each sound stream as a beam originating from the microphones (overview mode), shows temporal snapshots with or without specified focus areas (zoom-and-filter mode), and shows detailed information about a particular sound stream (details-on-demand mode). This three-mode visualization will give the user auditory awareness enhanced by HARK and SalienceGraph. In addition, a face-tracking system automatically determines the user's intention by tracking the user's face. The resulting system will enable users to manage and browse auditory scene files effectively, so it should help them cope with the information explosion and compensate for the lack of auditory awareness.

Original language: English
Title of host publication: Proceedings of the 2nd International Symposium on Universal Communication, ISUC 2008
Pages: 42-49
Number of pages: 8
DOIs: https://doi.org/10.1109/ISUC.2008.59
Publication status: Published - 2008 Dec 1
Externally published: Yes
Event: 2nd International Symposium on Universal Communication, ISUC 2008 - Osaka, Japan
Duration: 2008 Dec 15 - 2008 Dec 16

Publication series

Name: Proceedings of the 2nd International Symposium on Universal Communication, ISUC 2008

Conference

Conference: 2nd International Symposium on Universal Communication, ISUC 2008
Country: Japan
City: Osaka
Period: 08/12/15 - 08/12/16

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Information Systems
  • Software

Cite this

Kubota, Y., Shiramatsu, S., Yoshida, M., Komatani, K., Ogata, T., & Okuno, H. G. (2008). 3D Auditory Scene Visualizer with face tracking: Design and implementation for auditory awareness compensation. In Proceedings of the 2nd International Symposium on Universal Communication, ISUC 2008 (pp. 42-49). [4724440] https://doi.org/10.1109/ISUC.2008.59