Design and implementation of selectable sound separation on the texai telepresence system using HARK

Takeshi Mizumoto, Kazuhiro Nakadai, Takami Yoshida, Ryu Takeda, Takuma Otsuka, Toru Takahashi, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceedingConference contribution

26 Citations (Scopus)

Abstract

This paper presents the design and implementation of selectable sound separation functions on the telepresence system "Texai" using the robot audition software "HARK." An operator of Texai can "walk" around a faraway office to attend a meeting or talk with people through video-conference instead of meeting in person. With a normal microphone, the operator has difficulty recognizing the auditory scene of the Texai, e.g., he/she cannot know the number and the locations of sounds. To solve this problem, we design selectable sound separation functions with 8 microphones in two modes, overview and filter modes, and implement them using HARK's sound source localization and separation. The overview mode visualizes the direction-of-arrival of surrounding sounds, while the filter mode provides sounds that originate from the range of directions he/she specifies. The functions enable the operator to be aware of a sound even if it comes from behind the Texai, and to concentrate on a particular sound. The design and implementation was completed in five days due to the portability of HARK. Experimental evaluations with actual and simulated data show that the resulting system localizes sound sources with a tolerance of 5 degrees.

Original languageEnglish
Title of host publicationProceedings - IEEE International Conference on Robotics and Automation
Pages2130-2137
Number of pages8
DOIs
Publication statusPublished - 2011
Externally publishedYes
Event2011 IEEE International Conference on Robotics and Automation, ICRA 2011 - Shanghai
Duration: 2011 May 92011 May 13

Other

Other2011 IEEE International Conference on Robotics and Automation, ICRA 2011
CityShanghai
Period11/5/911/5/13

    Fingerprint

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Cite this

Mizumoto, T., Nakadai, K., Yoshida, T., Takeda, R., Otsuka, T., Takahashi, T., & Okuno, H. G. (2011). Design and implementation of selectable sound separation on the texai telepresence system using HARK. In Proceedings - IEEE International Conference on Robotics and Automation (pp. 2130-2137). [5979849] https://doi.org/10.1109/ICRA.2011.5979849