An active audition framework for auditory-driven HRI: Application to interactive robot dancing

Joao Lobato Oliveira, Gokhan Ince, Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno, Luis Paulo Reis, Fabien Gouyon

Research output: Conference contribution

10 Citations (Scopus)

Abstract

In this paper we propose a general active audition framework for auditory-driven Human-Robot Interaction (HRI). The proposed framework simultaneously processes speech and music on the fly, integrates perceptual models for robot audition, and supports verbal and non-verbal interactive communication by means of (pro)active behaviors. To ensure a reliable interaction, a behavior decision mechanism based on active audition sits on top of the framework and governs the robot's actions according to the reliability of the acoustic signals for auditory processing. To validate the framework's application to general auditory-driven HRI, we propose the implementation of an interactive robot dancing system. This system integrates three preprocessing robot audition modules (sound source localization, sound source separation, and ego noise suppression); two modules for auditory perception (live audio beat tracking and automatic speech recognition); and multi-modal behaviors for verbal and non-verbal interaction (music-driven dancing and speech-driven dialoguing). To fully assess the system, we set up experimental and interactive real-world scenarios with highly dynamic acoustic conditions, and defined a set of evaluation criteria. The experimental tests revealed accurate and robust beat tracking and speech recognition, and convincing dance beat-synchrony. The interactive sessions confirmed the fundamental role of the behavior decision mechanism for actively maintaining a robust and natural human-robot interaction.
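
As a rough illustration of the idea of choosing actions according to the reliability of the acoustic signals, the following is a minimal sketch and not the authors' mechanism. The abstract gives no implementation details, so every name here (`AuditoryState`, `Behavior`, `decide_behavior`), the confidence scores, the threshold of 0.5, and the fallback behaviors of asking for a repetition or repositioning are assumptions made purely for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class Behavior(Enum):
    # Hypothetical behavior set; only dancing and dialoguing are named in the abstract.
    DANCE = "dance to the tracked beat"
    RESPOND = "respond to the recognized utterance"
    ASK_REPEAT = "ask the speaker to repeat"
    REPOSITION = "move to improve signal quality"


@dataclass
class AuditoryState:
    # Hypothetical reliability scores in [0, 1]; not taken from the paper.
    beat_confidence: float
    speech_confidence: float
    speech_detected: bool


def decide_behavior(state: AuditoryState, threshold: float = 0.5) -> Behavior:
    """Pick a behavior from the assumed reliability of each auditory stream."""
    if state.speech_detected:
        # Speech takes priority: answer if recognition looks reliable,
        # otherwise proactively ask for a repetition.
        if state.speech_confidence >= threshold:
            return Behavior.RESPOND
        return Behavior.ASK_REPEAT
    # No speech: dance only while the beat estimate looks reliable.
    if state.beat_confidence >= threshold:
        return Behavior.DANCE
    return Behavior.REPOSITION


if __name__ == "__main__":
    print(decide_behavior(AuditoryState(0.9, 0.0, False)).name)
    print(decide_behavior(AuditoryState(0.9, 0.2, True)).name)
```

In this sketch speech is given priority over music, which is a design choice of the example only; a real system could weigh the two streams differently.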

Original language: English
Title of host publication: Proceedings - IEEE International Workshop on Robot and Human Interactive Communication
Pages: 1078-1085
Number of pages: 8
Publication status: Published - 2012
Externally published: Yes
Event: 2012 21st IEEE International Symposium on Robot and Human Interactive Communication, RO-MAN 2012 - Paris
Duration: 2012-09-09 to 2012-09-13

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
  • Human-Computer Interaction
