Sound sources selection system by using onomatopoeic querries from multiple sound sources

Yusuke Yamamura, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno

研究成果: Conference contribution

1 被引用数 (Scopus)

抄録

Our motivation is to develop a robot that treats auditory information in real environment because auditory information is useful for animated communications or understanding our surroundings. Interactions by using sound information need an aquisition of it and a proper sound source reference between a user and a robot leads to it. Such sound source reference is difficult due to multiple sound sources generating in real environemnt, and we use onomatopoeic representations as a representation for the reference. This paper shows a system that selects a sound source specified by a user from multiple sound sources. Users use onomatopoeias in the specification, and our system separates a mixed sound and converts separated sounds into onomatopoeias for the selection. Onomatopoeais have the ambiguity that each user gives each expression to a certain sound and we create an original similarity based on Minimum Edit Distance and acoustic features for solving its problem. In experiments, our system receives a mixed sound consisting of three sounds and a user's query as inputs, and checks a count of a consistency of a sound source selected by a system and a sound source specified by a user in 100 tests. The result shows

本文言語English
ホスト出版物のタイトル2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012
ページ2364-2369
ページ数6
DOI
出版ステータスPublished - 2012
外部発表はい
イベント25th IEEE/RSJ International Conference on Robotics and Intelligent Systems, IROS 2012 - Vilamoura, Algarve, Portugal
継続期間: 2012 10 72012 10 12

出版物シリーズ

名前IEEE International Conference on Intelligent Robots and Systems
ISSN(印刷版)2153-0858
ISSN(電子版)2153-0866

Conference

Conference25th IEEE/RSJ International Conference on Robotics and Intelligent Systems, IROS 2012
国/地域Portugal
CityVilamoura, Algarve
Period12/10/712/10/12

ASJC Scopus subject areas

  • 制御およびシステム工学
  • ソフトウェア
  • コンピュータ ビジョンおよびパターン認識
  • コンピュータ サイエンスの応用

フィンガープリント

「Sound sources selection system by using onomatopoeic querries from multiple sound sources」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル