Sound sources selection system by using onomatopoeic querries from multiple sound sources

Yusuke Yamamura, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno

研究成果: Conference contribution

1 被引用数 (Scopus)

抄録

Our motivation is to develop a robot that treats auditory information in real environment because auditory information is useful for animated communications or understanding our surroundings. Interactions by using sound information need an aquisition of it and a proper sound source reference between a user and a robot leads to it. Such sound source reference is difficult due to multiple sound sources generating in real environemnt, and we use onomatopoeic representations as a representation for the reference. This paper shows a system that selects a sound source specified by a user from multiple sound sources. Users use onomatopoeias in the specification, and our system separates a mixed sound and converts separated sounds into onomatopoeias for the selection. Onomatopoeais have the ambiguity that each user gives each expression to a certain sound and we create an original similarity based on Minimum Edit Distance and acoustic features for solving its problem. In experiments, our system receives a mixed sound consisting of three sounds and a user's query as inputs, and checks a count of a consistency of a sound source selected by a system and a sound source specified by a user in 100 tests. The result shows

元の言語English
ホスト出版物のタイトル2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012
ページ2364-2369
ページ数6
DOI
出版物ステータスPublished - 2012 12 1
外部発表Yes
イベント25th IEEE/RSJ International Conference on Robotics and Intelligent Systems, IROS 2012 - Vilamoura, Algarve, Portugal
継続期間: 2012 10 72012 10 12

出版物シリーズ

名前IEEE International Conference on Intelligent Robots and Systems
ISSN(印刷物)2153-0858
ISSN(電子版)2153-0866

Conference

Conference25th IEEE/RSJ International Conference on Robotics and Intelligent Systems, IROS 2012
Portugal
Vilamoura, Algarve
期間12/10/712/10/12

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Software
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

フィンガープリント 「Sound sources selection system by using onomatopoeic querries from multiple sound sources」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル