Realtime speaker tracking for robotic auditory system

Hideyuki Sawada, Atsushi Todo, Toshiya Takechi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Voice and sounds are the primary media employed for the human communication. Human is able to exchange information smoothly using voice under different situations such as noisy environment in a crowd and with the existence of plural speakers. We are surrounded by various sounds, and are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize what makes the sounds. The paper will introduce a realtime detection and identification of particular sound among plural sound sources using a microphone array based on the location of a speaker and the tonal characteristics. The algorithm is applied to an adaptive auditory system of a robotic arm for the interaction with humans.

Original languageEnglish
Title of host publicationIECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics
Pages5474-5479
Number of pages6
DOIs
Publication statusPublished - 2006
Externally publishedYes
EventIECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics - Paris
Duration: 2006 Nov 62006 Nov 10

Other

OtherIECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics
CityParis
Period06/11/606/11/10

Fingerprint

Robotics
Acoustic waves
Robotic arms
Microphones
Communication

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this

Sawada, H., Todo, A., & Takechi, T. (2006). Realtime speaker tracking for robotic auditory system. In IECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics (pp. 5474-5479). [4153306] https://doi.org/10.1109/IECON.2006.347942

Realtime speaker tracking for robotic auditory system. / Sawada, Hideyuki; Todo, Atsushi; Takechi, Toshiya.

IECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics. 2006. p. 5474-5479 4153306.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sawada, H, Todo, A & Takechi, T 2006, Realtime speaker tracking for robotic auditory system. in IECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics., 4153306, pp. 5474-5479, IECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics, Paris, 06/11/6. https://doi.org/10.1109/IECON.2006.347942
Sawada H, Todo A, Takechi T. Realtime speaker tracking for robotic auditory system. In IECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics. 2006. p. 5474-5479. 4153306 https://doi.org/10.1109/IECON.2006.347942
Sawada, Hideyuki ; Todo, Atsushi ; Takechi, Toshiya. / Realtime speaker tracking for robotic auditory system. IECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics. 2006. pp. 5474-5479
@inproceedings{64c848ba4c67418abb30e2aae77345c8,
title = "Realtime speaker tracking for robotic auditory system",
abstract = "Voice and sounds are the primary media employed for the human communication. Human is able to exchange information smoothly using voice under different situations such as noisy environment in a crowd and with the existence of plural speakers. We are surrounded by various sounds, and are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize what makes the sounds. The paper will introduce a realtime detection and identification of particular sound among plural sound sources using a microphone array based on the location of a speaker and the tonal characteristics. The algorithm is applied to an adaptive auditory system of a robotic arm for the interaction with humans.",
author = "Hideyuki Sawada and Atsushi Todo and Toshiya Takechi",
year = "2006",
doi = "10.1109/IECON.2006.347942",
language = "English",
isbn = "1424401364",
pages = "5474--5479",
booktitle = "IECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics",

}

TY - GEN

T1 - Realtime speaker tracking for robotic auditory system

AU - Sawada, Hideyuki

AU - Todo, Atsushi

AU - Takechi, Toshiya

PY - 2006

Y1 - 2006

N2 - Voice and sounds are the primary media employed for the human communication. Human is able to exchange information smoothly using voice under different situations such as noisy environment in a crowd and with the existence of plural speakers. We are surrounded by various sounds, and are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize what makes the sounds. The paper will introduce a realtime detection and identification of particular sound among plural sound sources using a microphone array based on the location of a speaker and the tonal characteristics. The algorithm is applied to an adaptive auditory system of a robotic arm for the interaction with humans.

AB - Voice and sounds are the primary media employed for the human communication. Human is able to exchange information smoothly using voice under different situations such as noisy environment in a crowd and with the existence of plural speakers. We are surrounded by various sounds, and are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize what makes the sounds. The paper will introduce a realtime detection and identification of particular sound among plural sound sources using a microphone array based on the location of a speaker and the tonal characteristics. The algorithm is applied to an adaptive auditory system of a robotic arm for the interaction with humans.

UR - http://www.scopus.com/inward/record.url?scp=50249151510&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=50249151510&partnerID=8YFLogxK

U2 - 10.1109/IECON.2006.347942

DO - 10.1109/IECON.2006.347942

M3 - Conference contribution

AN - SCOPUS:50249151510

SN - 1424401364

SN - 9781424401369

SP - 5474

EP - 5479

BT - IECON 2006 - 32nd Annual Conference on IEEE Industrial Electronics

ER -