Integration of sound and image information for active tracking of particular person

Hideyuki Sawada, Atsushi Todo

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A human has various sensory perceptions, and effectively uses them in communication. Auditory and visual functions especially play an important role for recognizing someone to talk to and understanding the conversation. In vocal communication, we are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize who is talking. In addition, we are able to detect a particular person by recognizing body features and individual gestures. By realizing this mechanism using a computer, new applications will be presented, which are utilized in the flexible and intuitive communication with humans. The authors are working for the identification of a particular person using microphones and a USB camera. The paper describes the development of an information fusion system and how to deal with multiple data obtained by different sensors.

Original languageEnglish
Title of host publicationECTI-CON 2010 - The 2010 ECTI International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology
Pages861-865
Number of pages5
Publication statusPublished - 2010
Externally publishedYes
Event7th Annual International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, ECTI-CON 2010 - Chiang Mai, Thailand
Duration: 2010 May 192010 May 21

Other

Other7th Annual International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, ECTI-CON 2010
CountryThailand
CityChiang Mai
Period10/5/1910/5/21

Fingerprint

Acoustic waves
Communication
Information fusion
Computer applications
Microphones
Cameras
Sensors

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Electrical and Electronic Engineering

Cite this

Sawada, H., & Todo, A. (2010). Integration of sound and image information for active tracking of particular person. In ECTI-CON 2010 - The 2010 ECTI International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (pp. 861-865). [5491587]

Integration of sound and image information for active tracking of particular person. / Sawada, Hideyuki; Todo, Atsushi.

ECTI-CON 2010 - The 2010 ECTI International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology. 2010. p. 861-865 5491587.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sawada, H & Todo, A 2010, Integration of sound and image information for active tracking of particular person. in ECTI-CON 2010 - The 2010 ECTI International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology., 5491587, pp. 861-865, 7th Annual International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, ECTI-CON 2010, Chiang Mai, Thailand, 10/5/19.
Sawada H, Todo A. Integration of sound and image information for active tracking of particular person. In ECTI-CON 2010 - The 2010 ECTI International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology. 2010. p. 861-865. 5491587
Sawada, Hideyuki ; Todo, Atsushi. / Integration of sound and image information for active tracking of particular person. ECTI-CON 2010 - The 2010 ECTI International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology. 2010. pp. 861-865
@inproceedings{cb0f8a88d8b243bf8d3787c8cfee3020,
title = "Integration of sound and image information for active tracking of particular person",
abstract = "A human has various sensory perceptions, and effectively uses them in communication. Auditory and visual functions especially play an important role for recognizing someone to talk to and understanding the conversation. In vocal communication, we are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize who is talking. In addition, we are able to detect a particular person by recognizing body features and individual gestures. By realizing this mechanism using a computer, new applications will be presented, which are utilized in the flexible and intuitive communication with humans. The authors are working for the identification of a particular person using microphones and a USB camera. The paper describes the development of an information fusion system and how to deal with multiple data obtained by different sensors.",
author = "Hideyuki Sawada and Atsushi Todo",
year = "2010",
language = "English",
isbn = "9789746724913",
pages = "861--865",
booktitle = "ECTI-CON 2010 - The 2010 ECTI International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology",

}

TY - GEN

T1 - Integration of sound and image information for active tracking of particular person

AU - Sawada, Hideyuki

AU - Todo, Atsushi

PY - 2010

Y1 - 2010

N2 - A human has various sensory perceptions, and effectively uses them in communication. Auditory and visual functions especially play an important role for recognizing someone to talk to and understanding the conversation. In vocal communication, we are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize who is talking. In addition, we are able to detect a particular person by recognizing body features and individual gestures. By realizing this mechanism using a computer, new applications will be presented, which are utilized in the flexible and intuitive communication with humans. The authors are working for the identification of a particular person using microphones and a USB camera. The paper describes the development of an information fusion system and how to deal with multiple data obtained by different sensors.

AB - A human has various sensory perceptions, and effectively uses them in communication. Auditory and visual functions especially play an important role for recognizing someone to talk to and understanding the conversation. In vocal communication, we are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize who is talking. In addition, we are able to detect a particular person by recognizing body features and individual gestures. By realizing this mechanism using a computer, new applications will be presented, which are utilized in the flexible and intuitive communication with humans. The authors are working for the identification of a particular person using microphones and a USB camera. The paper describes the development of an information fusion system and how to deal with multiple data obtained by different sensors.

UR - http://www.scopus.com/inward/record.url?scp=77954949492&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77954949492&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:77954949492

SN - 9789746724913

SP - 861

EP - 865

BT - ECTI-CON 2010 - The 2010 ECTI International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology

ER -