Identification and tracking of particular speaker in noisy environment

Hideyuki Sawada, Minoru Ohkado

Research output: Contribution to journalConference article

6 Citations (Scopus)

Abstract

Human is able to exchange information smoothly using voice under different situations such as noisy environment in a crowd and with the existence of plural speakers. We are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize who is talking. By realizing this mechanism with a computer, new applications will be presented for recording a sound with high quality by reducing noise, presenting a clarified sound, and realizing a microphone-free speech recognition by extracting particular sound. The paper will introduce a realtime detection and identification of particular speaker in noisy environment using a microphone array based on the location of a speaker and the individual voice characteristics. The study will be applied to develop an adaptive auditory system of a mobile robot which collaborates with a factory worker.

Original languageEnglish
Article number18
Pages (from-to)138-145
Number of pages8
JournalProceedings of SPIE - The International Society for Optical Engineering
Volume5603
DOIs
Publication statusPublished - 2004 Dec 1
EventMachine Vision and its Optomechatronic Applications - Philadelphia, PA, United States
Duration: 2004 Oct 262004 Oct 28

Keywords

  • Cepstrum
  • Microphone array
  • Noisy environment
  • Sound localization
  • Speaker identification

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Identification and tracking of particular speaker in noisy environment'. Together they form a unique fingerprint.

  • Cite this