Identification and tracking of particular speaker in noisy environment

Hideyuki Sawada, Minoru Ohkado

Research output: Contribution to journal › Article

6 Citations (Scopus)

Abstract

Humans can exchange information smoothly by voice in difficult conditions, such as a noisy crowd or in the presence of multiple speakers. We can detect the position of a sound source in 3D space, extract a particular sound from a mixture of sounds, and recognize who is talking. Realizing this mechanism on a computer would enable new applications: high-quality sound recording through noise reduction, presentation of a clarified sound, and microphone-free speech recognition through extraction of a particular sound. This paper introduces real-time detection and identification of a particular speaker in a noisy environment using a microphone array, based on the speaker's location and individual voice characteristics. The study will be applied to develop an adaptive auditory system for a mobile robot that collaborates with a factory worker.

Original language: English
Article number: 18
Pages (from-to): 138-145
Number of pages: 8
Journal: Proceedings of SPIE - The International Society for Optical Engineering
Volume: 5603
DOIs: 10.1117/12.580588
Publication status: Published - 2004
Externally published: Yes

Fingerprint

Acoustic waves
Acoustics
Microphones
Microphone array
Talking
Computer applications
Speech recognition
Adaptive systems
Robots
Mobile robots
Industrial plants
Acoustic noise
Sound
Recording
Real-time

Keywords

  • Cepstrum
  • Microphone array
  • Noisy environment
  • Sound localization
  • Speaker identification

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Condensed Matter Physics

Cite this

Identification and tracking of particular speaker in noisy environment. / Sawada, Hideyuki; Ohkado, Minoru.

In: Proceedings of SPIE - The International Society for Optical Engineering, Vol. 5603, 18, 2004, p. 138-145.

@article{0fcad4134d84431fbea116c9075b0415,
title = "Identification and tracking of particular speaker in noisy environment",
abstract = "Humans can exchange information smoothly by voice in difficult conditions, such as a noisy crowd or in the presence of multiple speakers. We can detect the position of a sound source in 3D space, extract a particular sound from a mixture of sounds, and recognize who is talking. Realizing this mechanism on a computer would enable new applications: high-quality sound recording through noise reduction, presentation of a clarified sound, and microphone-free speech recognition through extraction of a particular sound. This paper introduces real-time detection and identification of a particular speaker in a noisy environment using a microphone array, based on the speaker's location and individual voice characteristics. The study will be applied to develop an adaptive auditory system for a mobile robot that collaborates with a factory worker.",
keywords = "Cepstrum, Microphone array, Noisy environment, Sound localization, Speaker identification",
author = "Hideyuki Sawada and Minoru Ohkado",
year = "2004",
doi = "10.1117/12.580588",
language = "English",
volume = "5603",
pages = "138--145",
journal = "Proceedings of SPIE - The International Society for Optical Engineering",
issn = "0277-786X",
publisher = "SPIE",
}

TY - JOUR

T1 - Identification and tracking of particular speaker in noisy environment

AU - Sawada, Hideyuki

AU - Ohkado, Minoru

PY - 2004

Y1 - 2004

AB - Humans can exchange information smoothly by voice in difficult conditions, such as a noisy crowd or in the presence of multiple speakers. We can detect the position of a sound source in 3D space, extract a particular sound from a mixture of sounds, and recognize who is talking. Realizing this mechanism on a computer would enable new applications: high-quality sound recording through noise reduction, presentation of a clarified sound, and microphone-free speech recognition through extraction of a particular sound. This paper introduces real-time detection and identification of a particular speaker in a noisy environment using a microphone array, based on the speaker's location and individual voice characteristics. The study will be applied to develop an adaptive auditory system for a mobile robot that collaborates with a factory worker.

KW - Cepstrum

KW - Microphone array

KW - Noisy environment

KW - Sound localization

KW - Speaker identification

UR - http://www.scopus.com/inward/record.url?scp=17644387178&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=17644387178&partnerID=8YFLogxK

U2 - 10.1117/12.580588

DO - 10.1117/12.580588

M3 - Article

AN - SCOPUS:17644387178

VL - 5603

SP - 138

EP - 145

JO - Proceedings of SPIE - The International Society for Optical Engineering

JF - Proceedings of SPIE - The International Society for Optical Engineering

SN - 0277-786X

M1 - 18

ER -