Sensing of particular speakers for the construction of voice interface utilized in noisy environment

Hideyuki Sawada, Minoru Ohkado

Research output: Contribution to journalArticle

Abstract

Human is able to exchange information smoothly using voice under different situations such as noisy environment in a crowd and with the existence of plural speakers. We are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize who is talking. By realizing this mechanism with a computer, new applications will be presented for recording a sound with high quality by reducing noise, presenting a clarified sound, and realizing a microphone-free speech recognition by extracting particular sound. The paper will introduce a realtime detection and identification of particular speaker in noisy environment using a microphone array based on the location of a speaker and the individual voice characteristics. The study will be applied to develop an adaptive auditory system of a mobile robot which collaborates with a factory worker.

Original languageEnglish
JournalIEEJ Transactions on Industry Applications
Volume126
Issue number11
Publication statusPublished - 2006
Externally publishedYes

Fingerprint

Acoustic waves
Microphones
Computer applications
Speech recognition
Acoustic noise
Mobile robots
Industrial plants

Keywords

  • Cepstrum
  • Microphone array
  • Noisy environment
  • Sound localization
  • Speaker identification

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Industrial and Manufacturing Engineering

Cite this

@article{1cc753ac34754fd5b543ffd8b5a113ce,
title = "Sensing of particular speakers for the construction of voice interface utilized in noisy environment",
abstract = "Human is able to exchange information smoothly using voice under different situations such as noisy environment in a crowd and with the existence of plural speakers. We are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize who is talking. By realizing this mechanism with a computer, new applications will be presented for recording a sound with high quality by reducing noise, presenting a clarified sound, and realizing a microphone-free speech recognition by extracting particular sound. The paper will introduce a realtime detection and identification of particular speaker in noisy environment using a microphone array based on the location of a speaker and the individual voice characteristics. The study will be applied to develop an adaptive auditory system of a mobile robot which collaborates with a factory worker.",
keywords = "Cepstrum, Microphone array, Noisy environment, Sound localization, Speaker identification",
author = "Hideyuki Sawada and Minoru Ohkado",
year = "2006",
language = "English",
volume = "126",
journal = "IEEJ Transactions on Industry Applications",
issn = "0913-6339",
publisher = "The Institute of Electrical Engineers of Japan",
number = "11",

}

TY - JOUR

T1 - Sensing of particular speakers for the construction of voice interface utilized in noisy environment

AU - Sawada, Hideyuki

AU - Ohkado, Minoru

PY - 2006

Y1 - 2006

N2 - Human is able to exchange information smoothly using voice under different situations such as noisy environment in a crowd and with the existence of plural speakers. We are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize who is talking. By realizing this mechanism with a computer, new applications will be presented for recording a sound with high quality by reducing noise, presenting a clarified sound, and realizing a microphone-free speech recognition by extracting particular sound. The paper will introduce a realtime detection and identification of particular speaker in noisy environment using a microphone array based on the location of a speaker and the individual voice characteristics. The study will be applied to develop an adaptive auditory system of a mobile robot which collaborates with a factory worker.

AB - Human is able to exchange information smoothly using voice under different situations such as noisy environment in a crowd and with the existence of plural speakers. We are able to detect the position of a source sound in 3D space, extract a particular sound from mixed sounds, and recognize who is talking. By realizing this mechanism with a computer, new applications will be presented for recording a sound with high quality by reducing noise, presenting a clarified sound, and realizing a microphone-free speech recognition by extracting particular sound. The paper will introduce a realtime detection and identification of particular speaker in noisy environment using a microphone array based on the location of a speaker and the individual voice characteristics. The study will be applied to develop an adaptive auditory system of a mobile robot which collaborates with a factory worker.

KW - Cepstrum

KW - Microphone array

KW - Noisy environment

KW - Sound localization

KW - Speaker identification

UR - http://www.scopus.com/inward/record.url?scp=33750926862&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33750926862&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:33750926862

VL - 126

JO - IEEJ Transactions on Industry Applications

JF - IEEJ Transactions on Industry Applications

SN - 0913-6339

IS - 11

ER -