Sensing of particular speakers for the construction of voice interface utilized in noisy environment

Hideyuki Sawada, Minoru Ohkado

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

A human is able to exchange information smoothly using voice under different situations such as a noisy environment in a crowd and with the existence of multiple speakers. We are able to detect the position of a source sound in three-dimensional space, extract a particular sound from mixed sounds, and recognize who is talking. By realizing this mechanism with a computer, new applications will be presented for recording a sound with high quality by reducing noise, presenting a clarified sound, and realizing microphone-free speech recognition by extracting particular sounds. The paper will introduce real-time detection and identification of a particular speaker in a noisy environment using a microphone array based on the location of a speaker and the individual voice characteristics. The study will be applied to develop an adaptive auditory system for a mobile robot which collaborates with a factory worker.

Original languageEnglish
Pages (from-to)78-86
Number of pages9
JournalElectrical Engineering in Japan (English translation of Denki Gakkai Ronbunshi)
Volume162
Issue number3
DOIs
Publication statusPublished - 2008 Feb
Externally publishedYes

Fingerprint

Acoustic waves
Microphones
Computer applications
Speech recognition
Acoustic noise
Mobile robots
Industrial plants

Keywords

  • Cepstrum
  • Microphone array
  • Noisy environment
  • Sound localization
  • Speaker identification

ASJC Scopus subject areas

  • Energy Engineering and Power Technology
  • Electrical and Electronic Engineering

Cite this

@article{66c138e74b5d4c19b7387211dbf99af3,
title = "Sensing of particular speakers for the construction of voice interface utilized in noisy environment",
abstract = "A human is able to exchange information smoothly using voice under different situations such as a noisy environment in a crowd and with the existence of multiple speakers. We are able to detect the position of a source sound in three-dimensional space, extract a particular sound from mixed sounds, and recognize who is talking. By realizing this mechanism with a computer, new applications will be presented for recording a sound with high quality by reducing noise, presenting a clarified sound, and realizing microphone-free speech recognition by extracting particular sounds. The paper will introduce real-time detection and identification of a particular speaker in a noisy environment using a microphone array based on the location of a speaker and the individual voice characteristics. The study will be applied to develop an adaptive auditory system for a mobile robot which collaborates with a factory worker.",
keywords = "Cepstrum, Microphone array, Noisy environment, Sound localization, Speaker identification",
author = "Hideyuki Sawada and Minoru Ohkado",
year = "2008",
month = "2",
doi = "10.1002/eej.20629",
language = "English",
volume = "162",
pages = "78--86",
journal = "Electrical Engineering in Japan (English translation of Denki Gakkai Ronbunshi)",
issn = "0424-7760",
publisher = "John Wiley and Sons Inc.",
number = "3",

}

TY - JOUR

T1 - Sensing of particular speakers for the construction of voice interface utilized in noisy environment

AU - Sawada, Hideyuki

AU - Ohkado, Minoru

PY - 2008/2

Y1 - 2008/2

N2 - A human is able to exchange information smoothly using voice under different situations such as a noisy environment in a crowd and with the existence of multiple speakers. We are able to detect the position of a source sound in three-dimensional space, extract a particular sound from mixed sounds, and recognize who is talking. By realizing this mechanism with a computer, new applications will be presented for recording a sound with high quality by reducing noise, presenting a clarified sound, and realizing microphone-free speech recognition by extracting particular sounds. The paper will introduce real-time detection and identification of a particular speaker in a noisy environment using a microphone array based on the location of a speaker and the individual voice characteristics. The study will be applied to develop an adaptive auditory system for a mobile robot which collaborates with a factory worker.

AB - A human is able to exchange information smoothly using voice under different situations such as a noisy environment in a crowd and with the existence of multiple speakers. We are able to detect the position of a source sound in three-dimensional space, extract a particular sound from mixed sounds, and recognize who is talking. By realizing this mechanism with a computer, new applications will be presented for recording a sound with high quality by reducing noise, presenting a clarified sound, and realizing microphone-free speech recognition by extracting particular sounds. The paper will introduce real-time detection and identification of a particular speaker in a noisy environment using a microphone array based on the location of a speaker and the individual voice characteristics. The study will be applied to develop an adaptive auditory system for a mobile robot which collaborates with a factory worker.

KW - Cepstrum

KW - Microphone array

KW - Noisy environment

KW - Sound localization

KW - Speaker identification

UR - http://www.scopus.com/inward/record.url?scp=37249009294&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=37249009294&partnerID=8YFLogxK

U2 - 10.1002/eej.20629

DO - 10.1002/eej.20629

M3 - Article

AN - SCOPUS:37249009294

VL - 162

SP - 78

EP - 86

JO - Electrical Engineering in Japan (English translation of Denki Gakkai Ronbunshi)

JF - Electrical Engineering in Japan (English translation of Denki Gakkai Ronbunshi)

SN - 0424-7760

IS - 3

ER -