Realtime detection and identification of plural speakers using a microphone array

M. Ohkado, Hideyuki Sawada

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Voices are used as primary media in human communication. A human voice is a sound generated by the complex movements of the vocal organs, and is the most important media employed for communication in daily life to logical discussions. A human is able to detect the position of a source sound in 3D space by perceiving the time difference reaching to both ears. Furthermore we can selectively listen to an objective voice in the crowd. This paper presents real time detection and identification of a particular person among plural speakers using a microphone array. The system identifies the position of a particular speaker, and enhances the voice signal selectively.

Original languageEnglish
Title of host publicationVECIMS 2003 - 2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages151-156
Number of pages6
ISBN (Electronic)0780377850, 9780780377851
DOIs
Publication statusPublished - 2003
Externally publishedYes
Event2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems, VECIMS 2003 - Lugano, Switzerland
Duration: 2003 Jul 272003 Jul 29

Other

Other2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems, VECIMS 2003
CountrySwitzerland
CityLugano
Period03/7/2703/7/29

Fingerprint

Microphones
Acoustic waves
Communication

Keywords

  • Acoustic noise
  • Acoustical engineering
  • Data mining
  • Discrete Fourier transforms
  • Human voice
  • Intelligent systems
  • Loudspeakers
  • Mechanical systems
  • Microphone arrays
  • Signal processing

ASJC Scopus subject areas

  • Artificial Intelligence
  • Human-Computer Interaction
  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Cite this

Ohkado, M., & Sawada, H. (2003). Realtime detection and identification of plural speakers using a microphone array. In VECIMS 2003 - 2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems (pp. 151-156). [1227046] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/VECIMS.2003.1227046

Realtime detection and identification of plural speakers using a microphone array. / Ohkado, M.; Sawada, Hideyuki.

VECIMS 2003 - 2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems. Institute of Electrical and Electronics Engineers Inc., 2003. p. 151-156 1227046.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Ohkado, M & Sawada, H 2003, Realtime detection and identification of plural speakers using a microphone array. in VECIMS 2003 - 2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems., 1227046, Institute of Electrical and Electronics Engineers Inc., pp. 151-156, 2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems, VECIMS 2003, Lugano, Switzerland, 03/7/27. https://doi.org/10.1109/VECIMS.2003.1227046
Ohkado M, Sawada H. Realtime detection and identification of plural speakers using a microphone array. In VECIMS 2003 - 2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems. Institute of Electrical and Electronics Engineers Inc. 2003. p. 151-156. 1227046 https://doi.org/10.1109/VECIMS.2003.1227046
Ohkado, M. ; Sawada, Hideyuki. / Realtime detection and identification of plural speakers using a microphone array. VECIMS 2003 - 2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems. Institute of Electrical and Electronics Engineers Inc., 2003. pp. 151-156
@inproceedings{56456464009849fbabab3b925e37f57c,
title = "Realtime detection and identification of plural speakers using a microphone array",
abstract = "Voices are used as primary media in human communication. A human voice is a sound generated by the complex movements of the vocal organs, and is the most important media employed for communication in daily life to logical discussions. A human is able to detect the position of a source sound in 3D space by perceiving the time difference reaching to both ears. Furthermore we can selectively listen to an objective voice in the crowd. This paper presents real time detection and identification of a particular person among plural speakers using a microphone array. The system identifies the position of a particular speaker, and enhances the voice signal selectively.",
keywords = "Acoustic noise, Acoustical engineering, Data mining, Discrete Fourier transforms, Human voice, Intelligent systems, Loudspeakers, Mechanical systems, Microphone arrays, Signal processing",
author = "M. Ohkado and Hideyuki Sawada",
year = "2003",
doi = "10.1109/VECIMS.2003.1227046",
language = "English",
pages = "151--156",
booktitle = "VECIMS 2003 - 2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

TY - GEN

T1 - Realtime detection and identification of plural speakers using a microphone array

AU - Ohkado, M.

AU - Sawada, Hideyuki

PY - 2003

Y1 - 2003

N2 - Voices are used as primary media in human communication. A human voice is a sound generated by the complex movements of the vocal organs, and is the most important media employed for communication in daily life to logical discussions. A human is able to detect the position of a source sound in 3D space by perceiving the time difference reaching to both ears. Furthermore we can selectively listen to an objective voice in the crowd. This paper presents real time detection and identification of a particular person among plural speakers using a microphone array. The system identifies the position of a particular speaker, and enhances the voice signal selectively.

AB - Voices are used as primary media in human communication. A human voice is a sound generated by the complex movements of the vocal organs, and is the most important media employed for communication in daily life to logical discussions. A human is able to detect the position of a source sound in 3D space by perceiving the time difference reaching to both ears. Furthermore we can selectively listen to an objective voice in the crowd. This paper presents real time detection and identification of a particular person among plural speakers using a microphone array. The system identifies the position of a particular speaker, and enhances the voice signal selectively.

KW - Acoustic noise

KW - Acoustical engineering

KW - Data mining

KW - Discrete Fourier transforms

KW - Human voice

KW - Intelligent systems

KW - Loudspeakers

KW - Mechanical systems

KW - Microphone arrays

KW - Signal processing

UR - http://www.scopus.com/inward/record.url?scp=20544472983&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=20544472983&partnerID=8YFLogxK

U2 - 10.1109/VECIMS.2003.1227046

DO - 10.1109/VECIMS.2003.1227046

M3 - Conference contribution

SP - 151

EP - 156

BT - VECIMS 2003 - 2003 International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems

PB - Institute of Electrical and Electronics Engineers Inc.

ER -