Harmonic sound stream segregation using localization and its application to speech stream segregation

Tomohiro Nakatani, Hiroshi G. Okuno

Research output: Contribution to journal › Article

32 Citations (Scopus)

Abstract

Sound stream segregation is essential to understand auditory events in the real world. In this paper, we present a new method of segregating a series of harmonic sounds. The harmonic structure and sound source direction are used as clues for segregation. The direction information of the sources is used to extract fundamental frequencies of individual harmonic sounds, and harmonic sounds are segregated according to the extracted fundamental frequencies. Sequential grouping of harmonic sounds is achieved by using both sound source directions and fundamental frequencies. An application of the harmonic stream segregation to speech stream segregation is presented. It provides effective speech stream segregation using binaural microphones. Experimental results show that the method reduces the spectrum distortions and the fundamental frequency errors compared to an existing monaural system, and that it can segregate three simultaneous harmonic streams with only two microphones.

Original language: English
Pages (from-to): 209-222
Number of pages: 14
Journal: Speech Communication
Volume: 27
Issue number: 3
DOIs: 10.1016/S0167-6393(98)00079-X
Publication status: Published - 1999 Apr
Externally published: Yes

Fingerprint

Segregation
segregation
Harmonic
Acoustic waves
Fundamental Frequency
Microphones
grouping
Speech
Sound
Harmonics
Localization
Grouping
event
Series
Experimental Results
Direction compound

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Experimental and Cognitive Psychology
  • Linguistics and Language

Cite this

Harmonic sound stream segregation using localization and its application to speech stream segregation. / Nakatani, Tomohiro; Okuno, Hiroshi G.

In: Speech Communication, Vol. 27, No. 3, 04.1999, p. 209-222.

@article{89f0e60a6dc849a2939ba2654b487d22,
title = "Harmonic sound stream segregation using localization and its application to speech stream segregation",
abstract = "Sound stream segregation is essential to understand auditory events in the real world. In this paper, we present a new method of segregating a series of harmonic sounds. The harmonic structure and sound source direction are used as clues for segregation. The direction information of the sources is used to extract fundamental frequencies of individual harmonic sounds, and harmonic sounds are segregated according to the extracted fundamental frequencies. Sequential grouping of harmonic sounds is achieved by using both sound source directions and fundamental frequencies. An application of the harmonic stream segregation to speech stream segregation is presented. It provides effective speech stream segregation using binaural microphones. Experimental results show that the method reduces the spectrum distortions and the fundamental frequency errors compared to an existing monaural system, and that it can segregate three simultaneous harmonic streams with only two microphones.",
author = "Nakatani, Tomohiro and Okuno, Hiroshi G.",
year = "1999",
month = "4",
doi = "10.1016/S0167-6393(98)00079-X",
language = "English",
volume = "27",
pages = "209--222",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",
number = "3",
}

TY  - JOUR
T1  - Harmonic sound stream segregation using localization and its application to speech stream segregation
AU  - Nakatani, Tomohiro
AU  - Okuno, Hiroshi G.
PY  - 1999/4
Y1  - 1999/4
AB  - Sound stream segregation is essential to understand auditory events in the real world. In this paper, we present a new method of segregating a series of harmonic sounds. The harmonic structure and sound source direction are used as clues for segregation. The direction information of the sources is used to extract fundamental frequencies of individual harmonic sounds, and harmonic sounds are segregated according to the extracted fundamental frequencies. Sequential grouping of harmonic sounds is achieved by using both sound source directions and fundamental frequencies. An application of the harmonic stream segregation to speech stream segregation is presented. It provides effective speech stream segregation using binaural microphones. Experimental results show that the method reduces the spectrum distortions and the fundamental frequency errors compared to an existing monaural system, and that it can segregate three simultaneous harmonic streams with only two microphones.
UR  - http://www.scopus.com/inward/record.url?scp=0032630841&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=0032630841&partnerID=8YFLogxK
U2  - 10.1016/S0167-6393(98)00079-X
DO  - 10.1016/S0167-6393(98)00079-X
M3  - Article
AN  - SCOPUS:0032630841
VL  - 27
SP  - 209
EP  - 222
JO  - Speech Communication
JF  - Speech Communication
SN  - 0167-6393
IS  - 3
ER  - 