Sound ontology for computational auditory scene analysis

Tomohiro Nakatani, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Citations (Scopus)

Abstract

This paper proposes that sound ontology should be both as a common vocabulary for sound representation and as a common terminology for integrating various sound stream segregation systems. Since research on computational auditory scene analysis (CASA) focuses on recognizing and understanding various kinds of sounds, sound stream segregation which extracts each sound stream from a mixture of sounds is essential for CASA. Even if sound stream segregation systems use a harmonic structure of sound as a cue of segregation, it is not easy to integrate such systems because the definition of a harmonic structure differs or the precision of extracted harmonic structures differs. Therefore, sound ontology is needed as a common knowledge representation of sounds. Another problem is to interface sound stream segregation systems with applications such as automatic speech recognition systems. Since the requirement of the quality of segregated sound streams depends on applications, sound stream segregation systems must provide a flexible interface. Therefore, sound ontology is needed to fulfill the requirements imposed by them. In addition, the hierarchical structure of sound ontology provides a means of controlling top-down and bottom-up processing of sound stream segregation.

Original languageEnglish
Title of host publicationProceedings of the National Conference on Artificial Intelligence
Editors Anon
Place of PublicationMenlo Park, CA, United States
PublisherAAAI
Pages1004-1010
Number of pages7
Publication statusPublished - 1998
Externally publishedYes
EventProceedings of the 1998 15th National Conference on Artificial Intelligence, AAAI - Madison, WI, USA
Duration: 1998 Jul 261998 Jul 30

Other

OtherProceedings of the 1998 15th National Conference on Artificial Intelligence, AAAI
CityMadison, WI, USA
Period98/7/2698/7/30

Fingerprint

Ontology
Acoustic waves
Knowledge representation
Terminology
Speech recognition
Interfaces (computer)

ASJC Scopus subject areas

  • Software

Cite this

Nakatani, T., & Okuno, H. G. (1998). Sound ontology for computational auditory scene analysis. In Anon (Ed.), Proceedings of the National Conference on Artificial Intelligence (pp. 1004-1010). Menlo Park, CA, United States: AAAI.

Sound ontology for computational auditory scene analysis. / Nakatani, Tomohiro; Okuno, Hiroshi G.

Proceedings of the National Conference on Artificial Intelligence. ed. / Anon. Menlo Park, CA, United States : AAAI, 1998. p. 1004-1010.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Nakatani, T & Okuno, HG 1998, Sound ontology for computational auditory scene analysis. in Anon (ed.), Proceedings of the National Conference on Artificial Intelligence. AAAI, Menlo Park, CA, United States, pp. 1004-1010, Proceedings of the 1998 15th National Conference on Artificial Intelligence, AAAI, Madison, WI, USA, 98/7/26.
Nakatani T, Okuno HG. Sound ontology for computational auditory scene analysis. In Anon, editor, Proceedings of the National Conference on Artificial Intelligence. Menlo Park, CA, United States: AAAI. 1998. p. 1004-1010
Nakatani, Tomohiro ; Okuno, Hiroshi G. / Sound ontology for computational auditory scene analysis. Proceedings of the National Conference on Artificial Intelligence. editor / Anon. Menlo Park, CA, United States : AAAI, 1998. pp. 1004-1010
@inproceedings{164a9acd1bc742cea388d452bb35598e,
title = "Sound ontology for computational auditory scene analysis",
abstract = "This paper proposes that sound ontology should be both as a common vocabulary for sound representation and as a common terminology for integrating various sound stream segregation systems. Since research on computational auditory scene analysis (CASA) focuses on recognizing and understanding various kinds of sounds, sound stream segregation which extracts each sound stream from a mixture of sounds is essential for CASA. Even if sound stream segregation systems use a harmonic structure of sound as a cue of segregation, it is not easy to integrate such systems because the definition of a harmonic structure differs or the precision of extracted harmonic structures differs. Therefore, sound ontology is needed as a common knowledge representation of sounds. Another problem is to interface sound stream segregation systems with applications such as automatic speech recognition systems. Since the requirement of the quality of segregated sound streams depends on applications, sound stream segregation systems must provide a flexible interface. Therefore, sound ontology is needed to fulfill the requirements imposed by them. In addition, the hierarchical structure of sound ontology provides a means of controlling top-down and bottom-up processing of sound stream segregation.",
author = "Tomohiro Nakatani and Okuno, {Hiroshi G.}",
year = "1998",
language = "English",
pages = "1004--1010",
editor = "Anon",
booktitle = "Proceedings of the National Conference on Artificial Intelligence",
publisher = "AAAI",

}

TY - GEN

T1 - Sound ontology for computational auditory scene analysis

AU - Nakatani, Tomohiro

AU - Okuno, Hiroshi G.

PY - 1998

Y1 - 1998

N2 - This paper proposes that sound ontology should be both as a common vocabulary for sound representation and as a common terminology for integrating various sound stream segregation systems. Since research on computational auditory scene analysis (CASA) focuses on recognizing and understanding various kinds of sounds, sound stream segregation which extracts each sound stream from a mixture of sounds is essential for CASA. Even if sound stream segregation systems use a harmonic structure of sound as a cue of segregation, it is not easy to integrate such systems because the definition of a harmonic structure differs or the precision of extracted harmonic structures differs. Therefore, sound ontology is needed as a common knowledge representation of sounds. Another problem is to interface sound stream segregation systems with applications such as automatic speech recognition systems. Since the requirement of the quality of segregated sound streams depends on applications, sound stream segregation systems must provide a flexible interface. Therefore, sound ontology is needed to fulfill the requirements imposed by them. In addition, the hierarchical structure of sound ontology provides a means of controlling top-down and bottom-up processing of sound stream segregation.

AB - This paper proposes that sound ontology should be both as a common vocabulary for sound representation and as a common terminology for integrating various sound stream segregation systems. Since research on computational auditory scene analysis (CASA) focuses on recognizing and understanding various kinds of sounds, sound stream segregation which extracts each sound stream from a mixture of sounds is essential for CASA. Even if sound stream segregation systems use a harmonic structure of sound as a cue of segregation, it is not easy to integrate such systems because the definition of a harmonic structure differs or the precision of extracted harmonic structures differs. Therefore, sound ontology is needed as a common knowledge representation of sounds. Another problem is to interface sound stream segregation systems with applications such as automatic speech recognition systems. Since the requirement of the quality of segregated sound streams depends on applications, sound stream segregation systems must provide a flexible interface. Therefore, sound ontology is needed to fulfill the requirements imposed by them. In addition, the hierarchical structure of sound ontology provides a means of controlling top-down and bottom-up processing of sound stream segregation.

UR - http://www.scopus.com/inward/record.url?scp=0031632818&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0031632818&partnerID=8YFLogxK

M3 - Conference contribution

SP - 1004

EP - 1010

BT - Proceedings of the National Conference on Artificial Intelligence

A2 - Anon, null

PB - AAAI

CY - Menlo Park, CA, United States

ER -