Managing out-of-grammar utterances by topic estimation with domain extensibility in multi-domain spoken dialogue systems

Kazunori Komatani, Satoshi Ikeda, Tetsuya Ogata, Hiroshi G. Okuno

Research output: Contribution to journalArticle

6 Citations (Scopus)

Abstract

Spoken dialogue systems must inevitably deal with out-of-grammar utterances. We address this problem in multi-domain spoken dialogue systems, which deal with more tasks than a single-domain system. We defined a topic by augmenting a domain about which users want to find more information, and we developed a method of recovering out-of-grammar utterances based on topic estimation, i.e., by providing a help message in the estimated domain. Moreover, domain extensibility, that is, the ability to add new domains to the system, should be inherently retained in multi-domain systems. To estimate domains without sacrificing extensibility, we collected documents from the Web as training data. Since the data contained a certain amount of noise, we used latent semantic mapping (LSM), which enables robust topic estimation by removing the effects of noise from the data. Experimental results showed that our method improved topic estimation accuracy by 23.2 points for data including out-of-grammar utterances.

Original languageEnglish
Pages (from-to)863-870
Number of pages8
JournalSpeech Communication
Volume50
Issue number10
DOIs
Publication statusPublished - 2008 Oct
Externally publishedYes

Fingerprint

Spoken Dialogue Systems
Grammar
grammar
communication technology
Semantics
semantics
Utterance
ability
Experimental Results
Estimate

Keywords

  • Domain extensibility
  • Multi-domain spoken dialogue system
  • Out-of-grammar utterance
  • Topic estimation

ASJC Scopus subject areas

  • Modelling and Simulation
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Software
  • Communication
  • Linguistics and Language

Cite this

Managing out-of-grammar utterances by topic estimation with domain extensibility in multi-domain spoken dialogue systems. / Komatani, Kazunori; Ikeda, Satoshi; Ogata, Tetsuya; Okuno, Hiroshi G.

In: Speech Communication, Vol. 50, No. 10, 10.2008, p. 863-870.

Research output: Contribution to journalArticle

@article{fc52207ad28e4816bb35fa3d5e5db92b,
title = "Managing out-of-grammar utterances by topic estimation with domain extensibility in multi-domain spoken dialogue systems",
abstract = "Spoken dialogue systems must inevitably deal with out-of-grammar utterances. We address this problem in multi-domain spoken dialogue systems, which deal with more tasks than a single-domain system. We defined a topic by augmenting a domain about which users want to find more information, and we developed a method of recovering out-of-grammar utterances based on topic estimation, i.e., by providing a help message in the estimated domain. Moreover, domain extensibility, that is, the ability to add new domains to the system, should be inherently retained in multi-domain systems. To estimate domains without sacrificing extensibility, we collected documents from the Web as training data. Since the data contained a certain amount of noise, we used latent semantic mapping (LSM), which enables robust topic estimation by removing the effects of noise from the data. Experimental results showed that our method improved topic estimation accuracy by 23.2 points for data including out-of-grammar utterances.",
keywords = "Domain extensibility, Multi-domain spoken dialogue system, Out-of-grammar utterance, Topic estimation",
author = "Kazunori Komatani and Satoshi Ikeda and Tetsuya Ogata and Okuno, {Hiroshi G.}",
year = "2008",
month = "10",
doi = "10.1016/j.specom.2008.05.010",
language = "English",
volume = "50",
pages = "863--870",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",
number = "10",

}

TY - JOUR

T1 - Managing out-of-grammar utterances by topic estimation with domain extensibility in multi-domain spoken dialogue systems

AU - Komatani, Kazunori

AU - Ikeda, Satoshi

AU - Ogata, Tetsuya

AU - Okuno, Hiroshi G.

PY - 2008/10

Y1 - 2008/10

N2 - Spoken dialogue systems must inevitably deal with out-of-grammar utterances. We address this problem in multi-domain spoken dialogue systems, which deal with more tasks than a single-domain system. We defined a topic by augmenting a domain about which users want to find more information, and we developed a method of recovering out-of-grammar utterances based on topic estimation, i.e., by providing a help message in the estimated domain. Moreover, domain extensibility, that is, the ability to add new domains to the system, should be inherently retained in multi-domain systems. To estimate domains without sacrificing extensibility, we collected documents from the Web as training data. Since the data contained a certain amount of noise, we used latent semantic mapping (LSM), which enables robust topic estimation by removing the effects of noise from the data. Experimental results showed that our method improved topic estimation accuracy by 23.2 points for data including out-of-grammar utterances.

AB - Spoken dialogue systems must inevitably deal with out-of-grammar utterances. We address this problem in multi-domain spoken dialogue systems, which deal with more tasks than a single-domain system. We defined a topic by augmenting a domain about which users want to find more information, and we developed a method of recovering out-of-grammar utterances based on topic estimation, i.e., by providing a help message in the estimated domain. Moreover, domain extensibility, that is, the ability to add new domains to the system, should be inherently retained in multi-domain systems. To estimate domains without sacrificing extensibility, we collected documents from the Web as training data. Since the data contained a certain amount of noise, we used latent semantic mapping (LSM), which enables robust topic estimation by removing the effects of noise from the data. Experimental results showed that our method improved topic estimation accuracy by 23.2 points for data including out-of-grammar utterances.

KW - Domain extensibility

KW - Multi-domain spoken dialogue system

KW - Out-of-grammar utterance

KW - Topic estimation

UR - http://www.scopus.com/inward/record.url?scp=52949125236&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=52949125236&partnerID=8YFLogxK

U2 - 10.1016/j.specom.2008.05.010

DO - 10.1016/j.specom.2008.05.010

M3 - Article

AN - SCOPUS:52949125236

VL - 50

SP - 863

EP - 870

JO - Speech Communication

JF - Speech Communication

SN - 0167-6393

IS - 10

ER -