Analyzing temporal transition of real user's behaviors in a spoken dialogue system

Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Managing various behaviors of real users is indispensable for spoken dialogue systems to operate adequately in real environments. We have analyzed various users' behaviors using data collected over 34 months from the Kyoto City Bus Information System. We focused on "barge-in" and added barge-in rates to our analysis. Temporal transitions of users' behaviors, such as automatic speech recognition (ASR) accuracy, task success rates and barge-in rates, were initially investigated. We then examined the relationship between ASR accuracy and barge-in rates. Analysis revealed that the ASR accuracy of utterances inputted with barge-ins was lower because many novices, who were not accustomed to the timing when to utter, used the system. We also observed that the ASR accuracy of utterances with barge-ins differed based on the barge-in rates of individual users. The results indicate that the barge-in rate can be used as a novel user profile for detecting ASR errors.

Original languageEnglish
Title of host publicationInternational Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007
Pages1837-1840
Number of pages4
Volume3
Publication statusPublished - 2007
Externally publishedYes
Event8th Annual Conference of the International Speech Communication Association, Interspeech 2007 - Antwerp
Duration: 2007 Aug 272007 Aug 31

Other

Other8th Annual Conference of the International Speech Communication Association, Interspeech 2007
CityAntwerp
Period07/8/2707/8/31

Keywords

  • Baree-in
  • Real user behavior
  • Spoken dialogue system

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Modelling and Simulation
  • Linguistics and Language
  • Communication

Fingerprint Dive into the research topics of 'Analyzing temporal transition of real user's behaviors in a spoken dialogue system'. Together they form a unique fingerprint.

Cite this