Multi-modal service operation estimation using DNN-based acoustic bag-of-features

Satoshi Tamura, Takuya Uno, Masanori Takehara, Satoru Hayamizu, Takeshi Kurata

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

In service engineering it is important to estimate when and what a worker did, because they include crucial evidences to improve service quality and working environments. For Service Operation Estimation (SOE), acoustic information is one of useful and key modalities; particularly environmental or background sounds include effective cues. This paper focuses on two aspects: (1) extracting powerful and robust acoustic features by using stacked-denoising-autoencoder and bag-of-feature techniques, and (2) investigating a multi-modal SOE scheme by combining the audio features and the other sensor data as well as non-sensor information. We conducted evaluation experiments using multi-modal data recorded in a restaurant. We improved SOE performance in comparison to conventional acoustic features, and effectiveness of our multimodal SOE scheme is also clarified.

Original languageEnglish
Title of host publication2015 23rd European Signal Processing Conference, EUSIPCO 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2291-2295
Number of pages5
ISBN (Electronic)9780992862633
DOIs
Publication statusPublished - 2015 Dec 22
Externally publishedYes
Event23rd European Signal Processing Conference, EUSIPCO 2015 - Nice, France
Duration: 2015 Aug 312015 Sep 4

Publication series

Name2015 23rd European Signal Processing Conference, EUSIPCO 2015

Other

Other23rd European Signal Processing Conference, EUSIPCO 2015
Country/TerritoryFrance
CityNice
Period15/8/3115/9/4

Keywords

  • bag of features
  • environmental sounds
  • multimodal signal processing
  • Service operation estimation
  • stacked denoising autoencoder

ASJC Scopus subject areas

  • Media Technology
  • Computer Vision and Pattern Recognition
  • Signal Processing

Fingerprint

Dive into the research topics of 'Multi-modal service operation estimation using DNN-based acoustic bag-of-features'. Together they form a unique fingerprint.

Cite this