Large Scale Environmental Sound Classification Based on Efficient Feature Extraction

Xiaoyan Wang, Hao Zhou, Zhi Liu, Yu Gu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In recent years, plenty of studies endeavor to analyze the life auditory scenarios via mining non-speech sounds. Conventional audio recognition schemes clearly bound the feature extraction and recognition stages, such as in speech recognition. However, such separation leads to inconsistency in the purposes at each stage. The recognition stage contributes to portray the global data distribution focusing on 'relationship' between signal samples. However, such consideration can hardly be embedded into feature extraction process which centered on the local structure, thus, the prominent 'relation' information is destroyed. In this paper, we propose a unified acoustic recognition framework taking advantage of primitive feature input without injuring discriminant information and adopting effective classification scheme accordingly. We formulate the sound into subspace representation and initially adopt Grassmannian distance to classify the subspace-indexed non-speech sounds. To validate the proposed framework, we conducted experiments using RWCP Sound Scene Database. The experimental results demonstrated the proposed framework achieved fine recognition performance with high efficiency.

Original languageEnglish
Title of host publicationProceedings - 45th International Conference on Parallel Processing Workshops, ICPPW 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages421-425
Number of pages5
Volume2016-September
ISBN (Electronic)9781509028252
DOIs
Publication statusPublished - 2016 Sept 23
Event45th International Conference on Parallel Processing Workshops, ICPPW 2016 - Philadelphia, United States
Duration: 2016 Aug 162016 Aug 19

Other

Other45th International Conference on Parallel Processing Workshops, ICPPW 2016
Country/TerritoryUnited States
CityPhiladelphia
Period16/8/1616/8/19

Keywords

  • Grassmann manifold
  • non-speech sound recognition
  • spectrogram
  • subspace learning

ASJC Scopus subject areas

  • Software
  • Mathematics(all)
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Large Scale Environmental Sound Classification Based on Efficient Feature Extraction'. Together they form a unique fingerprint.

Cite this