Application of auditory image model for speech event detection

Minoru Tsuzaki*, Satomi Tanaka, Hiroaki Kato, Yoshinori Sagisaka

*Corresponding author for this work

Research output: Contribution to conference › Paper › peer-review

1 Citation (Scopus)

Abstract

To provide an appropriate model for the perception of the temporal structure of speech, we applied a comprehensive computational model of the human auditory periphery to detect changes in speech signals that potentially indicate the arrival of new events. In each tonotopic sub-band, an increase in the activation level was counted as evidence for the plausibility of a new event, while a decrease was ignored. The total contour obtained by integrating the sub-band information exhibited sharper peaks and dips than the loudness contour. A quantitative evaluation on estimating the speaking rate of natural speech also demonstrated that the event-plausibility model performs better than the loudness model.
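The sub-band rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the auditory front end is not reproduced, the frame-to-frame difference as the measure of "increase", the plain sum as the cross-band integration, and the names `event_plausibility`, `summed_level_change`, and `frames` are all assumptions made for illustration. The second function is only a crude stand-in for a single overall-level contour, not the loudness model evaluated in the paper.

```python
def event_plausibility(activation):
    """Per-transition event plausibility from sub-band activation levels.

    activation: list of frames, each a list of per-band activation levels
    (hypothetical input; the paper derives these from an auditory model).
    """
    contour = []
    for prev, curr in zip(activation, activation[1:]):
        # Keep only increases in each sub-band; decreases contribute zero.
        rises = [max(c - p, 0.0) for p, c in zip(prev, curr)]
        # Integrate across sub-bands (assumed here to be a plain sum).
        contour.append(sum(rises))
    return contour


def summed_level_change(activation):
    """Change in the band-summed level, for contrast (illustrative only)."""
    totals = [sum(frame) for frame in activation]
    return [c - p for p, c in zip(totals, totals[1:])]


# Toy data: 5 frames x 3 sub-bands. At the fourth frame, a new event
# starts in band 3 while bands 1 and 2 are decaying.
frames = [
    [0.0, 0.0, 0.0],
    [1.0, 0.5, 0.0],
    [1.0, 0.5, 0.0],
    [0.2, 0.1, 2.0],
    [0.0, 0.0, 1.0],
]

contour = event_plausibility(frames)
level_change = summed_level_change(frames)
print(contour)  # [1.5, 0.0, 2.0, 0.0]
print(level_change)
```

In this toy case the onset in band 3 registers at full size in the plausibility contour, whereas in the band-summed change it is partly cancelled by the simultaneous decay in the other bands.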

Original language: English
Pages: 677-680
Number of pages: 4
Publication status: Published - 2005
Event: 9th European Conference on Speech Communication and Technology - Lisbon, Portugal
Duration: 2005 Sep 4 to 2005 Sep 8

Conference

Conference: 9th European Conference on Speech Communication and Technology
Country/Territory: Portugal
City: Lisbon
Period: 2005 Sep 4 to 2005 Sep 8

ASJC Scopus subject areas

  • Engineering (all)
