Application of auditory image model for speech event detection

Minoru Tsuzaki, Satomi Tanaka, Hiroaki Kato, Yoshinori Sagisaka

Research output: Contribution to conferencePaper

1 Citation (Scopus)

Abstract

To provide an appropriate model for perception of temporal structures of speech, we applied a comprehensive computational model of the human auditory peripherals to detect changes in speech signals that potentially indicate arrivals of new events. In each tonotopic sub-band, an increase in the activation level was taken into account for the plausibility of a new event, while a decrease was ignored. The total contour obtained by integrating the sub-band information exhibited sharp peaks and dips compared to the loudness contour. A quantitative evaluation to estimate the speaking rate of natural speech also demonstrated that the event-plausibility model performs better than the loudness model.

Original languageEnglish
Pages677-680
Number of pages4
Publication statusPublished - 2005 Dec 1
Event9th European Conference on Speech Communication and Technology - Lisbon, Portugal
Duration: 2005 Sep 42005 Sep 8

Conference

Conference9th European Conference on Speech Communication and Technology
CountryPortugal
CityLisbon
Period05/9/405/9/8

ASJC Scopus subject areas

  • Engineering(all)

Fingerprint Dive into the research topics of 'Application of auditory image model for speech event detection'. Together they form a unique fingerprint.

  • Cite this

    Tsuzaki, M., Tanaka, S., Kato, H., & Sagisaka, Y. (2005). Application of auditory image model for speech event detection. 677-680. Paper presented at 9th European Conference on Speech Communication and Technology, Lisbon, Portugal.