Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank

Satomi Tanaka, Minoru Tsuzaki, Hiroaki Kato, Yoshinori Sagisaka

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    1 Citation (Scopus)

    Abstract

    To simulate the perceptual extraction of temporal structures of speech, the authors have been proposing an event-plausibility model that detects the occurrence of subevents in continuous speech signals based on auditory processing. One of its core components is the filterbank module that simulates the mechanical frequency analysis of the basilar membrane in the cochlea. In this paper, the output of the new model, which uses a dynamic compressive gammachirp (dcGC) auditory filterbank, was compared with that of the previous model, which uses a gammatone auditory filterbank. The most important difference between these filterbanks is the nonlinear, dynamic level dependence of the new filter; the previous filterbank was linear. Simulation results revealed no significant advantage of the new filter (dcGC) for event detection by the event-plausibility model, which suggests that the model's algorithm is robust against differences in peripheral auditory processing.
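As a rough illustration of the linear baseline mentioned in the abstract, the following is a minimal sketch of a gammatone filterbank, not the authors' implementation. It assumes the textbook gammatone impulse response (fourth order, bandwidth factor 1.019) and the Glasberg and Moore ERB formula; the function names, the 16 kHz sampling rate, and the 25 ms impulse-response length are hypothetical choices made only for this example. The dcGC filterbank differs by adding a level-dependent, nonlinear stage, which is not modelled here.

```python
import math

def erb_hz(fc_hz):
    """Equivalent rectangular bandwidth in Hz (Glasberg & Moore formula)."""
    return 24.7 * (4.37 * fc_hz / 1000.0 + 1.0)

def gammatone_ir(fc_hz, fs_hz=16000, duration_s=0.025, order=4, b=1.019):
    """Peak-normalised gammatone impulse response centred on fc_hz.

    All default parameter values are assumptions for illustration.
    """
    n_samples = int(round(duration_s * fs_hz))
    ir = []
    for k in range(n_samples):
        t = k / fs_hz
        # Gamma-shaped envelope multiplied by a cosine carrier at fc_hz.
        envelope = t ** (order - 1) * math.exp(-2.0 * math.pi * b * erb_hz(fc_hz) * t)
        ir.append(envelope * math.cos(2.0 * math.pi * fc_hz * t))
    peak = max(abs(v) for v in ir)
    return [v / peak for v in ir]

def fir_filter(signal, ir):
    """Causal convolution of signal with ir, truncated to len(signal)."""
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k in range(min(n + 1, len(ir))):
            acc += ir[k] * signal[n - k]
        out.append(acc)
    return out

def gammatone_filterbank(signal, centre_freqs_hz, fs_hz=16000):
    """Return one filtered channel per centre frequency."""
    return [fir_filter(signal, gammatone_ir(fc, fs_hz)) for fc in centre_freqs_hz]
```

Because every channel is a fixed convolution, doubling the input level simply doubles each channel's output. That level independence is the property the abstract contrasts with the dcGC filterbank.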

    Original language: English
    Title of host publication: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
    Pages: 493-496
    Number of pages: 4
    Volume: 1
    Publication status: Published - 2007
    Event: 8th Annual Conference of the International Speech Communication Association, Interspeech 2007 - Antwerp
    Duration: 2007 Aug 27 to 2007 Aug 31

    Keywords

    • Auditory image model
    • Auditory processing
    • Event detection
    • Temporal perception

    ASJC Scopus subject areas

    • Computer Science Applications
    • Software
    • Modelling and Simulation
    • Linguistics and Language
    • Communication

    Cite this

    Tanaka, S., Tsuzaki, M., Kato, H., & Sagisaka, Y. (2007). Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Vol. 1, pp. 493-496)

    @inproceedings{8062f126486d4d42a4cd009918505e02,
    title = "Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank",
    abstract = "To simulate the perceptual extraction of temporal structures of speech, the authors have been proposing an event-plausibility model that detects the occurrence of subevents in continuous speech signals based on auditory processing. One of its core components is the filterbank module that simulates the mechanical frequency analysis of the basilar membrane in the cochlea. In this paper, the output of the new model, which uses a dynamic compressive gammachirp (dcGC) auditory filterbank, was compared with that of the previous model, which uses a gammatone auditory filterbank. The most important difference between these filterbanks is the nonlinear, dynamic level dependence of the new filter; the previous filterbank was linear. Simulation results revealed no significant advantage of the new filter (dcGC) for event detection by the event-plausibility model, which suggests that the model's algorithm is robust against differences in peripheral auditory processing.",
    keywords = "Auditory image model, Auditory processing, Event detection, Temporal perception",
    author = "Satomi Tanaka and Minoru Tsuzaki and Hiroaki Kato and Yoshinori Sagisaka",
    year = "2007",
    language = "English",
    isbn = "9781605603162",
    volume = "1",
    pages = "493--496",
    booktitle = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",

    }

    TY  - GEN
    T1  - Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank
    AU  - Tanaka, Satomi
    AU  - Tsuzaki, Minoru
    AU  - Kato, Hiroaki
    AU  - Sagisaka, Yoshinori
    PY  - 2007
    Y1  - 2007
    AB  - To simulate the perceptual extraction of temporal structures of speech, the authors have been proposing an event-plausibility model that detects the occurrence of subevents in continuous speech signals based on auditory processing. One of its core components is the filterbank module that simulates the mechanical frequency analysis of the basilar membrane in the cochlea. In this paper, the output of the new model, which uses a dynamic compressive gammachirp (dcGC) auditory filterbank, was compared with that of the previous model, which uses a gammatone auditory filterbank. The most important difference between these filterbanks is the nonlinear, dynamic level dependence of the new filter; the previous filterbank was linear. Simulation results revealed no significant advantage of the new filter (dcGC) for event detection by the event-plausibility model, which suggests that the model's algorithm is robust against differences in peripheral auditory processing.
    KW  - Auditory image model
    KW  - Auditory processing
    KW  - Event detection
    KW  - Temporal perception
    UR  - http://www.scopus.com/inward/record.url?scp=56149107882&partnerID=8YFLogxK
    UR  - http://www.scopus.com/inward/citedby.url?scp=56149107882&partnerID=8YFLogxK
    M3  - Conference contribution
    AN  - SCOPUS:56149107882
    SN  - 9781605603162
    VL  - 1
    SP  - 493
    EP  - 496
    BT  - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
    ER  -