Speech enhancement based on multiple directivity patterns using a microphone array

Toshiyuki Sekiya, Tetsunori Kobayashi

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    1 Citation (Scopus)

    Abstract

    A novel speech segregation method that uses a microphone array with multiple directivity patterns is proposed and applied to speech recognition in the presence of disturbance speech. Conventional microphone array techniques use only a single directivity pattern, which makes it very difficult to remove the influence of the disturbance. In the proposed method, the multiple directivity patterns are used to generate a redundant set of simultaneous equations in the amplitudes of the sound sources. Solving these equations gives good estimates of the disturbances. Spectral subtraction is then applied using these estimates to enhance the target speech. Experimental results on double-talk recognition with a 20K-word vocabulary show that the proposed enhancement technique achieves a 45% error reduction.
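
The core idea in the abstract (several directivity patterns yield more equations than unknown source amplitudes, and the solution feeds a spectral subtraction step) can be sketched for a single frequency bin as follows. This is a minimal illustration, not the authors' implementation: the gain matrix `G`, the linear amplitude-mixing model, the least-squares solver, the function names, and the `floor` parameter are all assumptions introduced here, since the abstract does not give these details.

```python
from typing import List

Matrix = List[List[float]]


def estimate_amplitudes(G: Matrix, y: List[float]) -> List[float]:
    """Least-squares solution of the redundant system G a = y.

    G[m][k] is the (assumed known) gain of directivity pattern m towards
    source k, and y[m] is the amplitude observed at the output of pattern m.
    The normal equations (G^T G) a = G^T y are solved by Gaussian elimination.
    """
    m, k = len(G), len(G[0])
    A = [[sum(G[r][i] * G[r][j] for r in range(m)) for j in range(k)]
         for i in range(k)]
    b = [sum(G[r][i] * y[r] for r in range(m)) for i in range(k)]

    # Forward elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("patterns do not separate the sources")
        A[col], A[pivot] = A[pivot], A[col]
        b[col], b[pivot] = b[pivot], b[col]
        for r in range(col + 1, k):
            f = A[r][col] / A[col][col]
            for c in range(col, k):
                A[r][c] -= f * A[col][c]
            b[r] -= f * b[col]

    # Back substitution.
    a = [0.0] * k
    for i in range(k - 1, -1, -1):
        s = sum(A[i][j] * a[j] for j in range(i + 1, k))
        a[i] = (b[i] - s) / A[i][i]
    return a


def spectral_subtract(y_target: float, gains: List[float],
                      amplitudes: List[float], target: int = 0,
                      floor: float = 0.05) -> float:
    """Subtract the estimated disturbance leakage from one pattern's output.

    gains is the row of G for the pattern steered at the target source.
    The result is floored at floor * y_target (an assumed safeguard) so the
    enhanced amplitude never becomes negative.
    """
    disturbance = sum(g * a for j, (g, a) in enumerate(zip(gains, amplitudes))
                      if j != target)
    return max(y_target - disturbance, floor * y_target)


if __name__ == "__main__":
    # Hypothetical example: 3 directivity patterns, 2 sources (target first).
    G = [[1.0, 0.3], [0.2, 1.0], [0.6, 0.6]]
    y = [2.3, 1.4, 1.8]  # outputs generated from amplitudes [2.0, 1.0]
    a = estimate_amplitudes(G, y)
    print("estimated source amplitudes:", a)
    print("enhanced target amplitude:", spectral_subtract(y[0], G[0], a))
```

In this toy case three patterns observe two sources, so the system is redundant and the least-squares estimate recovers amplitudes close to the values used to generate `y`. A real system would repeat this for every frequency bin and time frame.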

    Original language: English
    Title of host publication: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
    Volume: 1
    Publication status: Published - 2004
    Event: Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing - Montreal, Que, Canada
    Duration: 2004 May 17 to 2004 May 21

    Other

    Other: Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing
    Country: Canada
    City: Montreal, Que
    Period: 04/5/17 to 04/5/21

    Fingerprint

    Speech enhancement
    directivity
    Microphones
    disturbances
    augmentation
    simultaneous equations
    Speech recognition
    estimates
    subtraction
    Acoustic waves
    acoustics

    ASJC Scopus subject areas

    • Electrical and Electronic Engineering
    • Signal Processing
    • Acoustics and Ultrasonics

    Cite this

    Sekiya, T., & Kobayashi, T. (2004). Speech enhancement based on multiple directivity patterns using a microphone array. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 1)

    @inproceedings{7646868dab164e2bbff9ed2d5e79e6bd,
    title = "Speech enhancement based on multiple directivity patterns using a microphone array",
    abstract = "A novel speech segregation method that uses a microphone array with multiple directivity patterns is proposed and applied to speech recognition in the presence of disturbance speech. Conventional microphone array techniques use only a single directivity pattern, which makes it very difficult to remove the influence of the disturbance. In the proposed method, the multiple directivity patterns are used to generate a redundant set of simultaneous equations in the amplitudes of the sound sources. Solving these equations gives good estimates of the disturbances. Spectral subtraction is then applied using these estimates to enhance the target speech. Experimental results on double-talk recognition with a 20K-word vocabulary show that the proposed enhancement technique achieves a 45{\%} error reduction.",
    author = "Toshiyuki Sekiya and Tetsunori Kobayashi",
    year = "2004",
    language = "English",
    volume = "1",
    booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

    }

    TY - GEN

    T1 - Speech enhancement based on multiple directivity patterns using a microphone array

    AU - Sekiya, Toshiyuki

    AU - Kobayashi, Tetsunori

    PY - 2004

    Y1 - 2004

    N2 - A novel speech segregation method that uses a microphone array with multiple directivity patterns is proposed and applied to speech recognition in the presence of disturbance speech. Conventional microphone array techniques use only a single directivity pattern, which makes it very difficult to remove the influence of the disturbance. In the proposed method, the multiple directivity patterns are used to generate a redundant set of simultaneous equations in the amplitudes of the sound sources. Solving these equations gives good estimates of the disturbances. Spectral subtraction is then applied using these estimates to enhance the target speech. Experimental results on double-talk recognition with a 20K-word vocabulary show that the proposed enhancement technique achieves a 45% error reduction.

    UR - http://www.scopus.com/inward/record.url?scp=4544377381&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=4544377381&partnerID=8YFLogxK

    M3 - Conference contribution

    AN - SCOPUS:4544377381

    VL - 1

    BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

    ER -