Speech enhancement based on multiple directivity patterns using a microphone array

Toshiyuki Sekiya, Tetsunori Kobayashi

    研究成果: Conference contribution

    1 引用 (Scopus)

    抄録

    A novel speech segregation method using a microphone array with multiple directivities is proposed and applied to speech recognition under existence of disturbance speech. Conventional microphone array techniques use only single directivity of their own. It is very difficult for this kind of array technique to remove the influence of the disturbance. In our method, redundant simultaneous equations of the amplitudes of sound sources are generated by using these multiple directivities. The solution of these equations gives good estimates of disturbances. The spectral subtraction is applied with these estimates of disturbances, and the perfect enhancement of target speech is performed. The experimental results of double talk recognition with 20 K vocabulary show that the proposed enhancement technique is effective to achieve 45% error reduction.

    元の言語English
    ホスト出版物のタイトルICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
    1
    出版物ステータスPublished - 2004
    イベントProceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing - Montreal, Que, Canada
    継続期間: 2004 5 172004 5 21

    Other

    OtherProceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing
    Canada
    Montreal, Que
    期間04/5/1704/5/21

    Fingerprint

    Speech enhancement
    directivity
    Microphones
    microphones
    disturbances
    augmentation
    simultaneous equations
    Speech recognition
    speech recognition
    estimates
    subtraction
    Acoustic waves
    acoustics

    ASJC Scopus subject areas

    • Electrical and Electronic Engineering
    • Signal Processing
    • Acoustics and Ultrasonics

    これを引用

    Sekiya, T., & Kobayashi, T. (2004). Speech enhancement based on multiple directivity patterns using a microphone array. : ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (巻 1)

    Speech enhancement based on multiple directivity patterns using a microphone array. / Sekiya, Toshiyuki; Kobayashi, Tetsunori.

    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 巻 1 2004.

    研究成果: Conference contribution

    Sekiya, T & Kobayashi, T 2004, Speech enhancement based on multiple directivity patterns using a microphone array. : ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 巻. 1, Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing, Montreal, Que, Canada, 04/5/17.
    Sekiya T, Kobayashi T. Speech enhancement based on multiple directivity patterns using a microphone array. : ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 巻 1. 2004
    Sekiya, Toshiyuki ; Kobayashi, Tetsunori. / Speech enhancement based on multiple directivity patterns using a microphone array. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 巻 1 2004.
    @inproceedings{7646868dab164e2bbff9ed2d5e79e6bd,
    title = "Speech enhancement based on multiple directivity patterns using a microphone array",
    abstract = "A novel speech segregation method using a microphone array with multiple directivities is proposed and applied to speech recognition under existence of disturbance speech. Conventional microphone array techniques use only single directivity of their own. It is very difficult for this kind of array technique to remove the influence of the disturbance. In our method, redundant simultaneous equations of the amplitudes of sound sources are generated by using these multiple directivities. The solution of these equations gives good estimates of disturbances. The spectral subtraction is applied with these estimates of disturbances, and the perfect enhancement of target speech is performed. The experimental results of double talk recognition with 20 K vocabulary show that the proposed enhancement technique is effective to achieve 45{\%} error reduction.",
    author = "Toshiyuki Sekiya and Tetsunori Kobayashi",
    year = "2004",
    language = "English",
    volume = "1",
    booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

    }

    TY - GEN

    T1 - Speech enhancement based on multiple directivity patterns using a microphone array

    AU - Sekiya, Toshiyuki

    AU - Kobayashi, Tetsunori

    PY - 2004

    Y1 - 2004

    N2 - A novel speech segregation method using a microphone array with multiple directivities is proposed and applied to speech recognition under existence of disturbance speech. Conventional microphone array techniques use only single directivity of their own. It is very difficult for this kind of array technique to remove the influence of the disturbance. In our method, redundant simultaneous equations of the amplitudes of sound sources are generated by using these multiple directivities. The solution of these equations gives good estimates of disturbances. The spectral subtraction is applied with these estimates of disturbances, and the perfect enhancement of target speech is performed. The experimental results of double talk recognition with 20 K vocabulary show that the proposed enhancement technique is effective to achieve 45% error reduction.

    AB - A novel speech segregation method using a microphone array with multiple directivities is proposed and applied to speech recognition under existence of disturbance speech. Conventional microphone array techniques use only single directivity of their own. It is very difficult for this kind of array technique to remove the influence of the disturbance. In our method, redundant simultaneous equations of the amplitudes of sound sources are generated by using these multiple directivities. The solution of these equations gives good estimates of disturbances. The spectral subtraction is applied with these estimates of disturbances, and the perfect enhancement of target speech is performed. The experimental results of double talk recognition with 20 K vocabulary show that the proposed enhancement technique is effective to achieve 45% error reduction.

    UR - http://www.scopus.com/inward/record.url?scp=4544377381&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=4544377381&partnerID=8YFLogxK

    M3 - Conference contribution

    VL - 1

    BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

    ER -