Speech enhancement using square microphone array for mobile devices

Shintaro Takada, Tetsuji Ogawa, Kenzo Akagiri, Tetsunori Kobayashi

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    4 Citations (Scopus)

    Abstract

    In this paper, we propose a new type of speech enhancement method that is suitable for mobile devices used in noisy environments. For the sake of achieving high-performance speech recognition and auditory perception in the mobile devices, disturbance noises have to be removed under the requirements of a space-saving microphone arrangement and a low computational cost. The proposed method can reduce both the directional and the diffuse noises under the requirements for the mobile devices by applying the square microphone array and the low-cost processing that consists of multiple null beam-forming, their minimum power channel selection and Wiener filtering. The effectiveness of the proposed method is clarified for speech recognition accuracies and speech qualities under the condition in which both the directional and the diffuse noises exist simultaneously: it reduced 40% of recognition errors and improved PESQ-based MOS value by 0.75 point.

    Original languageEnglish
    Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
    Pages313-316
    Number of pages4
    DOIs
    Publication statusPublished - 2008
    Event2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP - Las Vegas, NV
    Duration: 2008 Mar 312008 Apr 4

    Other

    Other2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
    CityLas Vegas, NV
    Period08/3/3108/4/4

    Fingerprint

    Speech enhancement
    Microphones
    microphones
    Mobile devices
    speech recognition
    Speech recognition
    augmentation
    auditory perception
    Wiener filtering
    requirements
    beamforming
    Costs
    disturbances
    costs
    Processing

    Keywords

    • Array signal processing
    • Mobile devices
    • Speech enhancement
    • Speech recognition
    • Wiener tittering

    ASJC Scopus subject areas

    • Electrical and Electronic Engineering
    • Signal Processing
    • Acoustics and Ultrasonics

    Cite this

    Takada, S., Ogawa, T., Akagiri, K., & Kobayashi, T. (2008). Speech enhancement using square microphone array for mobile devices. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp. 313-316). [4517609] https://doi.org/10.1109/ICASSP.2008.4517609

    Speech enhancement using square microphone array for mobile devices. / Takada, Shintaro; Ogawa, Tetsuji; Akagiri, Kenzo; Kobayashi, Tetsunori.

    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2008. p. 313-316 4517609.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Takada, S, Ogawa, T, Akagiri, K & Kobayashi, T 2008, Speech enhancement using square microphone array for mobile devices. in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings., 4517609, pp. 313-316, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Las Vegas, NV, 08/3/31. https://doi.org/10.1109/ICASSP.2008.4517609
    Takada S, Ogawa T, Akagiri K, Kobayashi T. Speech enhancement using square microphone array for mobile devices. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2008. p. 313-316. 4517609 https://doi.org/10.1109/ICASSP.2008.4517609
    Takada, Shintaro ; Ogawa, Tetsuji ; Akagiri, Kenzo ; Kobayashi, Tetsunori. / Speech enhancement using square microphone array for mobile devices. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2008. pp. 313-316
    @inproceedings{de87da342b1c464eaa2a8f5ab85be7f1,
    title = "Speech enhancement using square microphone array for mobile devices",
    abstract = "In this paper, we propose a new type of speech enhancement method that is suitable for mobile devices used in noisy environments. For the sake of achieving high-performance speech recognition and auditory perception in the mobile devices, disturbance noises have to be removed under the requirements of a space-saving microphone arrangement and a low computational cost. The proposed method can reduce both the directional and the diffuse noises under the requirements for the mobile devices by applying the square microphone array and the low-cost processing that consists of multiple null beam-forming, their minimum power channel selection and Wiener filtering. The effectiveness of the proposed method is clarified for speech recognition accuracies and speech qualities under the condition in which both the directional and the diffuse noises exist simultaneously: it reduced 40{\%} of recognition errors and improved PESQ-based MOS value by 0.75 point.",
    keywords = "Array signal processing, Mobile devices, Speech enhancement, Speech recognition, Wiener tittering",
    author = "Shintaro Takada and Tetsuji Ogawa and Kenzo Akagiri and Tetsunori Kobayashi",
    year = "2008",
    doi = "10.1109/ICASSP.2008.4517609",
    language = "English",
    isbn = "1424414849",
    pages = "313--316",
    booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

    }

    TY - GEN

    T1 - Speech enhancement using square microphone array for mobile devices

    AU - Takada, Shintaro

    AU - Ogawa, Tetsuji

    AU - Akagiri, Kenzo

    AU - Kobayashi, Tetsunori

    PY - 2008

    Y1 - 2008

    N2 - In this paper, we propose a new type of speech enhancement method that is suitable for mobile devices used in noisy environments. For the sake of achieving high-performance speech recognition and auditory perception in the mobile devices, disturbance noises have to be removed under the requirements of a space-saving microphone arrangement and a low computational cost. The proposed method can reduce both the directional and the diffuse noises under the requirements for the mobile devices by applying the square microphone array and the low-cost processing that consists of multiple null beam-forming, their minimum power channel selection and Wiener filtering. The effectiveness of the proposed method is clarified for speech recognition accuracies and speech qualities under the condition in which both the directional and the diffuse noises exist simultaneously: it reduced 40% of recognition errors and improved PESQ-based MOS value by 0.75 point.

    AB - In this paper, we propose a new type of speech enhancement method that is suitable for mobile devices used in noisy environments. For the sake of achieving high-performance speech recognition and auditory perception in the mobile devices, disturbance noises have to be removed under the requirements of a space-saving microphone arrangement and a low computational cost. The proposed method can reduce both the directional and the diffuse noises under the requirements for the mobile devices by applying the square microphone array and the low-cost processing that consists of multiple null beam-forming, their minimum power channel selection and Wiener filtering. The effectiveness of the proposed method is clarified for speech recognition accuracies and speech qualities under the condition in which both the directional and the diffuse noises exist simultaneously: it reduced 40% of recognition errors and improved PESQ-based MOS value by 0.75 point.

    KW - Array signal processing

    KW - Mobile devices

    KW - Speech enhancement

    KW - Speech recognition

    KW - Wiener tittering

    UR - http://www.scopus.com/inward/record.url?scp=51449115019&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=51449115019&partnerID=8YFLogxK

    U2 - 10.1109/ICASSP.2008.4517609

    DO - 10.1109/ICASSP.2008.4517609

    M3 - Conference contribution

    AN - SCOPUS:51449115019

    SN - 1424414849

    SN - 9781424414840

    SP - 313

    EP - 316

    BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

    ER -