Acoustic Modeling for Overlapping Speech Recognition: JHU CHiME-5 Challenge System

Vimal Manohar, Szu Jui Chen, Zhiqi Wang, Y. Fujita, Shinji Watanabe, Sanjeev Khudanpur

Research output: Conference contribution

9 Citations (Scopus)

Abstract

This paper summarizes our acoustic modeling efforts in the Johns Hopkins University speech recognition system for the CHiME-5 challenge to recognize highly overlapped dinner party speech recorded by multiple microphone arrays. We explore data augmentation approaches, neural network architectures, front-end speech dereverberation, beamforming and robust i-vector extraction with comparisons of our in-house implementations and publicly available tools. We finally achieved a word error rate of 69.4% on the development set, which is an 11.7% absolute improvement over the previous baseline of 81.1%, and release this improved baseline with refined techniques/tools as an advanced CHiME-5 recipe.

Original language: English
Title of host publication: 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 6665-6669
Number of pages: 5
ISBN (Electronic): 9781479981311
Publication status: Published - May 2019
Externally published: Yes
Event: 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Brighton, United Kingdom
Duration: 12 May 2019 to 17 May 2019

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
2019-May
ISSN (Print): 1520-6149

Conference

Conference: 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019
Country/Territory: United Kingdom
City: Brighton
Period: 12 May 2019 to 17 May 2019

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering
