The 2020 ESPnet update: New features, broadened applications, performance improvements, and future plans

Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang

研究成果: Conference contribution

抄録

This paper describes the recent development of ESPnet (https://github.com/espnet/espnet), an end-to-end speech processing toolkit. This project was initiated in December 2017 to mainly deal with end-to-end speech recognition experiments based on sequence-to-sequence modeling. The project has grown rapidly and now covers a wide range of speech processing applications. Now ESPnet also includes text to speech (TTS), voice conversation (VC), speech translation (ST), and speech enhancement (SE) with support for beamforming, speech separation, denoising, and dereverberation. All applications are trained in an end-to-end manner, thanks to the generic sequence to sequence modeling properties, and they can be further integrated and jointly optimized. Also, ESPnet provides reproducible all-in-one recipes for these applications with state-of-the-art performance in various benchmarks by incorporating transformer, advanced data augmentation, and conformer. This project aims to provide up-to-date speech processing experience to the community so that researchers in academia and various industry scales can develop their technologies collaboratively.

本文言語English
ホスト出版物のタイトル2021 IEEE Data Science and Learning Workshop, DSLW 2021
出版社Institute of Electrical and Electronics Engineers Inc.
ISBN(電子版)9781665428255
DOI
出版ステータスPublished - 2021 6 5
外部発表はい
イベント2021 IEEE Data Science and Learning Workshop, DSLW 2021 - Toronto, Canada
継続期間: 2021 6 52021 6 6

出版物シリーズ

名前2021 IEEE Data Science and Learning Workshop, DSLW 2021

Conference

Conference2021 IEEE Data Science and Learning Workshop, DSLW 2021
国/地域Canada
CityToronto
Period21/6/521/6/6

ASJC Scopus subject areas

  • 人工知能
  • 情報システム
  • 教育

フィンガープリント

「The 2020 ESPnet update: New features, broadened applications, performance improvements, and future plans」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル