Using ASR methods for OCR

Ashish Arora, Paola Garcia, Shinji Watanabe, Vimal Manohar, Yiwen Shao, Sanjeev Khudanpur, Chun Chieh Chang, Babak Rekabdar, Bagher Babaali, Daniel Povey, David Etter, Desh Raj, Hossein Hadian, Jan Trmal

Research output: Conference contribution

8 Citations (Scopus)

Abstract

Hybrid deep neural network hidden Markov models (DNN-HMM) have achieved impressive results on large vocabulary continuous speech recognition (LVCSR) tasks. However, recent DNN-HMM approaches have not been explored much for text recognition. Inspired by current work in automatic speech recognition (ASR) and machine translation, we present an open-vocabulary sub-word text recognition system. The sub-word lexicon and sub-word language model (LM) help in overcoming the challenge of recognizing out-of-vocabulary (OOV) words, and a DNN-HMM optical model (OM) based on a time delay neural network (TDNN) and a convolutional neural network (CNN) efficiently models the sequence dependency in the line image. We present results on 12 datasets with training data varying from 6k lines to 600k lines. The system is built for 8 languages: English, French, Arabic, Chinese, Farsi, Tamil, Russian, and Korean. We report competitive results on several commonly used handwritten and printed text datasets.

Original language: English
Title of host publication: Proceedings - 15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019
Publisher: IEEE Computer Society
Pages: 663-668
Number of pages: 6
ISBN (Electronic): 9781728128610
DOI
Publication status: Published - Sep 2019
Externally published: Yes
Event: 15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019 - Sydney, Australia
Duration: 20 Sep 2019 to 25 Sep 2019

Publication series

Name: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
ISSN (Print): 1520-5363

Conference

Conference: 15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019
Country/Territory: Australia
City: Sydney
Period: 20 Sep 2019 to 25 Sep 2019

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
