Character recognition in Japanese historical documents via adaptive multi-region model

Yueyu Wang, Seiichiro Kamata

研究成果: Conference contribution

抄録

In this work, we introduce a novel model with an adaptive multi-region extraction network to grasp multi-aspect of discriminative features, because feature inside bounding box is insufficient for classification, and normal models are sensitive to inaccuracy of predicted bounding boxes. We use the new model to recognize Japanese from historical documents. This model can be trained end-to-end without any extra supervision. The resulting CNN-based representation has abundant of features, containing the contextual information together with center part information. These features are helpful and crucial for classification. Based on this model, we also propose a data augmentation method using both local and global data distortion to generate diversified samples in order to solve the problem of data imbalance. Experiments show that with the usage of our model, we get a better result in ancient Japanese dataset.

元の言語English
ホスト出版物のタイトル2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018
出版者Institute of Electrical and Electronics Engineers Inc.
ページ404-409
ページ数6
ISBN(電子版)9781538651612
DOI
出版物ステータスPublished - 2019 2 12
イベントJoint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018 - Kitakyushu, Japan
継続期間: 2018 6 252018 6 28

出版物シリーズ

名前2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018

Conference

ConferenceJoint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018
Japan
Kitakyushu
期間18/6/2518/6/28

Fingerprint

Character recognition
Character Recognition
Model
Data Augmentation
Experiment
Experiments

ASJC Scopus subject areas

  • Signal Processing
  • Control and Optimization
  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Information Systems

これを引用

Wang, Y., & Kamata, S. (2019). Character recognition in Japanese historical documents via adaptive multi-region model. : 2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018 (pp. 404-409). [8641033] (2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICIEV.2018.8641033

Character recognition in Japanese historical documents via adaptive multi-region model. / Wang, Yueyu; Kamata, Seiichiro.

2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018. Institute of Electrical and Electronics Engineers Inc., 2019. p. 404-409 8641033 (2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018).

研究成果: Conference contribution

Wang, Y & Kamata, S 2019, Character recognition in Japanese historical documents via adaptive multi-region model. : 2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018., 8641033, 2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018, Institute of Electrical and Electronics Engineers Inc., pp. 404-409, Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018, Kitakyushu, Japan, 18/6/25. https://doi.org/10.1109/ICIEV.2018.8641033
Wang Y, Kamata S. Character recognition in Japanese historical documents via adaptive multi-region model. : 2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018. Institute of Electrical and Electronics Engineers Inc. 2019. p. 404-409. 8641033. (2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018). https://doi.org/10.1109/ICIEV.2018.8641033
Wang, Yueyu ; Kamata, Seiichiro. / Character recognition in Japanese historical documents via adaptive multi-region model. 2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 404-409 (2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018).
@inproceedings{8ddb3fbd7ed04059a6a52d801f7f4280,
title = "Character recognition in Japanese historical documents via adaptive multi-region model",
abstract = "In this work, we introduce a novel model with an adaptive multi-region extraction network to grasp multi-aspect of discriminative features, because feature inside bounding box is insufficient for classification, and normal models are sensitive to inaccuracy of predicted bounding boxes. We use the new model to recognize Japanese from historical documents. This model can be trained end-to-end without any extra supervision. The resulting CNN-based representation has abundant of features, containing the contextual information together with center part information. These features are helpful and crucial for classification. Based on this model, we also propose a data augmentation method using both local and global data distortion to generate diversified samples in order to solve the problem of data imbalance. Experiments show that with the usage of our model, we get a better result in ancient Japanese dataset.",
author = "Yueyu Wang and Seiichiro Kamata",
year = "2019",
month = "2",
day = "12",
doi = "10.1109/ICIEV.2018.8641033",
language = "English",
series = "2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "404--409",
booktitle = "2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018",

}

TY - GEN

T1 - Character recognition in Japanese historical documents via adaptive multi-region model

AU - Wang, Yueyu

AU - Kamata, Seiichiro

PY - 2019/2/12

Y1 - 2019/2/12

N2 - In this work, we introduce a novel model with an adaptive multi-region extraction network to grasp multi-aspect of discriminative features, because feature inside bounding box is insufficient for classification, and normal models are sensitive to inaccuracy of predicted bounding boxes. We use the new model to recognize Japanese from historical documents. This model can be trained end-to-end without any extra supervision. The resulting CNN-based representation has abundant of features, containing the contextual information together with center part information. These features are helpful and crucial for classification. Based on this model, we also propose a data augmentation method using both local and global data distortion to generate diversified samples in order to solve the problem of data imbalance. Experiments show that with the usage of our model, we get a better result in ancient Japanese dataset.

AB - In this work, we introduce a novel model with an adaptive multi-region extraction network to grasp multi-aspect of discriminative features, because feature inside bounding box is insufficient for classification, and normal models are sensitive to inaccuracy of predicted bounding boxes. We use the new model to recognize Japanese from historical documents. This model can be trained end-to-end without any extra supervision. The resulting CNN-based representation has abundant of features, containing the contextual information together with center part information. These features are helpful and crucial for classification. Based on this model, we also propose a data augmentation method using both local and global data distortion to generate diversified samples in order to solve the problem of data imbalance. Experiments show that with the usage of our model, we get a better result in ancient Japanese dataset.

UR - http://www.scopus.com/inward/record.url?scp=85063223673&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85063223673&partnerID=8YFLogxK

U2 - 10.1109/ICIEV.2018.8641033

DO - 10.1109/ICIEV.2018.8641033

M3 - Conference contribution

T3 - 2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018

SP - 404

EP - 409

BT - 2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018

PB - Institute of Electrical and Electronics Engineers Inc.

ER -