Experimental Application of a Japanese Historical Document Image Synthesis Method to Text Line Segmentation

Naoto Inuzuka, Tetsuya Suzuki

2021

Abstract

We plan to use a text line segmentation method based on machine learning in our transcription support system for handwritten Japanese historical document in Kana, and are searching for a data synthesis method of annotated document images because it is time consuming to manually annotate a large set of document images for training data for machine learning. In this paper, we report our synthesis method of annotated document images designed for a Japanese historical document. To compare manually annotated Japanese historical document images and annotated document images synthesized by the method as training data for an object detection algorithm YOLOv3, we conducted text line segmentation experiments using the object detection algorithm. The experimental results show that a model trained by the synthetic annotated document images are competitive with that trained by the manually annotated document images from the view point of a metric intersection-over-union.

Download


Paper Citation


in Harvard Style

Inuzuka N. and Suzuki T. (2021). Experimental Application of a Japanese Historical Document Image Synthesis Method to Text Line Segmentation.In Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-486-2, pages 628-634. DOI: 10.5220/0010330206280634


in Bibtex Style

@conference{icpram21,
author={Naoto Inuzuka and Tetsuya Suzuki},
title={Experimental Application of a Japanese Historical Document Image Synthesis Method to Text Line Segmentation},
booktitle={Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2021},
pages={628-634},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010330206280634},
isbn={978-989-758-486-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Experimental Application of a Japanese Historical Document Image Synthesis Method to Text Line Segmentation
SN - 978-989-758-486-2
AU - Inuzuka N.
AU - Suzuki T.
PY - 2021
SP - 628
EP - 634
DO - 10.5220/0010330206280634