Table 1: Performance evaluation on OCR-D historical doc-
uments before and after applying page frame detection
method [training images 10 and test images 114].
Dataset Before After
Train 78.15% 84.84%
Test 78.98% 83.47%
Table 2: Performance evaluation on UW-III contemporary
documents before and after applying page frame detection
method [training images 14 and test images 60].
Dataset Before After
Train 87.79% 94.01%
Test 84.75% 91.44%
images from UW-III dataset as contemporary docu-
ments where we used 14 images for training and 60
images for testing.
There is no existing page frame evaluation method
available in the community. Due to that reason, we
decided to evaluate by considering OCRed text output
performance of the Tesseract OCR system. Tesser-
act Fraktur language model has used to do the OCR
over the OCR-D historical document images. Simi-
larly, English + German language model has used for
UW-III document images. We used the Levenshtein
distance algorithm to check the OCR accuracy of a
generated text file. First of all, we calculate the accu-
racy of the OCRed text output over the original noisy
image and then we again calculate the accuracy over
the same images after removing noise by using our
page frame method.
We also tried to use ABBYY online OCR system
with Fraktur language model but failed to do the OCR
for all the training dataset. Surprisingly, only one
document performed OCR with Fraktur model and
another one document with the combination of En-
glish and German language model over original doc-
ument images. We also used that system after ap-
plying the new page frame method and found that
only two documents generate the OCRed text with
Fraktur and three documents with English and Ger-
man model. The anyOCR system performs well for
Latin typescript documents and it does not contain
the Fraktur language model. So, we could not test
our page frame detection method with this anyOCR
system. Table 1 shows that our page frame method
increases the OCRed text accuracy from 78.98% to
83.47% for historical documents. Similarly, it also
performed well over contemporary document images
by improving the accuracy from 84.75% to 91.44%
(Table 2).
5 CONCLUSION
In this paper, we proposed an advanced page frame
detection method for historical document images.
This mechanism is good enough to remove various
kind of noises from page boundary. For the evaluation
of our implemented page frame detection method, we
used different OCR systems to check the accuracy of
optical character recognition before and after apply-
ing this newly developed page frame approach. We
have used two different datasets to test our mecha-
nism. In both cases, we achieved better accuracy after
applying our page frame method. Moreover, the pro-
posed page frame detection method is able to improve
the OCR accuracy up to 4.49% for the historical and
6.69% for the contemporary document images.
REFERENCES
Bukhari, S. S., Kadi, A., Ayman, J. M., and Dengel, A.
(2017). anyocr: An open-source ocr system for his-
torical archives. In ICDAR.
Bukhari, S. S., Shafait, F., and Breuel, T. M. (2012). Bor-
der noise removal of camera-captured document im-
ages using page frame detection. In Iwamura, M. and
Shafait, F., editors, Camera-Based Document Analysis
and Recognition, pages 126–137, Berlin, Heidelberg.
Springer Berlin Heidelberg.
Canny, J. (1986). A computational approach to edge de-
tection. IEEE Transactions on Pattern Analysis and
Machine Intelligence, 8:679–698.
Cinque, L., Levialdi, S., Lombardi, L., and Tanimoto, S.
(2002). Segmentation of page images having artifacts
of photocopying and scanning. Pattern Recognition,
35(5):1167 – 1177. Handwriting Processing and Ap-
plications.
G. Randall, J. Jakubowicz, R. G. v. G. and Morel, J. (2008).
Lsd: A fast line segment detector with a false detec-
tion control. IEEE Transactions on Pattern Analysis
and Machine Intelligence, 32:722–732.
Gari, A., Khaissidi, G., Mrabti, M., Chenouni, D., and
Yacoubi, M. E. (2017). Skew detection and correc-
tion based on hough transform and harris corners. In
2017 International Conference on Wireless Technolo-
gies, Embedded and Intelligent Systems (WITS), pages
1–4.
Itseez (2014). The OpenCV Reference Manual, 2.4.9.0 edi-
tion.
Itseez (2015). Open source computer vision library.
Phillips, I. T. (1996). UW-III english/technical document
image database manual. User’s Reference Manual for
the UW English/Technical Document Image Database
III.
Reza, M. M., Rakib, M. A., Bukhari, S. S., and Dengel, A.
(2018). A high-performance document image layout
analysis for invoices. In DAS2018.
A Robust Page Frame Detection Method for Complex Historical Document Images
563