5.2 Evaluation on the Database
We have tested the proposed system on 52 color
images of various invoices and forms in real
situation. Among 52 images we manually found 7
images with some problems when added text crosses
the line of a pre-printed table. Some parts of
characters can be confused with text line at the patch
level. In this case these parts may remain in the
template image. This is the main drawback of the
method. Most errors are justified by the quality of
the document itself where sometimes it can be bad.
We have achieved 86.7 % of correctly aligned
document images.
6 CONCLUSION AND
PERSPECTIVES
We have proposed the first pixel-based image
registration suited for color document images
approach. It is based on the NLM inter-images.
Our proposed approach can align non rigid images
with high accuracy and tolerate spatial distortions.
We have applied the proposed algorithm to the
registration of digitized business documents to their
template. We use this image alignment to separate
the pre-printed forms from the added text. The input
images must have a reduced skew and a small offset
in order to be perfectly aligned at pixel level,
otherwise, we have to increase the radius of the
search windows.
ACKNOWLEDGEMENT
This work is granted by ITESOFT for the project DOD.
REFERENCES
Y.Y. Tang, C.Y. Suen, C.D. Yan, and M. Cheriet,
“Financial Document Processing Based on Staff Line
and Description Language,” IEEE Trans. Systems,
Man, and Cybernetics, vol. 25, no. 5, pp. 738-753,
1995.
R. Casey, D. Ferguson, K. Mohiuddin, and E. Walach,
“Intelligent Forms Processing System,” Machine
Vision and Applications, vol. 5, no. 5, pp. 143-155,
1992.
S.L. Taylor, R. Fritzson, and J.A. Pastor, “Extraction of
Data From Preprinted Forms,” Machine Vision and
Applications, vol. 5, no. 5, pp. 211-222, 1992.
T.M. Ha and H. Bunke, “Model-Based Analysis and
Understanding of Check Forms,” Int’l J. Pattern
Recognition and Artificial Intelligence, vol. 8, no. 5,
pp. 1,053-1,081, 1994.
H. Peng, F. Long etc. "Document Image Recognition
Based on Template Matching of Component Block
Projections", IEEE Trans. on Pattern Analysis and
Machine Intelligence, 2003, 25(9):1188-1192.
Z. Yang, S. Cohen. ''Image registration and object
recognition using affine invariants and convex hulls'',
IEEE Trans Image Process. 1999;8(7):934-46. doi:
10.1109/83.772236.
D. H. Ballard, "Generalizing the Hough Transform to
Detect Arbitrary Shapes," Pattern Recognition, 13, No.
2, 1981, pp111-122.
Aitken CL, et al. "Tumor localization and image
registration of F-18 FDG coincidence detection scans
with computed tomographic scans". Clin Nucl Med.
2002 Apr; 27(4):275-82.
A. Rastogi and S. N. Srihari, "Recognizing textual blocks
in document images using the Hough transform," TR
86-01, Dept. of Computer Science, SUNY Buffalo,
NY, Jan 1986.
Garris, M.D., Grother, P.J, ''Generalized form registration
using structure-based techniques'', Proceedings of the
Fifth Annual Symposium on Document Analysis and
Information Retrieval, pp.321–334 (1996).
S. C. Hinds, J. L. Fisher and D. P. D'Amato, "A Document
Skew Detection Method Using Run-Length Encoding
and the Hough Transform," 10th International
Conference on Pattern Recognition, vol. 1, pp.464-
468, 1990.
F. Cesarini, M. Gori, S. Marinai, and G. Soda,
“INFORMys: A Flexible Invoice-Like Form-Reader
System,” IEEE Trans. Pattern Analysis and Machine
Intelligence, vol. 20, no. 7, pp. 710-745, July 1998.
D.P. Lopresti, “String Techniques for Detecting
Duplicates in Document Databases,” Int’l J. Document
Analysis and Recognition, vol. 2, no. 4, pp. 186-199,
2000.
L., Tseng, and R., Chen, "The recognition of form
documents based on three types of line segments,"
Proc of 4th Int Conf on Document Analysis and
Recognition, 1, pp.71-75, 1997.
K., Fan, and M., Chang, "Form document identification
using line structure based features," Proc of 14th Int
Conf on Pattern Recognition, 2, pp.1098-1100, 1998.
T., Watanabe, and X., Huang, "Automatic acquisition of
layout knowledge for understanding business cards,"
Proc of 4th Int Conf on Document Analysis and
Recognition, 1, pp.216-220, 1997.
R. Safari, N.N.et al, "Document registration using
projective geometry," IEEE Trans on Image
Processing, 6(9), pp.1337-1341, 1997.
H. Peng, F. Long, Z. Chi, D. Feng, and W. Siu,
“Document Image Matching Based on Component
Blocks,” Proc. Int’l Conf. Image Processing, pp. 601-
604, Sept. 2000.
J. Hu, R. Kashi, and G.Wilfong, “Document Image
Layout Comparison and Classification,” Proc. Sixth
VISAPP2015-InternationalConferenceonComputerVisionTheoryandApplications
172