Research on Transmission Light and Recognition Algorithms of
Invoice Check Code
Jintao Zhang
1, a
, Jianyi Kong
1, b
, Xingdong Wang
1, c
, Wei Sun
1, d, *
1
Key Laboratory of Metallurgical Equipment and Control, Ministry of Education, Wuhan University of Science and
Technology, Hubei Wuhan 430081, China
Keywords: Invoice recognition; Reverse transmission illumination; double threshold segmentation; machine vision.
Abstract: Invoice checking code is one of the key factors affecting invoice reimbursement, but some invoice checking
codes are covered by red seal, resulting in lack of information. In order to solve the identification problem
of defect invoice check code, in this paper, the color distribution in the check code region and the
interaction between color and light are studied, and the scheme of invoice monochrome light reverse
transmission is formulated. By analyzing the gray histogram of R, G and B three-channel images, a three-
channel weighted graying method for invoice images is proposed. After locating the region by vertical
horizontal projection of the check codes, the binarization of the check codes is realized by double threshold
segmentation, and the single character segmentation is obtained by vertical projection. Finally, character
recognition is carried out by template matching method. The experimental results show that the above
method can eliminate the influence of red seals and improve the accuracy of the identification of check
codes for defective invoices.
1 INTRODUCTION
Invoice Check Code is one of the important bases
for checking the authenticity of invoices. In the
process of invoicing, various uncertainties lead to
various degrees of corruption of invoice check codes
(as shown in Figure 1, the check codes are covered
by red seals). It is impossible for businesses to
distinguish the authenticity of invoices and bring
great difficulties to invoice reimbursement.
Therefore, the research on automatic recognition of
check codes covered by red seals will help to reduce
the investment of manpower and material resources
(Sonka M, Hlavác V, Boyle R, 2014).
In order to solve the problem of character
recognition in complex situations such as character
being covered by seals, some scholars have studied
it at present.To solve the problem of image
decolorization, H. Du et al. (Du H , He S , Sheng B ,
et al, 2015). Proposed a color-gray conversion
method based on regional saliency model. S. Liu
(Liu, SF, 2017) and others proposed to use Gabor
filter and S Obel operator to extract features first,
then K-means algorithm to segment regions, and
compare the characters to be recognized with
standard fonts. Finally, two character recognition
parameters, stroke cross-section and energy density,
were designed to increase recognition. Other
adaptability and robustness. (A Namane et al, 2010).
Proposed the method of using complementary
similarity measure (CSN) as classifier and feature
extractor to recognize degraded characters, which
enhanced the recognition ability of characters with
poor print quality. For non-uniform illumination
image, (Vo G et al, 2016). Proposed a robust matrix
decomposition method to solve the problem of
robust regression for binarization of images in strong
noise inhomogeneous background. This method
automatically segments binary images into
foreground and background regions in the case of
high observation noise level and uneven background
intensity. The experimental results show that the
method is more robust to high image noise and
uneven background. In order to improve the
efficiency of binarization, (Soua M et al, 2014)
proposed a hybrid binarization Keymens method
(HBK) parallel to Nvidia GTX 660 graphics
processing unit (GPU) (Soua et al. at the
International Symposium on Communication,
Control and Signal Processing, 2014). Our
implementation combines fine-grained and coarse-