3 EXPERIMENTS
For the system verification, we did not use standard
English invoice sets because our application is
specialized to Turkish banking and invoicing system.
However, we built a representative set of invoices on
which we show our results.
We performed a number of experiments to
validate the proposed system. In all the experiments
we used the same weigh parameters for Equation (1)
(β
=1,
=1). In addition, we kept the training and
the testing invoice sets completely different to avoid
any memorization problems of machine learning
algorithms. The training set includes about 320
invoices and the test set includes 80 invoices. The
invoices in the test set are all from different issuer
companies so that can be considered as 80 distinct
classes. For the OCR engine, we used a commercial
product with the same parameter set.
For the performance metric we use Part Match
Scores (PMS) that returns the amount of matching
between a detected part and the annotated part.
=
∗()
(
)
(
)
(5)
=
∗()
(
)
(
)
(6)
where
is the detected part,
is the annotated part.
IA means Intersection Area between
and
. PMS
g
is used for graphic-based parts (company logo etc.)
and PMS
t
is for text-based parts (receiver etc.). NoW
calculates the Number of Words in the given area.
The genetic based algorithm performs mostly
better than the sequential method. For some of the
invoice parts, it improves the results considerably
(e.g., tax office field), while the results for some fields
are slightly worse (e.g., data label)
Although genetic optimization method is four
times slower than the sequential optimization, it
produces an overall performance around 6% better as
shown in Figure 5. Note that the local optimization
results are mostly worse than the other methods as
expected. Note also that these results are comparable
to state of the of art commercial invoice processing
products that use known invoice classes, which
makes our system very promising.
4 CONCLUSIONS
We presented a novel method for invoice parsing. The
proposed method does not use any invoice classes and
each invoice is considered as a new case. We
employed ideas from Part Based Modeling
approaches of general object detection to handle the
high variations between the invoices. The proposed
method can be extended with new part detectors
conveniently without modifying the main
optimization framework. The experiments performed
on the real invoice data show the applicability of the
method for the real life employment. For the future
work, we plan to use a more sophisticated
optimization methods and augment the text field
detectors with image based features to handle OCR
engine problems.
ACKNOWLEDGEMENTS
This work is supported by TUBITAK TEYDEB
project number 3130882.
REFERENCES
L. Hardy, "The Evolution of Participation Banking in
Turkey." Al Nakhlah Online Journal of Soutwest Asia
and Islamic Civilization (2012).
F. Khan, "How ‘Islamic’is Islamic banking?" Journal of
Economic Behavior & Organization 76.3 (2010),
pp.805-820.
"Payment of Supplier’s Due Amounts." Kuwait Finance
House. Kuwait Finance House. Web. 26 Jan. 2015.
<http://www.kfh.com/en/commercial/murabahaa/pay
ment-of-suppliers-due-amounts.aspx>.
E. Sorio, A. Bartoli, G. Davanzo, & E. Medvet, (2010,
September). Open world classification of printed
invoices. In Proceedings of the 10th ACM symposium
on Document engineering (pp. 187-190). ACM.
H. Hamza, Y. Belaïd, & A. Belaïd, (2007). Case-based
reasoning for invoice analysis and recognition. In Case-
Based Reasoning Research and Development (pp. 404-
418). Springer Berlin Heidelberg.
B. Klein, A. R. Dengel, & A. Fordan, (2004). smartFIX: An
adaptive system for document analysis and
understanding. In Reading and Learning (pp. 166-186).
Springer Berlin Heidelberg.
M. A. Fischler, & R. A. Elschlager, The representation and
matching of pictorial structures. IEEE Transactions on
Computers, 22(1) (1973), 67-92.
P. F. Felzenszwalb, D. P. Huttenlocher. "Pictorial structures
for object recognition." International Journal of
Computer Vision 61.1 (2005): 55-79.
B. Forcher, S. Agne, A. Dengel, M. Gillmann, & T. Roth-
Berghofer, "Towards understandable explanations for
document analysis systems." Document Analysis
Systems (DAS), 2012 10th IAPR International
Workshop on. IEEE, 2012.