Building upon the FineReader engine, recognition improvements were noticeable
with both fuzzy recognizer and dictionary-based post-processing. The former system
achieved a success rate comparable to that of a mature commercial software package
and is open to further enhancement. The output filters developed can increase the
output trustworthiness, especially if the appropriate dictionary resources are available.
Combining these two systems compatibly has not yet been fully accomplished.
Further work can include the development of an automatic parameter adjustment
system based on measurable properties of the documents being processed, the
definition of better word distance metrics, the introduction of more accurate heuristics
and the development of an ancient word dictionary for improved spell checking.
Acknowledgements
This work was partly supported by: the “Programa de Financiamento Plurianual de
Unidades de I&D (POCTI), do Quadro Comunitário de Apoio III”; the FCT project
POSI/SRI/41201/2001; “Programa do FSE-UE, PRODEP III, no âmbito do III
Quadro Comunitário de apoio”; and program FEDER. We also wish to express our
acknowledgments to the Portuguese Bibioteca Nacional, whose continuous support
has made possible this work.
References
1. R. Buse, Z.Q. Liu, J. Bezdek, “Word Recognition Using Fuzzy Logic”, in IEEE Transactions
on Fuzzy Systems, vol. 10, no. 1, Fev. 2001, pp. 65-76
2. João M.C. Sousa and Uzay Kaymak, "Fuzzy Decision Making in Modeling and Control",
World Scientific, Singapore and UK, Dec. 2002
3. R. Buse, Z.Q. Liu, T. Caelli, “A structural and relational approach to handwritten word
recognition”, in IEEE Trans. Syst., Man, Cybern., vol. 27, no. 25, Oct. 1997, pp 847-861
4. Parker, J.R., “Algorithms for Image Processing and Computer Vision”, John Wiley & Sons,
New York, USA, 1998
5. N. Otsu, "A threshold selection method from gray level histograms", IEEE Transactions on
Systems, Man, and Cybernetics, vol. 9, pp. 62-66, 1979
6. Godfried Toussaint, “Solving Geometric Problems with the Rotating Calipers”, in Proc.
IEEE MELECON'83, 1983, pp. A10.02/1-4
7. James D. Foley et al, “Computer Graphics – Principles and Practice”, Second Edition in C,
Addison-Wesley, Reading, Massachussets, USA, 1990
8. C. L. Hwang, K. Yoon, “Multiple Attribute Decision Making, Methods and Applications, A
State-of-the-Art-Survey”, Springer-Verlag, Berlin, Germany, 1981
9. K. Atkinson, GNU Aspell Homepage, http://aspell.net, GNU Project
10. V. I. Levenshtein, "Binary codes capable of correcting deletions, insertions and reversals",
Soviet Physics Doklady, vol. 6, pp. 707-710, 1966
11. Álvaro Ferreira de Véra, “Orthographia ou modo para escrever certo na lingua Portuguesa”,
17th century, available at Biblioteca Nacional
12. ABBYY FineReader Homepage, http://www.abbyy.com, ABBYY Software House
107