4 Concluding Remarks and Future Work
Bernoulli HMMs have been proposed for off-line handwriting recognition in order to
directly model text image data in binary form. Empirical results have reported on two
tasks of off-line handwritten word recognition: Arabic subwords from CENPARMI cor-
pus, and English words from IAM database. In both cases each word (subword) has
been modelled with one HMM, and only the required preprocess to obtain binary im-
ages of same height, has been done. Feature vectors of different sizes, as well as HMMs
with different number of states have been tested. The results on the Arabic subwords
task are promising. In the case of the IAM words, the results were very similar to those
obtained using HMMs with one Gaussian per state.
Ongoing work is focused on the use of Bernoulli HMMs at subword (character)
level and extend them by using Bernoulli mixtures instead of single Bernoulli probabil-
ity functions in each state. A first step is to study the optimal number of states, training
iterations and Bernoulli components, as was done in [3] for the case of Gaussian com-
ponents. Then, we also plan to include explicit modelling of invariances in Bernoulli
components. In addition we plan to compare the results with other recognisers, mainly
with Gaussian HMMs recognisers.
References
1. Plamondon, R., Srihari, S.N.: On-Line and Off-Line Handwriting Recognition: A Compre-
hensive Survey. IEEE Trans. on PAMI 22 (2000) 63–84
2. Toselli, A.H., Juan, A., Keysers, D., Gonz´alez, J., Salvador, I., H. Ney, Vidal, E., Casacu-
berta, F.: Integrated Handwriting Recognition and Interpretation using Finite-State Models.
International Journal of Pattern Recognition and Artificial Intelligence 18 (2004) 519–539
3. G¨unter, S., Bunke, H.: HMM-based handwritten word recognition: on the optimization of
the number of states, training iterations and Gaussian components. Pattern Recognition 37
(2004) 2069–2079
4. Xue, H., Govindaraju, V.: Hidden Markov Models Combining Discrete Symbols and Con-
tinuous Attributes in Handwriting Recognition. IEEE Trans. on PAMI 28 (2006) 458–462
5. Rabiner, L., Juang, B.H.: Fundamentals of speech recognition. Prentice-Hall (1993)
6. Young, S., et al.: The HTK Book. Cambridge University Engineering Department (1995)
7. Al-Ohali, Y., Cheriet, M., Suen, C.: Databases for recognition of handwritten Arabic
cheques. Pattern Recognition 36 (2004) 111–121
8. Marti, U., Bunke, H.: The IAM-database: an English sentence database for off-line hand-
writing recognition. Int. J. Doc. Anal. Recogn. (2002) 39–46
9. Zimmermann, M., Bunke, H.: Automatic segmentation of the IAM off-line database for
handwritten English text. In: Proc. of the ICPR 2002. Volume 4., Quebec (Canada) (2002)
35–39
10. Johansson, S., Leech, G., Goodluck, H.: Manual of information to accompany the Lancaster-
Oslo/Bergen Corpus of British English, for use with digital Computers. Department of En-
glish, University of Oslo, Norway. (1978)
11. Romero, V., Gim´enez, A., Juan, A.: Explicit Modelling of Invariances in Bernoulli Mixtures
for Binary Images. In: Proc. of the 3rd Iberian Conference on Pattern Recognition and Im-
age Analysis (IbPRIA 2007). Volume 4477 of Lecture Notes in Computer Science., Girona
(Spain), Springer-Verlag (2007) 539–546