sequence (simply put: one-word commands) belongs
to a particular word. For each word in the lexicon, an
HMM is (i.e. for each word to be recognized) created.
Once an input value (word) is present (observation),
this is quasi compared with each HMM. The best
result represents the recognized word. Verification
with HMM is a statistical method. Determining the
degree of match between the test sample and the
reference as a probability.
2-piece stochastic process:
First Part: State machine: Describes probabilities of
the transitions of the states.
Second Part: modeling the output of the font pattern
when entering the respective state output probability
distributions.
Allocation of the state to the magazine segment.
Feature of the chain is determined by the
characteristics of the segments.
Training: Baum-Welch algorithm (optimization of
HMM)
Improvement of a given model in response to certain
sample data (training pattern) in such a way that the
optimized model generates the training set with equal
or greater probability.
Decoding: Viterbi algorithm
Calculation of the optimal path through the state
sequence (Viterbi path). So the maximum probability
of generating the observation sequence.
Is used to "discharging" of the hidden state
sequence, that generates a maximum likelihood, a
valid sequence of outputs, is given by the model.
3 RESULTS
Description of the database:
To make recognition systems comparable, this system
must come on a standardized database to the
application. In the case of the Old German
handwriting ( Kurrent ) this is not possible, since such
a database does not exist. It had to be redesigned a
database. This database consists of approximately
6000 samples. This is made up of collections of 6
writers, here also some of Alexander von Humboldt.
Characteristic lines and characteristic features of the
writing thickness allow conclusions on the writer.
First, the test was carried out with 110 test images.
The recognition rate was 0.4234. In order to present
an apparent delta, more 440 test images were
deducted from the training data.
Then the model 11 and the model features 9
characteristics was examined. The window setting
was w8X4 .
The term dimension in this context means: features or
characteristics.
Table 1: Without correction (Slope, Slant, Scale).
In the model with two additional features (12
dimensions), the difference is clear. At 550 test
images, the recognition rate was increased by
approximately 0.1!
Table 2: Without correction (Slant).
Testimages 10-Dimensions 12-Dimensions
110 0.5834 0.6038
550 0.5195 0.6284
Table 3: With correction (Slope, Slant, Scale).
Testimages 10-Dimensions 12-Dimensions
110 0.5636 0.554545
550 0.549091 0.516364
1100 0.534545 0.495455
4 CONCLUSIONS
Other Windows settings are investigated.
(e.g. w14X13).
In order to improve the recognition result, the number
of samples must be increased noticeably. In this
publication should be shown merely, that through an
extension of the feature extractions, here specific for
Old German handwriting (Kurrent), an improvement
of the recognition is reached.
REFERENCES
Fink, G. A., 2008. Markov Models for Pattern Recognition,
From Theory to Applications. Springer, Heidelberg.
Wienecke, M., Fink, G. A., and Sagerer, G., 2005. Toward
Automatic Video-based Whiteboard Reading. Int.
Journal on Document Analysis and Recognition, vol.
Testimages 10-Dimensions 12-Dimensions
110 0.4234 0.4545
550 0.327 0.425
ICPRAM 2017 - 6th International Conference on Pattern Recognition Applications and Methods