SIMULTANEOUS RECONSTRUCTION AND RECOGNITION OF
NOISY CHARACTER-LIKE SYMBOLS
L´aszl´o Cz´uni
1
,
´
Agnes Lipovits
2
and D´avid Papp
1
1
Dept. of Electrical Engineering and Information Systems, University of Pannonia, Egyetem street 10., Veszpr´em, Hungary
2
Dept. of Mathematics, University of Pannonia, Egyetem street 10., Veszpr´em, Hungary
Keywords:
Image Reconstruction, Markov Random Field, Optical Character Recognition, Radon Transform
Abstract:
In our article we deal with the simultaneous problem of reconstruction and recognition of binary symbols
loaded with heavy additive noise. We introduce a Markov Random Field (MRF) model where a shape energy
term is responsible to find a solution similar to a tested hypothesis. This way we could increase the precision
of the reconstruction process the only question is how to find out the right hypotheses which helps the recon-
struction the best way. Fortunately the new energy term gives us the answer: the tested hypotheses with the
minimal shape energy component designates the right shape.
1 INTRODUCTION
The reconstruction of binary images from noisy ob-
servations is a common problem in image processing.
In several applications, besides observations, we have
some a priori information about the possible shapes
of the typical objects (such as letters or other well de-
fined symbols) to help the reconstruction. Often we
would also like to recognize the observed symbols,
not possible without prior noise filtering.
In our paper we show an MRF based solution for this
twofold problem: the proposed algorithm makes re-
construction suitable for visual purposes or for further
processing, and it also recognizes the symbols. For
a priori information about the shape of objects, their
Radon transform is stored as simple shape descrip-
tor vectors. The Radon transform is fast to compute,
requires only 1D information to store, and tolerates
some distortion of the original shapes.
2 THE MRF MODEL
Markov Random Field (MRF) is a probability model
based on local characteristics (Geman and Graffigne,
1986). In an MRF, the sites are related to one an-
other via a neighborhood system. A generalized def-
inition of Markov Random Field can be given using
graphs. Let G = (S, E) be an undirected graph where
S = s
1
,s
2
,...,s
N
is a finite set of vertices (sites) of
the graph, and E is the set of edges of the graph. By
definition, two sites of the graph, s
i
and s
j
are neigh-
bors if there is an edge connecting them. Given a
site s, the set of points which are neighbors of s (the
neighborhood of s) is denoted by V
s
. By definition,
V = {V
s
|s ∈S} is a neighborhood system for G if
s /∈V
s
and s ∈V
r
⇔ r ∈V
s
(1)
We assign to each site of the graph a label λ from
a finite set of labels Λ. Such an assignment is called
a configuration, denoted by ω. The set of all possi-
ble configurations is denoted by Ω. The configuration
restricted to a subset T ⊂ S is denoted by ω
T
. The
value given to a site s by the configuration ω is rep-
resented by ω
s
. We assign probability measures to
the set Ω of all possible configurations ω. The local
characteristics of a probability measure P defined on
the set Ω of all possible configurations are the con-
ditional probabilities of the form P(ω
s
|ω
S−s
), that is,
the probability that the site s is assigned the label ω
s
,
given the values at all other sites of the graph. By def-
inition, a probability measure χ is a Markov Random
Field with respect to a neighborhood system V if
∀ω ∈Ω : P(χ = ω) > 0 (2)
∀s ∈ S,∀ω ∈ Ω : P(ω
s
|ω
S−s
) = P(ω
s
|ω
V
) (3)
so that the local characteristics of the probability mea-
sure depend only on the knowledge of the labels at the
neighboring sites.
By definition, C ⊂ S is a clique, if every pair of
points in C are neighbors. We can define a potentialV
as a way to assign a numberV
a
(ω) to every subconfig-
uration ω
A
of a configuration ω. Given the potential
197
Czúni L., Lipovits Á. and Papp D..
SIMULTANEOUS RECONSTRUCTION AND RECOGNITION OF NOISY CHARACTER-LIKE SYMBOLS.
DOI: 10.5220/0003861101970200
In Proceedings of the International Conference on Computer Vision Theory and Applications (VISAPP-2012), pages 197-200
ISBN: 978-989-8565-03-7
Copyright
c
2012 SCITEPRESS (Science and Technology Publications, Lda.)