by image registration, where an image represents the
target and another image is the search space. This
problem has been studied at different ways, but it is
difficult to find a robust and accuracy scheme and it
depends of the target representation and the condi-
tions to localize it. Image registration is one of the
fundamental tasks in vision systems. However, this
is not an easy topic because there are several factors
that affect performance of the vision systems as sen-
sor noise, different views of the observer, motion per-
turbations, changes on objects state by motion and at-
mospheric and illumination conditions.
Image registration can be defined as mapping
between images, considering spatial and intensity
differences. Let be I
1
and I
2
images related by
I
2
(x, y)=g(I
1
(f(x, y))) (1)
where f is a spatial transformation between two coor-
dinates and g is a radiometric transformation. Image
registration is defined as estimation of geometric and
radiometric transformations, such that two images
could be compared by detections of coincidences. It
is important to realize that if the number of parame-
ters that define the relation between two images of
the same object is increased, then the complexity of
searching is increased too.
Image registration depends of three elements
(Brown, 1992; Zitova, 2003):
• Characteristic space. In image registration, it is
important to determinate which set of characteris-
tics defines the best representation of image. Selec-
tion is affected by different factors and conditions,
such as quantity of obtained information, sensiti-
vity to properties of sensor and scene and compu-
tational cost. Sometimes, a characteristic space is
created with intensity levels at pixels or a transfor-
mation on them, as FFT. Another schemes are de-
fined by structural features (borders, contorns, in-
terest points, centroids) or texture properties (con-
trast, homogeneity, correlation).
• Similitude measure. This measure identifies the
compatibility degree between two images. Simi-
litude metric is used for finding the required para-
meters in a mapping between related images. Some
of the most used similitude measures are cross-
correlation, sum of absolute difference, sum of
square difference and phase correlation. Further-
more, it is posible to use methods more complex as
bayesian detectors and neural networks. According
with application, if characteristics space and simi-
litude metric are correctly selected, then it is pos-
sible to ignore some non-relevant distortions for a
correct matching.
• Search strategy. In case where only displacement is
required, it is sufficient with a sequential search to
determine mapping. However, if mapping requires
more parameters, searching must be more com-
plex. Some techniques used are hierarchy search,
relax labeling, dynamic programming and heuris-
tic search. The number of parameters that defines
the mapping and computational cost are the most
important factors to determine the search strategy.
2.2 Visual Tracking of Objects
Visualtracking is used in a wide range of applications,
but there is not an algorithm to be used in whatever
conditions. In general, tracking methods can be sepa-
red in two groups: tracking based on motion detection
and tracking based on models.
Tracking based on motion detection uses detection
algorithms as optical flow, gaussian mixture or image
difference. This approach has a good performance
and it is posible to work on no-rigid objects. However,
these schemes do not use a target model, they are sen-
sible to false detections and tend to loss tracking on
target, if displacements are very small.
Tracking based on models uses image registration
for detection because it is defined a target model.
These techniques are more robust and it is possible
to use image analysis more complex to obtain mea-
surements. This approach has major computational
cost, so deformation of objects must be considered.
However, information about behavior could help to
enhance efficacy (Hong, 2002).
In (Comaniciu, 2003), an object model is crea-
ted with a probability distribution function pdf from
histogram. The target position is defined by Bhat-
tacharyya coefficient as similitude metric. The search
strategy begins on previous position of target and it is
guided by a derivative kernel. This approach is fast,
an exhaustive search is not required and subpixel mea-
surement is obtained. But, size changes are not sup-
ported.
In (Son, 2002), a correlation window is adapted to
size changes of targets. Using temporal and spatial
gradient between two consecutive images, the occu-
pation ratio in window is obtained. If ratio is less then
window is decreased one pixel, otherwise the window
is increased. In (Chien, 2000), correlation window is
adapted by modeling with motion vectors. Direction
and magnitude of contraction or expansion depend on
simple image processing. This approach is useful if
displacement are small.
Probabilistic approaches have been suitable solu-
tions to above presented problems, such as adapta-
bility in tracking based on target behavior. In (Ras-
mussen, 2001), it is proposed to take advantage of
a set of ramdom samples around prediction of target
geometric parameters and to use a pdf as similitude
measure. The evaluation of the samples set defines the
measurement process, where samples with low prob-
SUBPIXEL VISUAL TRACKING BASED ON ADAPTIVE STRATEGIES
287