three elements (n = 3): column, row (pixel coordi-
nates) and gray level of reference image at these co-
ordinates. The gray level of the reference image has
been selected as an element of the observation vec-
tor since it is the observation that we want to match
with the given gray level in the test image using the
BCA. The spatial coordinates have also been selected
as part of the observations, since inaccuracy in their
measurement can happen, because of the image ac-
quisition process.
In order to calculate the matrices A
i
, B
i
and E
i
(see
equation 3), the partial derivatives of the function F
i
with respect to the parameters and with respect to ob-
servations must be worked out.
For instance, using affine motion, the terms B
i
, A
i
and E
i
are expressed as follows:
B
i
=
I
1
x
− a
1
I
2
x
− a
2
I
2
y
, I
1
y
− b
1
I
2
x
− b
2
I
2
y
, 1.0
(1×3)
A
i
=
−x
i
I
2
x
, −y
i
I
2
x
, −I
2
x
, −x
i
I
2
y
, −y
i
I
2
y
, −I
2
y
(1×6)
E
i
= −
I
1
(x
i
, y
i
) − I
2
(x
0
i
, y
0
i
)
(1×1)
(5)
where I
1
x
, I
1
y
, I
2
x
and I
2
y
have been introduced to sim-
plify notation as: I
1
x
= I
1
x
(x
i
, y
i
), I
1
y
= I
1
y
(x
i
, y
i
), I
2
x
=
I
2
x
(x
0
i
, y
0
i
) and I
2
y
= I
2
y
(x
0
i
, y
0
i
), being I
1
x
(x
i
, y
i
), I
1
y
(x
i
, y
i
),
the gradients of the reference image and I
2
x
(x
0
i
, y
0
i
) and
I
2
y
(x
0
i
, y
0
i
) the gradients of the test image.
3 ALGORITHM PROPOSED
Our proposed algorithm can be summarized in these
four sequential steps:
1. Detection and description of interest points:
The SIFT technique is applied for detecting and
performing the description of the points of inter-
est in both images.
2. Matching of interest points: For each interest
point belonging to the first image a K-NN search
strategy is performed to find the k-closest inter-
est points at the second image. At the end of this
process, a set of point pairs is obtained.
3. Estimate first approximation using random
sampling: For estimating the first approximation
of the motion parameters a random sampling tech-
niques is used to determine a good initial solution.
4. Final motion estimation using GLS: The GLS
motion estimator is applied using as observations
all the pixels into the overlapped area in order to
move to more accurate solution. The process is
finished when ∆χ is close to 0, which is usually
fulfilled in a few iterations.
4 EXPERIMENTS
In order to test our approach in Image Registration
problems, a set of challenging sets of image pairs
have been selected. They can be downloaded from
Oxford’s Visual Geometry Group web page
2
. They
present five types of changes between images in 8
different sets of images: Blur: bikes and tree sets,
illumination: leuven set, jpg compresion: ubc set,
zoom+rotation: bark and boat sets, and viewpoint:
graf and wall sets. To check the accuracy of the regis-
tration, the normalized correlation coefficient (NCC)
similarity measure has been calculated using the pix-
els of the overlapped area of both images. The NCC
gives values from −1.0 (low similarity) to 1.0 (high
similarity). The NCC is expressed as follows, with
µ
1
,µ
2
being the average of the gray level of both im-
ages and ℜ the overlapped area:
NCC(I
1
, I
2
) =
∑
(x
i
,y
i
)∈ℜ
[(I
1
− µ
1
)(I
2
− µ
2
)]
q
∑
(x
i
,y
i
)∈ℜ
(I
1
− µ
1
)
2
∑
(x
i
,y
i
)∈ℜ
(I
2
− µ
2
)
2
(6)
I
1
and I
2
have been introduced to simplify notation as:
I
1
= I
1
(x
i
, y
i
), I
2
= I
2
(x
0
i
, y
0
i
)
We have focused on showing the results when
solving challenging situations like the zoom+rotation
and viewpoint sets of pair of images, particularly on
boat, bark, graf and wall sets. The affine motion
model has been used for images from sets bark and
boat since there is not a viewpoint change. The main
difficulty of this set is the presence of large rotations
and changes of scale. The presence of moderate and
large viewpoint changes forces to use the projective
motion model instead of the affine one for images
from graf and wall sets.
Table 1 shows the average NCC calculated for the
experiments performed with the images belonging to
each set, after initial estimation and after final GLS
estimation. In general, the feature-based technique
provides a good but not excellent (in terms of ac-
curacy) initial estimation of the motion parameters,
which are accurately improved after the GLS estima-
tion.
Figure 1 and 2 shows the results of the registration
process obtained for some of the most difficult pairs
from the four studied sets. The discontinuous white
line mark the boundary of the reference image (i.e the
first image of the pair).
In general the proposed method is able to regis-
ter all the images from bark and boat sets, but suffers
2
http://www.robots.ox.ac.uk/ vgg/research/affine/
index.html
VISAPP 2007 - International Conference on Computer Vision Theory and Applications
388