well as the figures of merit we are going to use in
order to fairly compare the competing algorithms.
4.1 Experimental Setup
The experimental setup is described analytically in
(Baker and Matthews, 2004). In brief, we have an
input image I
0
and we crop a rectangular area of the
image. By adding an appropriate translation in the
coordinates of the points corresponding to the corners
of the cropped image and adding Gaussian noise with
standard deviation σ
p
we perturb them. The four ini-
tial points, and their warped versions defines the pa-
rameter vector of the projective transformation. U-
sing these values, we map all target points and warp
I
0
to create a reference image I
r
. The competing algo-
rithms then are applied for the alignment of I
r
with I
0
.
In order to create a reference image I
r
in the second
set of our experiments, we follow a similar procedure.
Note though that in this case we select three points in-
stead of four, from the rectangular cropped area (i.e.
top left and right corner and bottom middle point) and
use them in order to define the six parameters of the
affine transform.
In order to measure the quality of the estimated
parameters we use the Mean Square Distance (MSD)
between the exact warped version of the four (three)
points and their estimated counterparts. More for-
mally, if we denote as p
r
the parameter vector which
we use in defining the reference profile I
r
, p
j
the cur-
rent estimation of the corresponding algorithm at j-th
iteration and C the set of the four(three) above me-
ntioned points, we use the mean of the following se-
quence
e( j) =
1
8(6)
∑
x∈C
||T(x;p
r
) − T(x;p
j
)||
2
. (16)
Each element of the mean sequence (i.e. for a spe-
cific value of iteration index j) is obtained by aver-
aging over a large number of image pairs that dif-
fer in the noise realization, and captures the learn-
ing ability of the algorithms (average rate of con-
vergence (Baker and Matthews, 2004)). However,
in order to not present biased results, we compute
the above mentioned mean sequence for those rea-
lizations where both algorithms have converged. The
convergence criterion is that the square distance e( j)
at a prescribed maximal iteration j
max
is below a ce-
rtain threshold T
MSD
, that is e( j
max
) ≤ T
MSD
.
As a second figure of merit we use the percentage
of converging (PoC) runs (frequency of convergence
(Baker and Matthews, 2004)). This quantity is the
percentage of runs that converge up to maximal iter-
ation j
max
, based again on the above mentioned con-
vergence criterion. PoC is depicted as a function of
the point deviation σ
p
, the most important factor that
affects the performance of both algorithms.
Since it is natural to prefer an algorithm that con-
verges quickly with high probability, we propose a
third figure of merit that captures exactly this point
(Evangelidis and Psarakis, 2007). In other words we
propose the generation of a histogram depicting the
probability of successful convergence at each itera-
tion. Specifically a run of an algorithm on an im-
age pair realization will be considered as having con-
verged at iteration n when the squared error e( j) goes
below the threshold T
MSD
for the first time at iteration
j = n. It is clear that we prefer a histogram to be con-
centrated over mostly small iteration-numbers.
In all experiments that follow we use T
MSD
=
1 pixel
2
.
4.2 Minimal Case
In this subsection we present the results we obtained
from the first set of experiments we have conducted.
As it is above described, in this case we create the
reference profile by using a projective transform and
we model the warping process by using a transforma-
tion of the same class.
4.2.1 Experiment I
In the first experiment, the alignment algorithms try
to compensate only the geometric distortions since
this is the only that has been applied to images.
Specifically, we use the “Takeo” image (Baker and
Matthews, 2004) as input image and we create 500
different reference profiles for each integer values of
σ
p
in the range [1, 10]. For each one of the 500 rea-
lizations, we permit the algorithms to make 15 itera-
tions (j
max
= 15). Since no intensity noise or photo-
metric distortion is applied to image, we expect MSD
to reach very low levels which cannot be zero due to
finite precision arithmetic.
Figure 1 depicts the relative performance of the
two algorithms. As we mentioned above, we present
the arithmetic mean of the sequence e( j) for those
realizations where both algorithms have converged.
Three cases are investigated; (a) σ
p
= 2, (b) σ
p
= 6
and (c) σ
p
= 10. In all these cases our algorithm ex-
hibits a significantly smaller MSD which is order(s)
of magnitude better than the one obtained by the LK
scheme. Furthermore concerning the PoC, as we can
see from Figure 1.(d), our algorithm exhibits better
performance for all values of σ
p
. Specifically for
strong deformations (σ
p
= 10) the improvement can
become quite significant (18%). As far as the proba-
bility of successful convergence is concerned, we ap-
plied the algorithms for a maximal number of 100 it-
erations (j
max
= 100). In Figure 2 the resulting graphs
PROJECTIVE IMAGE ALIGNMENT BY USING PROJECTIVE IMAGE ALIGNMENT
417