3.2 Constraints on Camera Parameters
In this section, we briefly consider the uniqueness of
the minimum of (19). If the point correspondences
{m
i
,m
′
i
} are exact and consistent with the camera
model P , the minimum value of (19) is 0. How-
ever, it is not self-evident whether this minimum value
is attained at finitely many points in the parameter
space. It is clear that the solution is not unique in
the strict sense since there are four possible solutions
for the motion parameters when E is given up to sign
(Hartley and Zisserman, 2003). In addition, it is well
known that for perspective cameras the constraint of
constant internal parameters is not sufficient for self-
calibration in the two-view case (Hartley and Zisser-
man, 2003). Hence, additional constraints are needed
and here we assume that the values of parameters s
and γ in (5) are known. In particular, the values s= 0
and γ= 1 were used in all our experiments since they
are the correct values for most digital cameras which
have zero skew and square pixels.
3.3 Robustness for Outliers
In practice, the tentative point correspondences
{m
i
,m
′
i
} may contain false matches which can easily
deteriorate the calibration. However, in such cases the
algorithm of Section 3.1 can be used together with the
RANSAC algorithm to provide robustness for false
matches (Hartley and Zisserman, 2003). In detail,
given n correspondences in total, one may randomly
select subsets of p correspondences, p ≪ n, and es-
timate the camera parameters for each subset by the
generic algorithm (the step (v) in the algorithm may
be omitted here for efficiency). Thereafter the esti-
mate which has most inliers according to error (18)
is refined using all the inliers. The value p = 15 was
used in our experiments and the RANSAC algorithm
was implemented following the guidelines in (Hartley
and Zisserman, 2003).
3.4 Three Views
The calibration algorithm described in Section 3.1 ex-
tends straightforwardly to the three-view case. Using
correspondences over three views instead of only two
views increases the stability of the self-calibration. In
addition, the constraints for camera parameters, dis-
cussed in Section 3.2, may be relaxed in the three-
view case if necessary.
The details of the three-viewcalibration procedure
are as follows. Given the point correspondences and
an initial guess for the internal camera parameters,
one may estimate the essential matrix for a pair of
views in the same manner as in the two-view case.
However, now there are three different view pairs and
each pair has its own essential matrix. Our aim is
to minimize the total angular error which is obtained
by summing together the cost functions (19) for each
view pair. The minimization is carried out in a sim-
ilar manner as in the two-view case. First, we mini-
mize the total angular error over the internal camera
parameters (we use the eight point algorithm to com-
pute each essential matrix independently of one an-
other). Thereafter we initialize the external camera
parameters using the estimated essential matrices and
minimize the total angular error over all the camera
parameters.
The three-viewapproachdescribed abovedoes not
require that the point correspondences extend over all
the three views. It is sufficient that there is a set of
two-view correspondences for each view pair. How-
ever, in the case of real data which may contain out-
liers it is most straightforward to use three-view cor-
respondences in the RANSAC framework.
4 EXPERIMENTS
4.1 Synthetic Data
In the first experiment we simulated self-calibration
using randomtwo-viewand three-viewconfigurations
with synthetic data. We used a data set consist-
ing of points uniformly distributed into the volume
[−5,5]
3
\[−2,2]
3
defined by the cubes [−5,5]
3
and
[−2,2]
3
, i.e., there were no points inside the smaller
cube where the cameras were positioned. The first
camera was placed at the origin and the second and
third camera were randomly positioned so that their
distances from the origin were between 1 and 2. In the
three-view case it was additionally required that the
distance between the second and third camera was at
least 1. The orientation of the cameras was such that
at least 40% of the points observed by the first cam-
era were within the field of view of the other cameras.
For each such configuration the points were viewed
by five cameras obeying projections (6)-(10) and the
observed image points were perturbed by a Gaussian
noise with a standard deviation of one pixel. The
true values of the camera parameters were f = 800,
u
0
=500, v
0
=500 for all the five cameras. The maxi-
mum value of the view angle θ was 60 degrees for the
perspective camera, 80 degrees for the orthographic
camera and 90 degrees for the others.
We self-calibrated each of the above five cameras
from varying number of point correspondences us-
VISAPP 2008 - International Conference on Computer Vision Theory and Applications
32