ric model. We present an automatic hybrid matching
approach with uncalibrated images using the hybrid
epipolar geometry to establish a relation between om-
nidirectional and perspective images.
2 HYBRID IMAGE MATCHING
USING EPIPOLAR GEOMETRY
To deal with the problem of the robust hybrid match-
ing a common strategy is to establish a geometrical re-
lation between the views of the 3D scene. We have se-
lected a strategy that does not require any information
about the mirror. A geometrical approach which en-
capsulates the projective geometry between two views
is used. Epipolar geometry (EG) is the intrinsic pro-
jective geometry between two views. It is indepen-
dent of the scene structure, and only depends on the
cameras’ internal parameters and relative pose (Hart-
ley and Zisserman, 2000). This approach needs pairs
of putative corresponding points between the views.
In this work we use the SIFT descriptor (Lowe, 2004).
Sturm proposes a hybrid epipolar geometry, where
a point in the perspective image is mapped to its corre-
sponding epipolar conic in the omnidirectional image.
Recently Barreto and Daniliidis (Barreto and Dani-
ilidis, 2006) have exposed a more general scheme
where they compare the mixture of central cameras,
including pin-hole, hyperbolic and parabolic mirrors
in catadioptric systems and perspective cameras with
radial distortion.
The fundamental matrix F encapsulates the epipo-
lar geometry. The dimension of this matrix depends
on the image types we want to match. In the hybrid
case we have two options, a 4 × 3 matrix in the case of
para-catadioptric and perspective cameras or 6 × 3 in
the case of hyperbolic mirror and perspective cameras
in a catadioptric system. In (Barreto and Daniilidis,
2006) for this last case a 6 × 6 matrix is considered,
which can result in a very difficult corresponding es-
timation problem.
2.1 EG with Perspective and
Catadioptric Cameras
In general the relation between omnidirectional and
perspective images with the fundamental matrix can
be established by
ˆ
q
T
c
F
cp
q
p
= 0 (1)
subscripts p and c denote perspective and catadioptric
respectively.
From Eq.1 with known corresponding points bet-
ween the two images, we can derive the hybrid fun-
damental matrix. Points in the perspective image are
defined in common homogeneous coordinates. Points
in the omnidirectional image are defined depending
on the shape of the epipolar conic. The general repre-
sentation for any shape of epipolar conic is a 6-vector.
A special case where the shape of the conic is a circle
the coordinate vector has four elements. These repre-
sentations are called the “lifted coordinates” of a point
in the omnidirectional image.
In the hybrid epipolar geometry points in the
perspective image are mapped to its corresponding
epipolar conic in the omnidirectional image. Conics
can be represented in homogeneous coordinates as the
product
ˆ
q
T
c = 0, where
ˆ
q
T
represents the lifted coor-
dinates of the omnifirectional point q. In this work
we have two representations for this point, one of
them is the general homogeneus form of a conic, a
6-vector
ˆ
q = (q
2
1
, q
2
2
, q
2
3
, q
1
q
2
, q
1
q
3
, q
2
q
3
)
T
. The other
one constraints the shape of the conic to be a circle
ˆ
q = (q
2
1
+ q
2
2
, q
1
q
3
, q
2
q
3
, q
2
3
)
T
. These representations
are called the “lifted coordinates” of the omnidirec-
tional point q.
If the point in the omnidirectional image is repre-
sented with a 6-vector lifted coordinates
ˆ
q, the fun-
damental matrix is 6 × 3 (F63) in such a way that
c ∼ F
cp
q
p
. When the 4-vector lifted coordinates is
used the fundamental matrix is 4 × 3 and the conic
(circle) is obtained by the same product c ∼ F
cp
q
p
.
2.2 Computation of the Hybrid
Fundamental Matrix
The algorithm used to compute the fundamental ma-
trix is similar to the 8-point algorithm (Hartley and
Zisserman, 2000) for the purely perspective case, with
the difference that the points in the omnidirectional
images are given in lifted coordinates.
The automatic computation of the fundamental
matrix is summarized as follows:
1. Initial Matching. Scale invariant features are ex-
tracted in each image and matched based on their
intensity neighborhood.
2. RANSAC Robust Estimation. Repeat for n sam-
ples, where n is determined adaptively:
(a) Select a random sample of k corresponding
points, where k depends on what model we are
using (if F43, k = 11 or if F63, k = 17)
1
. Com-
pute the hybrid fundamental matrix F
cp
as de-
scribed above.
1
Matrices are up to scale, so we need the number of ele-
ments of the matrix minus one corresponding points.
ICINCO 2008 - International Conference on Informatics in Control, Automation and Robotics
126