2013) and (Fraundorfer et al., 2010) use extra sensors
to obtain one or more rotational motion information.
The 1-point algorithm in (Scaramuzza, 2011) and 2-
point algorithm in (Ortin et al., 2001) solve the
relative pose with an assumption of only 3DoF planar
motion. Due to the low computational complexity,
many 1-point and 2-point algorithms are especially
applied for the visual odometry of mobile robots
which are equipped with low computing resources.
The relative pose estimation algorithm cannot
obtain the scale of motion because of the inherent
Epipolar geometry of two-view motion. This fact is
considered as the main drawback of the 2D-2D
feature based motion estimation approaches. To
overcome this issue, 3D-2D correspondences are
often employed in some n-point algorithms, which
are known as the perspective from n points (PnP)
algorithms. The PnP algorithm mostly needs less
number of features than the relative n-point
algorithm. In addition, the metric scale of robot
motion can be obtained without using extra motion
sensors. In conventional PnP algorithms, at least three
3D-2D correspondences are needed for the 6DoF
pose estimation. However, the number of
correspondences can be reduced in visual odometry
with a planar motion constraint.
In this paper, we assume a mobile robot is
restricted in 3DoF planar motion. With this
assumption, we introduce a new 2-point PnP
algorithm to reduce computation complexity and fast
visual odometry. Using only two 3D-2D
correspondences obtained from a RGB-D camera, we
derive a linear equation to solve the 3DoF motion
problem. Figure 1 shows a basic geometrical
constraint of the proposed approach.
2 A NEW 2-POINT ALGORITHM
The main goal of this proposed approach is to find the
rigid-body transformation matrix by employing
perspective projection model. Let us assume that a 3D
point
,,
⊺
in a world coordinate system and
its corresponding 2D point
,
⊺
in the camera’s
image plane
are given. Then the matrix can be found
by minimizing (1):
←argmin
∑‖
‖
,
(1)
where represents the number of 3D-2D feature
correspondences.
In 3D Euclidean space, the rigid-body
transformation matrix has 6 degrees of freedom; 3
for rotation and 3 for translation. If the motion of the
camera is restricted in a planar space, then the degrees
of freedom of can be reduced to 3; 1 for rotation
and 2 for translation. When the camera is moving on
the X-Z plane, the perspective projection matrix
can be defined as:
cos
sin 0
sin
cos
sin
cos
sin 0 cos
,
(2)
where
,
⊺
represents the focal lengths of the
camera in x and y directions, and
,
⊺
represents
the principal point (position of the optical center of
the image). Based on the definition of the perspective
projection matrix of planar motion; the relation
between and can be represented with projection
matrix as follows:
≅
cos
sin 0
sin
cos
sin
cos
sin 0 cos
1
.
(3)
In (3),
,,
⊺
is represented in homogeneous
coordinate system and it satisfies
,,1
⊺
≅
/,/,1
⊺
. To calculate the rigid-body
transformation between the two camera coordinate
systems using least square minimization, (3) is
converted to a linear system and the result
is as follows:
→
0
sin
cos
0
,
1,…,
(4)
where represents the number of 3D-2D
correspondences, and represent 24 and
21 matrices respectively. Since has 4
unknowns, at least two 3D-2D corrspondences (
2) are required to calculate .
The rotation of the planar motion is represented
by two variables; sin,cos in (4). The calculated
values for these two variables using (4) generally do
not satisfy the Pythagorean Theorem: sin
cos
1. Consequently the rotation matrix does
not satisfy the orthonormality. Moreover,
sinandcosare different representations of the
single rotational angle . Therefore it is preferred to
reduce the unknowns of by either removing
sinorcos, or combining sinandcos using
the linear equation.
Let us convert in (4) to ′′′ by
using the following trigonometric functions.
sincos
sin
cossin
cos
,
(5)
VISAPP2015-InternationalConferenceonComputerVisionTheoryandApplications
644