model is determined with RANSAC and used in
obstacle detection. They use an offline learning
method for finding the paths and the extended road
surface.
The related works are mainly focused on VO
based localization. Their methods have problems
with inliers selection in dynamic environment and
with visual information loss in fast changing
situations.
In this paper, for localization, an ego-motion
model based special data filter method is proposed
for selecting as inliers only the static features from
the dynamic environment and for overcoming visual
information loss in fast changing situations.
The 3D images are registered along with position
estimated by VO and ego-motion model in order to
build 3D map. The 3D map also consists of 3D road
surface. Especially road surface extraction is a tough
task for vision.
There are two major problems for the road
surface extraction.
1) The road surface extraction is strongly influenced
by non-uniform illumination, variant road surface
textures, and road surface conditions. Many
researchers have proposed methods of road surface
determination (Guo, Sofman and Dahlkamp, 2006
).
However there are still open issues in terms of
accuracy and time cost efficiency.
2) The road surface, many times, consists of
homogeneous textures. It means that there are no 3D
reconstructed points on the road surface.
In this paper, simple X-Y (Front-view)
projection method is proposed for road surface
extraction, and accumulated dense map (temporal
filter) is used for obtaining 3D positions of road
surface. The method of accumulated dense map
along with robot path provides rich 3D information
even though features of road surface are not
textured.
This paper consists of three main sections. The
localization is presented in the section 2.The map
building is presented in the section 3. The
experiments and their results are presented in the
section 4.
The conclusions highlight the achievements of
the work.
2 PROPOSED LOCALIZATION
2.1 Pre-processing for Selecting Good
Tracking Features
The VO is based on the 3D points obtained by
triangulation of features determined by feature
tracking method (
Shi, 1994). In this paper we do not
use our own triangulation method. We directly used
the 3D points provided by Tyzx (Tyzx.com) dense
stereo engine. Unfortunately the 3D points are
affected by noise due to different reasons. It causes
inaccurate translation/rotation estimation of VO.
Therefore 3D noise elimination procedure is
required before determining inliers for translation/
rotation estimation. This 3D noise is filtered
following image projection procedure. The
coordinate system is configured as X (west-east), Y
(north-south), and Z (current position to ahead)).
There are three ways in which the projection can be
achieved: X-Y projection (Front-view), Y-Z
projection (Side-view), and X-Z projection (Top-
view). The X-Y projection method is adopted in this
paper because noise points and object points are
easily separated. Scoring map (500 x 312) in the X-
Y projection is achieved by following equation:
,
)(
,
)(
minmin
Y
YY
r
XX
c
ii
Δ
−
=
Δ
=
(1)
)()
312/,500/
minmaxminmax
YYYXXX −=
∵
where
c and
are column and row of 500 x 312
image,
i
X
and
i
Y
are 3D point coordinates, and
min
X
and
min
Y
are the minimum values of
i
X
and
i
Y
respectively.
Each cell of the scored map is filled with number
of projected 3D points by corresponding Equation
(1). The cells which have smaller scored number
than the threshold (
threshold
N
) are eliminated.
The
threshold
N
is determined by following off-line
recursive iterations only at the beginning. The
threshold
N
starts with “0”, it increments until the
average Y value of X-Y projected points becomes a
positive floating point value. Due to the down
orientation of the Y axis and due to the positioning
of the origin of the world coordinate system on the
road surface, the initial average values are negative
floating point values. In each iteration, the cells
which have smaller scored number than the
threshold are eliminated. After iteration is finished,
the remaining 3D points are restored from scoring
map. These 3D points will be used for VO.
2.2 Inliers Selection for Visual
Odometry
In dynamic environments, the selection of inliers in
Visual Odometry is very important. Unlike other
Visual Odometry approaches (Nistér, Howard,
RELIABLE LOCALIZATION AND MAP BUILDING BASED ON VISUAL ODOMETRY AND EGO MOTION
MODEL IN DYNAMIC ENVIRONMENT
317