and show the technical readings in depth in Results.
The Discussion section covers additional ideas, like
tracking, to further develop the solution. The final
section, Conclusions, summarizes the paper and the
results overall.
2 RELATED WORK
The plane equation method is not the first idea involv-
ing depth sensors and tracking since the sensors have
become consumer products. Other propositions have
been made to be used for either feature extraction or
to gather data. Here are a few of those ideas:
In a paper (Ataer-Cansizoglu et al., 2013), track-
ing is done by extracting points and planes from depth
data and an extended prediction algorithm is used.
The paper does not include details on how the actual
feature detection is accomplished.
The paper (Fallon et al., 2012), introduces Kinect
Monte Carlo Localization (KMCL) to calculate pose
from point cloud provided by a depth sensor. KMCL
uses a 3D-map of the surroundings and uses simulated
depth and color images as a comparison. The KMCL
requires a model of the tracked environment which
differentiates it from the proposed method.
For a more detailed information on different meth-
ods, please see (Taskinen et al., 2015), a literature re-
view on different depth related tracking methods in
augmented reality. The report lists a total of 21 depth
sensor related tracking and scanning articles and cat-
egorizes them by their main purpose: Localization,
reconstruction or both.
2.1 Plane Filtering
Plane filtering is a way to make the tremendous
amount of data a depth sensor produces a little eas-
ier to handle in computational sense. Any pixel-by-
pixel method becomes very unusable when the accu-
racy of one pixel can vary between 1 mm to 30 mm.
The problem is compounded by the fact that the data
is most likely handled in 3D-coordinates. To counter
this problem, the data needs to be handled in larger
groups, hence the plane filtering.
The base idea is to take only a few points (sam-
ples) from data and derive possible planes from them.
To actually use these planes in mesh building and
tracking, a further edge detection step must be per-
formed.
In the article (Biswas and Veloso, 2012) a Ransac
based plane filtering system is proposed. In short, the
plane detection is done by randomly selecting sample
points to form planes and then continuing on, again
randomly, to extend that plane with additional sam-
ples. Our approach improves stability in cases where
random sampling may result in unpredictable data
loss. Additionally, our approach is configurable dur-
ing run time, allowing a clear tradeoff between exe-
cution time and accuracy. Random selection can be
tuned in similar fashion, but the resulting behaviour
is not so easily predictable. One set of random points
can generate more than satisfactory results for a single
run, but on the next run the same number of random
points can yield sub-par results. With our approach
the results show a constant quality level for each grid
size. This allows the fine tuning to be performed in a
predictable fashion.
2.2 Simultaneous Localization and
Mapping
Simultaneous Localization And Mapping (SLAM)
has been originally developed for conventional color
cameras but the idea can be processed with depth sen-
sors also (Riisgaard and Blas, 2005). Closest prac-
tical example of depth SLAM would be KinectFu-
sion (Newcombe et al., 2011), where depth sensor is
used to map the data and aid the tracking in visually
obscure situations like in dark spaces or colorless en-
vironments.
Real-Time mapping in SLAM has always been a
problem in computer power and memory use, and in
KinectFusion case the mapped model is in a memory
limited cube of voxels. Other solutions have solved
the memory issue by limiting the quality of the map
or by using extensions to memory that may slow the
computation (Whelan et al., 2012).
The part of the SLAM our method uses, does not
require as much detail. This keeps the computation
and memory consumption at a lower level, so it will
not be an issue (See Section 5).
3 PLANE EQUATION
DETECTION
This paper introduces a new method to be used with
depth data to cover all the applications using multi-
ple tracking methods with limited computing power.
In terms of computing power, the plane equations are
relatively easy to calculate.
Detecting planes in a depth image is simple when
the right mathematical tools are being used. The three
states of the proposed plane detection are illustrated
in Figure 1. By using a split grid over the image, the
detection task can be quantified into smaller tasks. In
SIGMAP 2017 - 14th International Conference on Signal Processing and Multimedia Applications
18