Figure 6: Background ROC curve for K-NN (Yellow), SVM
(Purple), Decision Tree (Green), Random Forest (Aqua)
and N
¨
aive Bayes (Red).
co-registered image and depth data acquired from a
vehicle based mobile mapping system. The depth
data was acquired from the analysis of stereo pairs of
panoramic images with errors present that are com-
mon in stereo analysis. The methods used reflect the
need to find a technique that a non-expert user can use
to train the system to do a relatively inexact recogni-
tion process to find all the objects of interest with a
consequential significant but manageable false alarm
rate. Bounding boxes are used to identify objects of
interest as well as random background examples for
training. A large number of features have been in-
vestigated with the thesis that machine learning will
select the most useful ones. Features have been ex-
plored from traditional RGB images as well as from
depth images using, in some cases, the same algo-
rithms by regarding the depth images as monochrome
grey scale images. The classification results show that
image features perform better than depth features but
a combination of image and depth features performs
the best. The conclusion is that even quite coarse
depth features can improve performance.
Future work will explore larger feature sets for
each class, with more example classes. However it
has to be noted that, for training, the smallest number
of training examples is desired to improve the training
workflow for the user.
ACKNOWLEDGEMENTS
This work is supported by the Cooperative Research
Centre for Spatial Information, whose activities are
funded by the Australian Commonwealths Coopera-
tive Research Centres Programme. It provides PhD
scholarship for Michael Borck and partially funds
Professor Geoff West’s position. The authors would
like to thank John Ristevski and Anthony Fassero
from Earthmine and Landgate, WA for making avail-
able the datasets used in this work.
REFERENCES
Alexe, B., Deselaers, T., and Ferrari, V. (2010). What is
an object? Computer Vision and Pattern Recognition,
IEEE Computer Society Conference on.
Badami, I., St
¨
uckler, J., and Behnke, S. (2013). Depth-
enhanced hough forests for object-class detection and
continuous pose estimation. Semantic Perception,
Mapping and Exploration, SPME-2013.
Bay, H., Ess, A., Tuytelaars, T., and Gool, L. V. (2008).
Speeded-up robust features (surf). Computer Vision
and Image Understanding, 110(3):346 – 359.
Besl, P. J. (1988). Active, optical range imaging sensors.
Machine vision and applications, 1(2):127–152.
Cadena, C. and Ko
ˇ
secka, J. (2013). Semantic parsing for
priming object detection in rgb-d scenes. In Semantic
Perception, Mapping and Exploration (SPME) 2013.
Coleman, S., Scotney, B., and Suganthan, S. (2007). Fea-
ture extraction on range images - a new approach. In
Robotics and Automation, 2007 IEEE International
Conference on, pages 1098 –1103.
Dalal, N. and Triggs, B. (2005). Histograms of oriented
gradients for human detection. In Schmid, C., Soatto,
S., and Tomasi, C., editors, International Conference
on Computer Vision & Pattern Recognition, volume 2,
pages 886–893.
Dem
ˇ
sar, J., Zupan, B., Leban, G., and Curk, T. (2004). Or-
ange: From experimental machine learning to interac-
tive data mining. In Boulicaut, J.-F., Esposito, F., Gi-
annotti, F., and Pedreschi, D., editors, Knowledge Dis-
covery in Databases: PKDD 2004, pages 537–539.
Springer.
Guinn, J. (2002). Enhanced formation flying validation
report (jpl algorithm). NASA Goddard Space Flight
Center Rept, pages 02–0548.
He, D.-C. and Wang, L. (1991). Texture features based on
texture spectrum. Pattern Recognition, 24(5):391 –
399.
Huang, J., Lu, J., and Ling, C. (2003). Comparing naive
bayes, decision trees, and svm with auc and accuracy.
In Data Mining, 2003. ICDM 2003. Third IEEE Inter-
national Conference on, pages 553–556.
Jackson, D. A. (1993). Stopping rules in principal compo-
nents analysis: a comparison of heuristical and statis-
tical approaches. Ecology, pages 2204–2214.
Kaiser, H. F. (1960). The application of electronic comput-
ers to factor analysis. Educational and psychological
measurement.
Kurita, T. and Boulanger, P. (1992). Computation of sur-
face curvature from range images using geometrically
intrinsic weights. MVA, pages 389–392.
Ling, C. X., Huang, J., and Zhang, H. (2003). Auc: a bet-
ter measure than accuracy in comparing learning al-
gorithms. In Advances in Artificial Intelligence, pages
329–341. Springer.
ICPRAM2014-InternationalConferenceonPatternRecognitionApplicationsandMethods
660