by the first author, additionally. The OpenCV deci-
sion tree model implementation was used to learn the
object classification with the sample vectors.
3.1.2 Processing the Data and Evaluation
In detail, the concretely implemented procedure
works as follows. Here, all constants are experimen-
tally determined to train the models in less than the
given 120 minutes and to classify the provided images
in less than 30 minutes.
Step 1: Training. Up to 25 images were down-
loaded from the Internet for each object on the list. All
downloaded images were segmented by color and for
each resulting segment, 39083 segments altogether, a
feature vector V with 300 entries was computed (car-
dinality of MPD, MCD and MCCD = 100). After the
association of the feature vectors to 1000 clusters with
k-means clustering, the cluster model is build from the
cluster associations.
Using the cluster model the decision tree model is
trained with a sample vector for each segment struc-
tured as follows: Each sample vector has k + 2 entries
(i.e. chosen cluster count +2). The first k entries con-
tain the number of segments associated to the respec-
tive cluster in the neighborhood of the actual segment
of the image. In this context, neighborhood means that
the bounding boxes of the segments overlap or have
a distance less than 3 pixels. The entry k + 1 contains
the cluster number of the actual segment and the value
of the entry k + 2 is the category identifier of the ac-
tual category of the image.
Step 2: Classification of the Unknown Images.
For each segment in the image the feature vectors V
and the sample vectors are created (without the cate-
gory of the image). The decision tree model predicts
the image category with the sample vectors. Each pre-
dicted category of the image and the number of seg-
ments in the neighborhood of the actual segment is
stored. The category with the most number of seg-
ments in the neighborhood is chosen as the category
of the image.
During the challenge one image was classified
correctly, 14 images were falsely classified and for the
remaining 30 images no category was found (on 9 im-
ages there was not any classifiable object).
4 FUTURE WORK
Our first results are encouraging, but in the future, the
implementation of our approach has to be faster with
an increased object recognition success rate.
For that, the image preprocessing and the segmen-
tation algorithm have to be improved, in order to sup-
port a better classification. Smoothing the distance
histograms to reduce measurement artifacts, using a
clustering algorithm with a variable cluster count to
get a cluster model with less but more precise clusters
and using more spatial relations of the segments for a
more accurate decision tree model is also desirable.
The goal is to implement the approach as a
real-time object recognition system feasible for au-
tonomous multi-copters, i.e. flying robots with several
propellers.
REFERENCES
Alegre, E., Alaiz-Rodrguez, R., Barreiro, J., and Ruiz, J.
(2009). Use of contour signatures and classification
methods to optimize the tool life in metal machining. Es-
tonian Journal of Engineering, 1:3–12.
B
¨
assmann, H. and Kreyss, J. (2004). Bildverarbeitung Ad
Oculos. Springer, Berlin, Heidelberg, New York, 4th edi-
tion.
Berry, M. J. A. and Linoff, G. (1997). Data Mining:
Techniques For Marketing, Sales, and Customer Support.
John Wiley & Sons Inc., New York, Chichester, Wein-
heim, Brisbane, Singapore, Toronto.
Bradski, G. and Kaehler, A. (2008). Learning OpenCV:
Computer Vision with the OpenCV Library. O’Reilly
Media Inc., Beijing, Cambridge, Farnham, K
¨
oln, Se-
bastopol, Taipei, Tokyo.
Han, J. and Kamber, M. (2006). Data Mining: Concepts and
Techniques. Morgan Kaufman Publishers, Amsterdam,
Boston, Heidelberg, London, New York, Oxford, Paris,
San Diego, San Francisco, Singapore, Sydney, Tokyo,
2nd edition.
J
¨
ahne, B. (2005). Digitale Bildverarbeitung. Springer,
Berlin, Heidelberg, New York, 6th edition.
OpenCV (2010). OpenCV (open source computer vision)
library. http://opencv.willowgarage.com/wiki/.
Shuang, F. (2001). Shape representation and retrieval using
distance histograms. Technical report, Dept. of Comput-
ing Science, University of Alberta.
SRVC (2009). Semantic robot vision challenge.
http://www.semantic-robot-vision-challenge.org.
Steinm
¨
uller, J. (2008). Bildanalyse. Von der Bildver-
arbeitung zur r
¨
aumlichen Interpretation von Bildern.
Springer, Berlin, Heidelberg.
SEMANTIC OBJECT RECOGNITION USING CLUSTERING AND DECISION TREES
673