Oriented Gradients (HOG) are computed both on the
raw 2D intensity image and also on the superpixels
mean image for comparison purposes. Principal
Component Analysis (PCA) is also employed for
feature dimensionality reduction. Traditional
classifiers like Support Vector Machine (SVM) and
AdaBoost are trained and tested in order to prove that
using superpixels for hypotheses validation can
strongly reduce the processing time while the quality
of the pedestrian detection results is not so affected.
The rest of the paper is organized as follows: in
section 2 we present the related work, in section 3 the
system overview, in section 4 the superpixel-based
pedestrian hypotheses detection, in section 5 the
extracted features and the classifiers that are used, in
section 6 the experimental results and finally in
section 7 we draw the conclusions of this work.
2 RELATED WORK
Superpixels are clusters of pixels based on local
image features. SLIC superpixels described in
(Achanta et al., 2012) represent a fast approach that
can be used for segmenting gray levels images in
separate superpixels. They may be used for reducing
the complexity of subsequent image processing tasks
like obstacle detection. Pedestrian hypotheses are
usually extracted from the set of detected obstacles by
imposing some pedestrian specific geometrical
constraints. The hypotheses are then used for
reducing the search space, resulting in a faster
pedestrian detection process.
Usually stereovision based approaches are widely
used first in traffic scenes obstacle segmentation
(Oniga and Nedevschi, 2010) and second for
validating the obstacle classification results (Bertozzi
et al., 2008). Features extraction and feature-based
classifiers represent intermediate steps in obstacles
classification. Features are usually extracted from the
2D appearance obstacle images but they can also
integrate depth information and optical flow motion
information. Obstacle classifiers may be trained
directly on the extracted features or on a subset of
relevant features (You and Ruichek, 2012). A high
quality of the stereo-reconstruction process (Pantilie
and Nedevschi, 2012) is absolutely necessary for
obtaining a dense and accurate 3D points map. Based
on this map, several algorithms like points grouping
(Pocol et al., 2008) or density map analysis
(Nedevschi et al., 2009) may be used for obstacle
segmentation. In comparison with monocular vision
based techniques that uses symmetry (Bertozzi et al.,
2000), edges (Bertozzi and Broggi, 1998) and
textures (Heikkila and Pietikainen, 2006) from
intensity information, stereovision based obstacle
segmentation approaches (Broggi et al., 2011,
Nedevschi et al., 2004, Llorca et al., 2012) are clearly
superior.
Methods that divide the image pixels into regions
having the properties that all pixels from a separate
region are similar with respect to a chosen similarity
metric are presented in (Felzenszwalb and
Huttenlocher, 2004) and (Xiaofeng and Malik, 2003).
A graph where the nodes are the image pixels and the
edges represent a neighborhood relationship between
pixels is computed. These methods represent the basis
of the, nowadays very common and superior,
superpixels based image segmentation approaches. In
(Giosan and Nedevschi, 2014) we proposed a novel
obstacle detection method based on the original scene
segmentation in superpixels. The method combined
the intensity, depth and motion information within the
SLIC superpixels. A novel algorithm was proposed
for superpixels clustering into obstacles and obstacles
refinement. A method for very close obstacles
separation was developed based on the motion
vectors analysis of their component superpixels. The
results showed a very good obstacles detection with
precise segmentation of their surfaces which is
particularly useful for subsequent processes like
pedestrian detection. Continuing this work, in this
paper we propose a novel method for superpixels-
based generation and validation of pedestrian
hypotheses. The superpixels benefits in the pedestrian
detection process are clearly highlighted.
In the literature, several methods use different
discriminant features like shapes and edges (Broggi
et al., 2000), contours (Hilario et al., 2005), contour
templates (Gavrila and Philomin, 1999; Gavrila,
2000; Giosan and Nedevschi, 2009), symmetries
(Havasi et al., 2004), Haar features (Papageorgiou
and Poggio, 2000), HOG features (Dalal and Triggs,
2005) used for pedestrian detection. Usually these
features are firstly extracted on pedestrian hypotheses
and then fed into classifiers that are able to distinguish
between pedestrians and other traffic scene obstacles.
A lot of different methods exist for feature based
obstacle and specifically pedestrian classification
(Giosan and Nedevschi, 2012). In (Rivlin et al., 2002,
Lun et al., 2007), a SVM classifier is used for
recognizing pedestrians and bikes in traffic scenes. A
powerful Adaboost classifier built upon some
characteristics of rectangular edge description is
proposed in (Yi et al., 2010) for high accuracy
pedestrian recognition. Neural networks are also used
for high-accuracy pedestrian and other obstacles
classification (Toth and Aach, 2003). The state of the