to achieve the segmentation in the detection windows.
This novel proposed segmentation method is com-
posed of two steps:
Firstly, the HOG and SVM detection process com-
puted for the whole window in (Dalal and Triggs,
2005) is used in sub-parts of the window to pro-
vide more local shape information. The likelihood of
each contour segment of the window as being a part
of a human silhouette is computed and gives a pre-
segmentation (see Figure 1(e)) where the gray level
of a segment is proportional to its likelihood).
Secondly, the contour segment cycle that is the most
representative of a pedestrian is obtained by a Dijk-
stra’s algorithm in an oriented graph. This graph is
made with the contour segments as vertices and the
neighborhood between couple of close contour seg-
ments as edges. The integration of the knowledge on
the researched class (here the pedestrians) is obtained
by weighting the edges of the graph with the pre-
segmentation data. The optimal cycle finally gives the
human silhouette and provides the segmentation (see
Figure 1(f)).
Due to the human shape complexity, errors fre-
quently appear in the obtained results. Nevertheless,
some of them can be easily located. To this end the
process depicted above is iterated in the problematic
areas with updated graph features. Thus, each itera-
tion may improve the result.
The remainder of the paper is organized as fol-
lows: Section 2 reviews the human segmentation and
the use of contour segments. Section 3 describes the
pre-segmentation process. Section 4 presents the ori-
ented graph approach used for segmentation. Section
5 develops the iterative algorithm. Experimental re-
sults are presented in Section 6, followed by conclu-
sions in Section 7.
2 RELATED WORK
Human Detection and Segmentation. The descrip-
tor and classifier combination is the most used frame-
work in human detection. The descriptor converts an
image into a vector of discriminative features and the
classifier compares the features of a tested image to
the features of images of an annotated database. HOG
(Dalal and Triggs, 2005) and Haar wavelets (Oren
et al., 1997) are the most used descriptors. SVM
(Vapnik, 1995) and Adaboost (Freund and Schapire,
1995) are the most used classifiers.
Simultaneous detection and segmentation can
stem from the research of the region of interest (ROI)
which can be based on depth (with stereo as in
(Kang et al., 2002)) or color (by normalized cut as in
(Mori et al., 2007)). Otherwise, the silhouette can be
found by a template matching (Lin and Davis, 2010)
(Munder and Gavrila, 2006) where the image con-
tours are compared to the silhouettes of a codebook.
The relevance of ROI or the similarity to a template
of the codebook gives the detection. Gathering of the
ROI or finding template delineates the silhouette and
also achieves the segmentation.
Hernandez (Hernandez et al., 2010) performs face
detection and skin color model for seed initializa-
tion in a graph cut process. This initialization is
provided by a previously computed pose estimation
in (Pishchulin et al., 2012). Wang (Wang and Koller,
2011) finally minimizes an energy that simultane-
ously takes into account the body parts localization
and the segmentation.
Contour Segment Approaches. As silhouette shape
is well-descriptive of the human class, there is a range
of methods based on the analysis of its parts. Indeed
Shotton (Shotton et al., 2008) demonstrates that a few
number of fragments of outline contours permit hu-
man recognition. For segmentation, Ferrari (Ferrari
et al., 2006) focuses on the succession of descriptive
contour segments. Wu (Wu and Nevatia, 2007) builds
a classifier to recognize human parts from edgelet fea-
tures (detection) and a classifier to recognize the fore-
ground pixels (segmentation). The two classifiers are
used together to carry out the two processes simulta-
neously. Gao (Gao et al., 2009) generates from the
contour a feature named Adaptive Contour Feature
that at the same time defines a weak classifier for hu-
man detection and segmentation. Hariharan (Hariha-
ran et al., 2011) combines information from different
part detectors to classify category-specific object con-
tours. Lastly, Sharma (Sharma and Davis, 2007) finds
the relevant contour segment cycles from an oriented
graph. Then, the cycles are integrated in a Markov
Random Field and a graph cut selects the one which
are related to silhouette and achieves the segmenta-
tion.
We want that segmentation deals with an usual
and efficient detection method. Our approach, which
inversely as (Sharma and Davis, 2007), searches the
prior cue first and then the cycle, is so well-adapted.
3 PRE-SEGMENTATION
In (Dalal and Triggs, 2005), the HOG and SVM com-
bination allows the detection. For each detection win-
dow, the only obtained information is the decision
VISAPP2013-InternationalConferenceonComputerVisionTheoryandApplications
406