local region level information to update these mod-
els. Brisson (Martel Brisson and Zaccarin, 2007) also
use pixel color information in the YUV color space
to build a Gaussian mixture shadow model (GMSM).
Nevertheless, seeing as the approach is pixel-based,
the obtained model’s accuracy is dependent on the
color-based classifier’s results throughout time. In
(Huang and Chen, 2008) shadows are identified by
building pixel-based local region shadow models us-
ing GMMs. A global model is also estimated and used
to update the local region models when movement is
rare. The background, foreground and shadow mod-
els are built into a MRF energy function. However,
this method’s weak classifier which provides infor-
mation for the learning of cast shadow, requires the
definition of several parameters that are not adaptable
to illumination changes.
On the other hand, Porikli (Porikli and Thornton,
2005) models shadows by multivariate Gaussians us-
ing RGB color information provided by a pre-filter.
This approach does not require color space transfor-
mation and, seeing as it uses multiple independent
layers to model each shadow pixel, it is more flexible
than the standard GMMs approach to model shadow.
The shadow models are achieved using color informa-
tion provided by a pre-filter that evaluates color vari-
ation such as in (Horprasert et al., 1999). Shadow
pixels are distinguished using these models and mis-
classification are corrected using shadow flow, which
once again is a color-based analysis. One of the main
drawbacks of this method, is the fact that it does not
perform any sort of spatial contextualization of the
pixel’s label. Therefore, foreground pixels which pos-
sess similar color information to modeled shadow are
misclassified.
The method here presented overcomes this flaw by
considering that a pixel’s label is influenced by its
neighboring pixel labels. In a general matter, this
method is composed by a cascade of two classifiers.
To the results of these classifiers, spatial contextual-
ization is induced to correct misclassifications. The
first classifier is a weak classifier, which purpose is
analyzing every segmented foreground pixel and de-
termining whether a pixel is possibly shadow by mea-
suring the similarity between color and texture of
the foreground and corresponding background. This
is done by estimating the Color Normalized Cross-
Correlation (CNCC). This information is used to
build or update statistical models that describe the
RGB appearance of shadow pixels. These multi-
layered pixel-based models are used by the second
classifier (strong classifier) to identify cast shadows.
Nonetheless, erroneous classifications may seriously
compromise the foreground segmentation process. To
minimize the number of misclassifications, the pixel’s
neighboring labels are taken into account. To do so,
two distinct and independent approaches were imple-
mented and compared. One, consists on a pyramidal
decomposition of kernel density estimators (PKDE),
which has as main goal ascertaining probabilistic rep-
resentations of the surrounding pixel labels to im-
prove the results given by the pre-filter. Another tech-
nique also analyzes the spatial label dependencies us-
ing a Markov Random Field (MRF) energy function
which is minimized by the graph cut algorithm.
2 WEAK SHADOW CLASSIFIER
The weak classifier evaluates the segmented fore-
ground pixels to determine whether a pixel is a possi-
ble shadow pixel. The main goal of this classifier is
not to detect shadows accurately, but to filter out some
impossible shadow pixels. The results of this classi-
fier will be used further on by the strong classifier.
The approach here presented estimates the CNCC be-
tween each segmented pixel I
t
and the corresponding
background pixel Bp
t
(see subsection 2.1). To im-
prove the results of this classifier, two distinct tech-
niques were independently applied and compared.
One, uses a PKDE method (presented in subsection
5.1), while another method ascertains the pixel’s la-
bel by using a MRF approach (presented in subsec-
tion 5.2). A quantitative and qualitative analysis of
the results of these two techniques can be found in
subsection 6.1.
2.1 Color Normalized
Cross-Correlation (CNCC)
This classifier measures the similarity of color and
texture between foreground and background, by es-
timating the CNCC (Grest et al., 2003). More pre-
cisely, a pixel is classified as shadow if its texture
is correlated with the corresponding texture of Bp(t).
In order to estimate the CNCC, the brightness infor-
mation is split from the color values, which is done
by representing the pixel’s color in the bi-conic HSL
space (Grest et al., 2003) (see Figure 1.(a)). To mea-
sure the similarity, the correlation between both can
be estimated by projecting the RGB color vector onto
the chromatic HS plane in order to calculate the Eu-
clidean values of hue (h) and saturation (s). This al-
lows the estimation of the scalar product between the
referred pixels (h,s,L) which is proportional to their
correlation. This is quite simple to understand, see-
ing as if they have similar hues (small angle between
SHADOW MODELING AND DETECTION FOR ROBUST FOREGROUND SEGMENTATION IN HIGHWAY
SCENARIOS
149