mance of a binary classification, however most of
these metrics assume that there is approximately a
balanced quantity of elements in the classes. In this
dataset the foreground usually amounts to less than
the 10% of pixels in the image, hence we choose the
true positive rate (TPR), and the Matthews Correla-
tion Coefficient (MCC), the former is independent of
the class distribution, while the later is designed to
measure the quality of the classification even with un-
balanced classes.
During the evaluation of the algorithm it was clear
that scales (box sizes) larger than 11 were not appro-
priate for the segmentation of relative small objects
in movement (like the hands and forearms), also the
body boundaries are not properly located. Therefore
we first seek for a combination of scales between 1
and 11 that provides the best results, for this partic-
ular dataset the selected scales were 1,3 and 5. The
results are summarized in tables 1 & 2.
Table 1: True positive Rate.
Sequence Regular Σ − ∆ TPR Proposed Σ − ∆ TPR
1 37.94% 67.46%
2 55.73% 68.95%
3 31.61% 69.23%
4 57.63% 73.25%
5 48.62% 71.94 %
6 65.86% 78.05%
Table 2: Matthews Correlation Coefficient.
Sequence Regular Σ − ∆ MCC Proposed Σ − ∆ MCC
1 0.557 0.683
2 0.707 0.678
3 0.524 0.721
4 0.725 0.731
5 0.656 0.713
6 0.767 0.774
The TPR of the proposed method outperforms the
regular Σ∆ in every test sequence, this can be at-
tributed to the better detection of the limbs in motion,
specially the shins and forearms (see figure 6).
An interesting feature of the proposed algorithm
can be analysed with table 1, our method offers a large
improvement for sequences 1, 3 and 5 (30.06% aver-
age) however the improvement for sequences 2, 4 and
6 is smaller (13.71% average). This is related to the
background of the sequences, on the first group the
background has several objects of different colors on
it i.e. it contains borders, the background of the later
group has a single color and is nearly flat (see figure
3). The absence of borders lowers the effectiveness of
the proposed algorithm as the input information for
the Σ∆ comes mainly from the intensities of neigh-
bouring pixels.
Figure 3: Sequences 2,4,6 on the left side, sequences 1, 3, 5
on the right side.
While the TPR shows a significant improvement
of our algorithm over the regular Σ∆, the MCC shows
cases where there is not a significant improvement
over the base algorithm. This can be attributed to the
nature of the dataset, where the moving object (human
body) is present and in motion on the first frames, this
generates ghosts on every scene for both algorithms,
these ghosts last longer in our algorithm thus increas-
ing the amount of False Positives on the first frames,
drawing down the average MCC for the sequence.
This can be seen in figures 4 and 7, while the first
frames show an MCC for the proposed algorithm un-
der the MCC of the regular sigma delta, on the next
frames (when the ghost starts to fade) the MCC of
our algorithm is better, even in the second sequence,
where out algorithm had an average MCC under the
regular Σ∆ (fig. 4).
Again the nature of the background seems to have
influence on how long the ghosts last, scenes 1, 3
and 5. have ghosts that last shorter than the ghosts
in scenes 2,4,6.
3.1 Performance
As stated on section 2, one of the main features of the
Σ∆ Background subtraction is its computational effi-
ciency, therefore we briefly analyze the performance
penalty of the multiscale features and the new classi-
fication criterion.
A GNU Octave implementation of both algo-
rithms was tested on an core i7 processor at 3.3 Ghz,
on this set up the average the regular Σ∆ can pro-
cess 6.72 million pixels per second. The speed of the
proposed extension depends on the number of scales
A ROBUST BACKGROUND SUBTRACTION ALGORITHM USING THE A ∑-∆ ESTIMATION - Applied to the
Visual Analysis of Human Motion
487