Table 1: Real-time processing constraints can be satisfied
up to 128x128 target sizes.
Track Window Size 32x32 64x64 128x128 256x256
Processing Time (ms) 2.272 10.129 35.961 151.056
represents amount of filter shift applied at each frame.
Online parameter tuning is another interaction
with tracker. In many CF based tracker, size depen-
dent parameters are set at the track initialization and
kept fixed since target size is not known. Although
size dependent parameters can vary from different
trackers, our method allows their online adjustments
resulting in better adaptability and increased capabi-
lity of rejection of similar targets in the vicinity. For
(Bolme et al., 2010), density distribution of prepro-
cessing window and size of PSR window are such pa-
rameters. Unified effect of online parameter update
together with filter alignment are given in Sec.4.
4 EXPERIMENTAL RESULTS
Testing our solution in the presence of scale chan-
ges, track drifts and erroneous track initialization is
crucial since these issues are primarily addressed in
this paper. Therefore, we evaluated performance on
scenarios from three different datasets. From Vivid
(Collins et al., 2005) egtest01-02-03; from Aircraft
tracking (Mian, 2008) small1, occlusion1; and from
CVPR2013 benchmark (Wu et al., 2013) Sylvester,
Walking, Walking2 are selected. These datasets fits
for testing amendments of drift prevention and size
adaptivity since they are dominated by targets having
maneuvers, in-plane and out-of plane rotations, scale
changes and deformations that generally causes track
drifts and loses. To achieve more solid results, num-
ber of scenarios obtained from egtest01-02-03 is in-
creased to 10 in total (5, 3, and 2 respectively) by
tracking auxiliary targets,whose ground truths are ma-
nually labeled, in addition to main targets given in
(Collins et al., 2005).
During the experiments main attention is paid on
quantifying the performance improvement by compa-
ring base trackers with their enhanced versions. For
evaluating performance of trackers, methodology and
metrics proposed in (Wu et al., 2013) is followed.
Hence, success and precision plots are generated to
reveal track success rates (percentage of frames in
which tracking is maintained) by measuring two dif-
ferent error types; target bounding box overlap and
centralization errors. To be more precise, success
plots uses a common overlap score which is defined
as S =
|
r
t
∩r
g
|
|
r
t
∪r
g
|
where r
t
is output target bounding box
and r
g
ground truth bounding box. Although compa-
ring overlap score with a fixed threshold is enough to
obtain track success rate, success plot is generated by
sweeping this threshold from 0 to 1 for better charac-
terization. In precision plot, track success rate is dis-
closed based on center location error (CLE) that me-
asures euclidean distance between ground truth and
output track window centers. Similar to success plot,
precision plot is also generated by comparing distance
with a threshold ranging from 0 to 50. In order to rank
trackers in precision plot, performances at CLE 15 is
used while 0.5 is selected for success plot. To investi-
gate whether the proposed scheme introduces robust-
ness to initializations, temporal robustness evaluation
(TRE) and spatial robustness evaluation (SRE) are
carried out together with one-pass evaluation (OPE)
as is proposed in (Wu et al., 2013). OPE is the con-
ventional scheme in which initialization is achieved
perfectly at the first frame and tracker runs through
whole scenario. In TRE analysis, scenario is divi-
ded into 20 segments and tracks are perfectly initi-
alized at the first frame of these segments. In SRE
analysis, erroneous track initializations are simulated
by giving 8 spatial shifts including 4 center shifts and
4 corner shifts, and 4 scale variations. Spatial shifts
are given in 10% of target size while scale ratios are
0.8, 0.9, 1.1 and 1.2 to the ground truth. For the pa-
rameter setting of the base trackers, we set them as
default. Target instance extraction requires single pa-
rameter that is slic super pixel area and set to 65. For
target likelihood map generation maximum learning
rate constant α = 0.05, penalization constant β = 0.3
and consistency threshold = 0.85 are used. For filter
alignment maximum acceleration (a
thres
) and velocity
(v
thres
) thresholds are set to 0.2, and 2 while step size
µ[n] is set to 0.3λ[n].
Figure 4 illustrates success and precision plots of
6 base trackers together with their enhanced versions
while Table 2 quantitatively compares base trackers
directly with their enhanced versions to disclose the
impact of proposed solution on each of the trackers
and the effect on the average. According to Table 2,
proposed solution boosts performance of almost each
tracker at each performance aspect. Achieved im-
provement on CLE and overlap metrics indicates that
proposed solution is successful at both centralization
and size disclosure. Smallest performance increase is
achieved in TRE (overlap 6.2%, CLE 5.1) since base
trackers also cannot achieve high track success rates
due to low contrast and frequent occlusions. Obvi-
ously, boost in OPE (overlap 10.2%, CLE 12.6) is
much better than TRE since base trackers have higher
track success rates which provides proposed solution
additional time for better target learning. SRE is the
Enhancing Correlation Filter based Trackers with Size Adaptivity and Drift Prevention
477