tially the RGB model can be enough for many tasks
when color information is present. To assess this hy-
potheses, we reordered the model ranking based on
baseline and initialization perturbation results: RGB
(17.08) > RGB-LSN (17.34) > RGB-LBP (17.79) >
RGB-OG (18.13) > RGB-LBP5 (20.35) > RGB-
LSN5 (25.18). It is clearly obvious that dropping
the requirement for gray-scale handling an RGB tar-
get model is enough to perform better than any other
model.
5 CONCLUSION AND
DISCUSSION
“How well does mean-shift algorithm benefit from
saliency-based target representation?”
Despite we demonstrated that RGB model is suf-
ficient for many cases, the answer is not a straight
forward yes/no. In fact, the choice of target model
is dependent on the application, target characteristic
and sensor. We must however assert that RGB-LSN
has slightly the edge over RGB since it also handles
the gray-scale input properly. Otherwise putting the
gray-scale input aside, we can interpret that the con-
tribution of the combination of various features with
RGB is often marginal and most probably not needed.
It may be questioned about the reason why the
findings in this study may differ from those papers
which proposed the application of various features
with RGB for target modeling. The reason most prob-
ably lies on the use of limited number of test se-
quences and target specific (e.g., tracking a particu-
lar object) applications which affects the understand-
ing about the general behavior of the model. A sim-
ilar phenomenon is often observed in the evaluation
of saliency-based trackers which casts a shadow on
their true strength and motivates a careful study of
saliency based algorithms and methods. This issue
goes outside the span of the current paper and will be
addressed in the future work.
It is also worth noting that relying solely on the av-
erage ranking results of a benchmark is not necessar-
ily wise and a closer look to the underlying scores are
needed. This becomes important in choosing the ap-
propriate algorithm and target model for a specific ap-
plication since the overall score can be easily skewed
and be misleading.
ACKNOWLEDGEMENTS
Hamed R.-Tavakoli and Jorma Laaksonen were sup-
ported by The Academy of Finland under the Finnish
Center of Excellence in Computational Inference Re-
search (COIN).
REFERENCES
Borji, A., Frintrop, S., Sihite, D., and Itti, L. (2012). Adap-
tive object tracking by learning background context.
In CVPRW, pages 23–30.
Comaniciu, D., Ramesh, V., and Meer, P. (2003). Kernel-
based object tracking. IEEE Trans. Pattern Anal.
Mach. Intell., 25(5):564–577.
Dalal, N. and Triggs, B. (2005). Histograms of oriented
gradients for human detection. In CVPR, volume 1,
pages 886–893 vol. 1.
Frintrop, S. and Kessel, M. (2009). Most salient region
tracking. In ICRA.
Fukunaga, K. and Hostetler, L. (1975). The estimation
of the gradient of a density function, with applica-
tions in pattern recognition. IEEE Trans. Inf. Theory,
21(1):32–40.
Kristan, M., Pflugfelder, R., Leonardis, A., Matas, J.,
Porikli, F., Cehovin, L., Nebehay, G., Fernandez, G.,
Vojir, T., Gatt, A., Khajenezhad, A., Salahledin, A.,
Soltani-Farani, A., Zarezade, A., Petrosino, A., Mil-
ton, A., Bozorgtabar, B., Li, B., Chan, C. S., Heng, C.,
Ward, D., Kearney, D., Monekosso, D., Karaimer, H.,
Rabiee, H., Zhu, J., Gao, J., Xiao, J., Zhang, J., Xing,
J., Huang, K., Lebeda, K., Cao, L., Maresca, M., Lim,
M. K., El Helw, M., Felsberg, M., Remagnino, P.,
Bowden, R., Goecke, R., Stolkin, R., Lim, S., Ma-
her, S., Poullot, S., Wong, S., Satoh, S., Chen, W., Hu,
W., Zhang, X., Li, Y., and Niu, Z. (2013). The visual
object tracking vot2013 challenge results. In ICCVW,
pages 98–111.
Ning, J., Zhang, L., Zhang, D., and Wu, C. (2009). Robust
object tracking using joint color-texture histogram. In-
ternational Journal of Pattern Recognition and Artifi-
cial Intelligence, 23(07):1245–1263.
Ojala, T., Pietik¨ainen, M., and M¨aenp¨a¨a, T. (2002). Mul-
tiresolution gray-scale and rotation invariant texture
classification with local binary patterns. IEEE Trans.
Pattern Anal. Mach. Intell., 24(7):971–987.
Plataniotis, K. N. and Venetsanopoulos, A. N. (2000). Color
Image Processing and Applications. Springer-Verlag
New York, Inc., New York, NY, USA.
Tavakoli, H. and Moin, M. (2010). Mean-shift video track-
ing using color-lsn histogram. In Telecommunications
(IST), 2010 5th International Symposium on, pages
812–816.
Tavakoli, H. R., Moin, M. S., and Heikkila, J. (2013). Local
similarity number and its application to object track-
ing. Int J Adv Robot Syst, 10(184):1–6.
Yilmaz, A., Javed, O., and Shah, M. (2006). Object track-
ing: A survey. ACM Computing Surveys, 38(14).