OPTIMAL COMBINATION OF LOW-LEVEL FEATURES
FOR SURVEILLANCE OBJECT RETRIEVAL
∗
Virginia Fernandez Arguedas, Krishna Chandramouli, Qianni Zhang and Ebroul Izquierdo
Multimedia and Vision Research Group, School of Electronic Engineering and Computer Science
Queen Mary, University of London, Mile End Road, London, E1 4NS, U.K.
Keywords:
Object retrieval, Multi-feature fusion, Particle swarm optimisation, Surveillance videos, MPEG-7 features,
Machine learning.
Abstract:
In this paper, a low-level multi-feature fusion based classifier is presented for studying the performance of
an object retrieval method from surveillance videos. The proposed retrieval framework exploits the recent
developments in evolutionary computation algorithm based on biologically inspired optimisation techniques.
The multi-descriptor space is formed with a combination of four MPEG-7 visual features. The proposed
approach has been evaluated against kernel machines for objects extracted from AVSS 2007 dataset.
1 INTRODUCTION
Recent technological developments coupled together
with people’s concern for safety and security have
caused a wide spread application of Closed Circuit
Television (CCTV) cameras which have been widely
installed for surveillance monitoring. With such an
exponential increase in video footage, there exists
critical need for the development of automatic and in-
telligent retrieval models for objects and events to en-
able efficient media access, navigation and retrieval.
Addressing the challenges related to object indexing,
several approaches has been presented based on prob-
abilistic, statistical and biologically inspired classi-
fiers (Chandramouli and Izquierdo, 2010). Many of
these techniques generate satisfactory results for gen-
eral datasets such as movies, sports and news. How-
ever, the challenge of retrieving surveillance objects
remains a largely an open issue.
Among the approaches presented in the literature,
visual appearance based retrieval has gained much
popularity. The range of visual features used for ob-
ject retrieval from surveillance videos include, colour
histograms from different colour space and Gabor
filters. More recently, MPEG-7 based colour, tex-
ture and shape descriptors have been largely investi-
gated for multimedia indexing and retrieval (Sikora,
2002). In many of these approaches authors con-
∗
The research was partially supported by the European
Commission under contract FP7-216444 PetaMedia.
sider a single low-level descriptor to provide a high-
level degree of distinguishability among objects. In
order to generate robust and complex representation
of objects, a multi-descriptor feature space is con-
structed to represent objects extracted from surveil-
lance videos (Mojsilovic, 2005). The combination of
low-level-features to obtain higher order representa-
tions have been addressed over the years in pattern
recognition. For instance, in (Zhang and Izquierdo,
2007; Soysal and Alatan, 2003) authors proposed ap-
proaches that used combination of multiple low-level
features to index and retrieve media items. However,
to the best of our knowledge, such feature fusion ap-
proaches has not yet been applied for object retrieval
from surveillance video datasets.
In this paper, we present an optimal combination
of low-level feature spaces appropriate for surveil-
lance object retrieval. Besides, in order to study
the performance of the proposed multi-feature space
a comparison against the individual features perfor-
mance along with a linear combination of selected
features is presented. The proposed retrieval frame-
work exploits the recent developments in evolution-
ary computational algorithms based on biologically
inspired optimisation techniques. Recent develop-
ments in optimisation techniques have been inspired
by problem solving abilities of biological organisms
such as bird flocking and fish schooling. One such
technique developed by Eberhart and Kennedy is
called Particle Swarm Optimisation (PSO)(Kennedy
and Eberhart, 2001). The proposed approach has been
187
Fernandez Arguedas V., Chandramouli K., Zhang Q. and Izquierdo E..
OPTIMAL COMBINATION OF LOW-LEVEL FEATURES FOR SURVEILLANCE OBJECT RETRIEVAL.
DOI: 10.5220/0003527101870192
In Proceedings of the International Conference on Signal Processing and Multimedia Applications (SIGMAP-2011), pages 187-192
ISBN: 978-989-8425-72-0
Copyright
c
2011 SCITEPRESS (Science and Technology Publications, Lda.)