MULTIPLE TARGET TRACKING AND IDENTITY LINKING
UNDER SPLIT, MERGE AND OCCLUSION OF TARGETS AND
OBSERVATIONS
Jos
´
e C. Rubio, Joan Serrat and Antonio M. L
´
opez
Computer Vision Center and Comp. Science Dept., Universitat Aut
`
onoma de Barcelona, 08193 Cerdanyola, Spain
Keywords:
Tracking, Graphical models, MAP inference, Particle tracking, Live cell tracking, Intelligent headlights.
Abstract:
Multiple object tracking in video sequences is a difficult problem when one has to simultaneously deal with
the following realistic conditions: 1) all or most objects share an identical or very similar appearance, 2)
objects are imaged at close positions so there is a data association problem which becomes worse when the
number of targets is high, 3) the objects to be tracked may lack observations for a short or long interval, for
instance because they are not well detected or are being temporally occluded by another non-target object, and
4) their observations may overlap in the images because the objects are very near or the image results from a
2D projection from the 3D scene, giving rise to the merging and subsequently splitting of tracks. This later
condition poses the additional problem of maintaining the objects identity when their observations undergo
a merge and split. We pose the tracking and identity linking problem as one of inference on a two-layer
probabilistic graphical model and show how can it be efficiently solved. Results are assessed on three very
different types of video sequences, showing a turbulent flow of particles, bacteria growth and on-coming traffic
headlights.
1 INTRODUCTION
In the context of multiple target detection and tracking
the following definitions will help us to state the goal.
A target or object is some real moving entity, imaged
in a video sequence, that we want to follow in order to
analyze its motion for some purpose (like people and
vehicles for surveillance (Benfold and Reid, 2011),
particles in a turbulent flow for its characterization,
live micro-organisms for lineage studies (Liu et al.,
2009), (Li et al., 2007), or insects for behaviour stud-
ies (Laet et al., 2011). An observation or measure-
ment is the detection of an object as it appears in an
image. Note that a single observation can actually re-
sult from several objects whose observations overlap.
Data association is the process of relating objects
to observations. In the absence of merges/splits, each
target corresponds to a unique observation, and there-
fore targets are unambiguously identified as long as
the track construction is correct. In presence of oc-
clusions, mapping targets and observations is a dif-
ficult problem to solve. Moreover, tracking multiple
objects implies multiple object interactions and map-
ping between observations, which is costly to solve
optimally.
There are many works on visual multiple target
tracking. Only some of them try to maintain iden-
tities in addition to build tracks and, being the most
interesting type of result, we will focus on them in
the following review. The usual classification of past
works we have found is according to the strategy or
the techniques employed for data association, that is,
whether they are based on multiple hypothesis track-
ing (MHT) (Reid, 1979), joint probabilistic density
association (JPDA) (T.Fortmann et al., 1983), particle
filtering (Khan et al., 2003), integer linear program-
ming, graph algorithms (like min-cut and set cover),
inference on Bayesian networks (Nillius et al., 2006),
etc. Being MHT and JPDA the most widely used ap-
proaches, they present some drawbacks. As MHT
suffers from state space explosion when applied to
real videos, JPDA assumes a fixed number of targets,
and only considers measurements in the current frame
step.
Another relevant categorization criterion is
whether the tracking is batch (Zheng Wu and Betke,
2011), (Nillius et al., 2006) or online (Benfold
and Reid, 2011), that is, tracks (and identities) are
resolved once the whole sequence is available or it
is done as each frame is ready. Clearly, the batch
strategy has the advantage of working with all the
data along time and it makes sense to use it in
problems which do not require an online answer like
live cell tracking or turbulent flow analysis. However,
15
Rubio J., Serrat J. and López A. (2012).
MULTIPLE TARGET TRACKING AND IDENTITY LINKING UNDER SPLIT, MERGE AND OCCLUSION OF TARGETS AND OBSERVATIONS.
In Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods, pages 15-24
DOI: 10.5220/0003710600150024
Copyright
c
SciTePress