opencvlibrary/).
Human Tracking. For the experiments on hu-
man tracking, the training samples needed for
the cascade detector are taken from the Daim-
lerChrysler Pedestrian Classification Benchmark
Dataset (Munder and Gavrila, 2006). The pro-
posed algorithm has been tested on several
sequences from the CAVIAR database (http:
//homepages.inf.ed.ac.uk/rbf/CAVIAR/).
To demonstrate the feasibility of the concept de-
scribed in this paper, some results of the test sequence
ExitEnterCrossingPaths1cor are shown in Fig. 1. The
proposed method can successfully track two objects
even if they are occluded by a third person. Our own
Axis_Busstop sequence has been captured using a
PTZ camera that pans and zooms in to a person when
he is walking by a bus stop. During the course of
sequence the size of the objects changes significantly
and turning the camera causes occasionally a large
number of false measurements. Since there are also
several objects that are tracked, the results indicate
that the method is also able to track objects with a
moving camera (Fig. 1).
Face tracking. To evaluate the usefulness of our
method for tracking multiple faces, the method was
tested in conjunction with a basic face detector. For
face detection we used the face detector included in
the OpenCV library directly. The face sequence moti-
nas_multi_face_frontal used for testing is part of the
AVSS2007 dataset (http://www.elec.qmul.ac.
uk/staffinfo/andrea/avss2007_d.html). The
sequence in question includes many situations where
four targets repeatedly occlude each other while ap-
pearing and disappearing from the field of view of the
camera. The results show that the method is able to
track the objects after a total occlusion (Fig. 1).
All the tests were run on a regular Pentium 4
2.8GHz desktop PC using MATLAB. Based on the
studies on all test sets, the most computationally in-
tensive part of the method is usually detection. Also
computation of the Kalman gain (9) may take some
time depending on the number of measurements.
However, based on the performance study of the cur-
rent MATLAB implementation we are confident that
the method is feasible for different applications when
implemented in C/C++.
4 CONCLUSIONS
We have presented a new algorithm for tracking mul-
tiple objects based on detector responses. The method
utilizes the Kalman filter and Expectation Maximiza-
tion (EM) algorithms in order to update the state of
the objects and assign detector responses to them.
Current implementation uses a well-known cascade
classifier to detect the objects of interest. Preliminary
experiments conducted clearly indicate the usefulness
of the approach proposed in this paper.
REFERENCES
Dempster, A., Laird, N., and Rubin, D. (1977). Max-
imum likelihood from incomplete data via the EM
algorithm. Journal of the Royal Statistical Society,
39(1):1–38.
Fortmann, T., Bar-Shalom, Y., and Scheffe, M. (1983).
Sonar tracking of multiple targets using joint proba-
bilistic data association. IEEE-JOE, 8(3):173–184.
Gavrila, D. (2000). Pedestrian detection from a moving ve-
hicle. In Proc. ECCV, volume 1843 of LNCS, pages
37–49.
Hannuksela, J., Huttunen, S., Sangi, P., and Heikkilä, J.
(2007). Motion-based finger tracking for user inter-
action with mobile devices. In Proc. CVMP.
Huang, C., Wu, B., and Nevatia, R. (2008). Robust ob-
ject tracking by hierarchical association of detection
responses. In Proc. ECCV, volume 5303 of LNCS,
pages 788–801.
Huttunen, S. and Heikkilä, J. (2008). Multi-object tracking
using binary masks. In Proc. ICIP, pages 2640–2643.
Joo, S.-W. and Chellappa, R. (2007). A multiple-hypothesis
approach for multiobject visual tracking. IEEE-TIP,
16:2849–2854.
Kalman, R. E. (1960). A new approach to linear filtering
and prediction problems. Trans. of the ASME-Journal
of Basic Engineering, 82:35–45.
Leibe, B., Schindler, K., and Van Gool, L. (2007). Coupled
detection and trajectory estimation for multi-object
tracking. In Proc. IEEE ICCV, pages 1–8.
Lienhart, R. and Maydt, J. (2002). An extended set of haar-
like features for rapid object detection. In Proc. ICIP,
volume 1, pages 900–903.
Mikolajczyk, K., Schmid, C., and Zisserman, A. (2004).
Human detection based on a probabilistic assembly of
robust part detectors. In Proc. ECCV, volume 3021 of
LNCS, pages 69–82.
Munder, S. and Gavrila, D. (2006). An experimen-
tal study on pedestrian classification. IEEE-TPAMI,
28(11):1863–1868.
Reid, D. (1979). An algorithm for tracking multiple targets.
IEEE-TAC, 24(6):843–854.
Singh, V. K., Wu, B., and Nevatia, R. (2008). Pedestrian
tracking by associating tracklets using detection resid-
uals. In Proc. IEEE WMVC, pages 1–8.
Viola, P. and Jones, M. (2001). Rapid object detection using
a boosted cascade of simple features. In Proc. IEEE
CVPR, volume 1, pages 511–518.
Wu, B. and Nevatia, R. (2005). Detection of multiple, par-
tially occluded humans in a single image by bayesian
combination of edgelet part detectors. In Proc. IEEE
ICCV, volume 1, pages 90–97.
MULTI-OBJECT TRACKING BASED ON SOFT ASSIGNMENT OF DETECTION RESPONSES
301