APPEARANCE-BASED HUMAN GALLERY CONSTRUCTION FROM VIDEO

Kyongil Yoon, Yaser Yacoob, David Harwood, Larry Davis

Abstract

An approach for constructing a dynamic gallery of people observed in a video stream is described. We consider two scenarios that require determining the number and identity of participants: outdoor surveillance and meeting rooms. In these applications face identification is typically not feasible due to the low resolution across the face. The proposed approach automatically computes an appearance model based on the clothing of people and employs this model in constructing and matching the gallery of participants. The appearance model uses color/path-length profile and a robust distance measure based on Kernel Density Estimation (KDE) and Kullback-Leibler (KL) distance, to evaluate similarity between people and add models to the gallery. A one-to-one constraint is enforced to correctly match instances to models at each frame. In the meeting room scenario we exploit the fact that the relative locations of subjects are likely to remain unchanged for the whole sequence.

References

  1. Alexander, D. C. and Buxton, B. F. (2001). Statistical modeling of colour data. International Journal of Computer Vision, 44(2):87-109.
  2. Coleman, D., Holland, P., Kaden, N., Klema, V., and Peters, S. C. (1980). A system of subroutines for iteratively reweighted least squares computations. ACM Trans. Math. Softw., 6(3):327-336.
  3. Cover, T. M. and Thomas, J. A. (1991). Elements of Information Theory. New York: Wiley.
  4. Duda, R. O., Stork, D., and Hart, P. E. (2000). Pattern Classification. John Wiley and Sons Inc.
  5. Elgammal, A., Duraiswami, R., Harwood, D., and Davis, L. S. (2002). Background and foreground modeling using non-parametric kernel density estimation for visual surveillance. Proceedings of the IEEE, 90(7):1151-1163.
  6. Fox, J. (2002). Robust regression: Appendix to an r and s-plus companion to applied regression.
  7. Huber, P. J. (1977). Robust Statistical Procedures. Society for Industrial and Applied Mathematics.
  8. Kapur, J. N. and Kesavan, H. K. (1992). Entropy Optimization Principles with Applications. Academic Press.
  9. Kullback, S. and Leibler, R. A. (1951). On information and sufficiency. Annals of Mathematical Statistics, 22:79- 86.
  10. Nakajima, C., Pontil, M., Heisele, B., and Poggio, T. (2003). Full-body person recognition system. Pattern Recognition, 36(9):1997-2006.
  11. Press, W. H., Teukolsky, S. A., Vetterling, W. T., and Flannery, B. P. (1988). Numerical Recipes in C: The Art of Scientific Computing. Cambridge University Press.
  12. Silverman, B. W. (1986). Density estimation for statistics and data analysis. Chapman & Hall, New York.
  13. Viola, P. A. and Jones, M. J. (2001). Rapid object detection using a boosted cascade of simple features. In CVPR (1), pages 511-518.
Download


Paper Citation


in Harvard Style

Yoon K., Yacoob Y., Harwood D. and Davis L. (2007). APPEARANCE-BASED HUMAN GALLERY CONSTRUCTION FROM VIDEO . In Proceedings of the Second International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2007) ISBN 978-989-8111-13-5, pages 332-337. DOI: 10.5220/0002142103320337


in Bibtex Style

@conference{sigmap07,
author={Kyongil Yoon and Yaser Yacoob and David Harwood and Larry Davis},
title={APPEARANCE-BASED HUMAN GALLERY CONSTRUCTION FROM VIDEO},
booktitle={Proceedings of the Second International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2007)},
year={2007},
pages={332-337},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002142103320337},
isbn={978-989-8111-13-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Second International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2007)
TI - APPEARANCE-BASED HUMAN GALLERY CONSTRUCTION FROM VIDEO
SN - 978-989-8111-13-5
AU - Yoon K.
AU - Yacoob Y.
AU - Harwood D.
AU - Davis L.
PY - 2007
SP - 332
EP - 337
DO - 10.5220/0002142103320337