Event Clustering of Lifelog Image Sequence using Emotional and Image Similarity Features

Photchara Ratsamee, Yasushi Mae, Masaru Kojima, Mitsuhiro Horade, Kazuto Kamiyama, Tatsuo Arai

Abstract

Lifelog image clustering is the process of grouping images into events based on image similarities. Until now, groups of images with low variance can be easily clustered, but clustering images with high variance is still a problem. In this paper, we challenge the problem of high variance, and present a methodology to accurately cluster images into their corresponding events. We introduce a new approach based on rankorder distance techniques using a combination of image similarity and an emotional feature measured from a biosensor. We demonstrate that emotional features along with rank-order distance based clustering can be used to cluster groups of images with low, medium, and high variance. Experimental evidence suggests that compared to average clustering precision rate (65.2%) from approaches that only consider image visual features, our technique achieves a higher precision rate (85.5%) when emotional features are integrated.

References

  1. Bay, H., Tuytelaars, T., and Van Gool, L. (2006). Surf: Speeded up robust features. pages 404-417. Springer.
  2. Blighe, M., O'Connor, N. E., Rehatschek, H., and Kienast, G. (2008). Identifying different settings in a visual diary. In Image Analysis for Multimedia Interactive Services, 2008. WIAMIS'08. Ninth International Workshop on, pages 24-27. IEEE.
  3. Bush, V. (2001). As we may think. pages 141-59.
  4. Conaire, C. O., O'Connor, N. E., Smeaton, A. F., and Jones, G. J. (2007). Organising a daily visual diary using multifeature clustering. In Electronic Imaging 2007, volume 6506, pages 65060C-65060C-11. International Society for Optics and Photonics.
  5. Doherty, A. R., Byrne, D., Smeaton, A. F., Jones, G., and Hughes, M. (2008). Investigating keyframe selection methods in the novel domain of passively captured visual lifelogs. In Proceedings of the international conference on Content-based image and video retrieval, pages 259-268. ACM.
  6. Doherty, A. R., Smeaton, A. F., Lee, K., and Ellis, D. P. (2007). Multimodal segmentation of lifelog data. In Large Scale Semantic Access to Content (Text, Image, Video, and Sound), pages 21-38. LE CENTRE DE HAUTES ETUDES INTERNATIONALES D'INFORMATIQUE DOCUMENTAIRE.
  7. Hodges, S., Williams, L., Berry, E., Izadi, S., Srinivasan, J., Butler, A., Smyth, G., Kapur, N., and Wood, K. (2006). Sensecam: A retrospective memory aid. UbiComp: Ubiquitous Computing, pages 177-193.
  8. Kulic, D., Takano, W., and Nakamura, Y. (2011). Towards lifelong learning and organization of whole body motion patterns. In Robotics Research, pages 87-97. Springer.
  9. Meng, J., Juan, Y., and Chang, S.-F. (1995). Scene change detection in an mpeg-compressed video sequence. In IS&T/SPIE's Symposium on Electronic Imaging: Science & Technology, pages 14-25. International Society for Optics and Photonics.
  10. Noris, B., Barker, M., Nadel, J., Hentsch, F., Ansermet, F., and Billard, A. (2011). Measuring gaze of children with autism spectrum disorders in naturalistic interactions. In International Conference of Engineering in Medicine and Biology Society (EMBC), pages 5356- 5359. IEEE.
  11. Wagner, J., Kim, J., and André, E. (2005). From physiological signals to emotions: Implementing and comparing selected methods for feature extraction and classification. In International Conference on Multimedia and Expo, pages 940-943. IEEE.
  12. Wang, P. and Smeaton, A. F. (2012). Semantics-based selection of everyday concepts in visual lifelogging. International Journal of Multimedia Information Retrieval, 1(2):87-101.
  13. Yang, C. K. and Cheng, S. C. (2012). A novel algorithm for key frames selection. Applied Mechanics and Materials, 182:2025-2029.
  14. Zhu, C., Wen, F., and Sun, J. (2011). A rank-order distance based clustering algorithm for face tagging. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 481-488. IEEE.
Download


Paper Citation


in Harvard Style

Ratsamee P., Mae Y., Kojima M., Horade M., Kamiyama K. and Arai T. (2014). Event Clustering of Lifelog Image Sequence using Emotional and Image Similarity Features . In Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2014) ISBN 978-989-758-003-1, pages 618-624. DOI: 10.5220/0004741206180624


in Bibtex Style

@conference{visapp14,
author={Photchara Ratsamee and Yasushi Mae and Masaru Kojima and Mitsuhiro Horade and Kazuto Kamiyama and Tatsuo Arai},
title={Event Clustering of Lifelog Image Sequence using Emotional and Image Similarity Features},
booktitle={Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2014)},
year={2014},
pages={618-624},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004741206180624},
isbn={978-989-758-003-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2014)
TI - Event Clustering of Lifelog Image Sequence using Emotional and Image Similarity Features
SN - 978-989-758-003-1
AU - Ratsamee P.
AU - Mae Y.
AU - Kojima M.
AU - Horade M.
AU - Kamiyama K.
AU - Arai T.
PY - 2014
SP - 618
EP - 624
DO - 10.5220/0004741206180624