Image Semantic Distance Metric Learning Approach for Large-scale Automatic Image Annotation

Cong Jin; Shu-Wei Jin

doi:10.5220/0005729902770283

Image Semantic Distance Metric Learning Approach for Large-scale Automatic Image Annotation

Cong Jin, Shu-Wei Jin

2016

Abstract

Learning an effective semantic distance measure is very important for the practical application of image analysis and pattern recognition. Automatic image annotation (AIA) is a task of assigning one or more semantic concepts to a given image and a promising way to achieve more effective image retrieval and analysis. Due to the semantic gap between low-level visual features and high-level image semantic, the performances of some image distance metric learning (IDML) algorithms only using low-level visual features is not satisfactory. Since there is the diversity and complexity of large-scale image dataset, only using visual similarity to learn image distance is not enough. To solve this problem, in this paper, the semantic labels of the training image set participate into the image distance measure learning. The experimental results confirm that the proposed image semantic distance metric learning (ISDML) can improve the efficiency of large-scale AIA approach and achieve better annotation performance than the other state-of-the art AIA approaches.

References

Chen, G., Song, Y., Wang, F., Zhang C., 2008. Semisupervised multilabel learning by solving a sylvester equation. SIAM International Conference on Data Mining, 410-419
Chua, T.S., Tang, J., Hong, R., et al., (2009). NUS-WIDE: a real-world web image database from National University of Singapore. ACM International Conference on Image and Video Retrieval, 48
Feng, S.L., Manmatha, R., Lavrenko, V., 2004. Multiple bernoulli relevance models for image and video annotation, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), II-1002-II-1009, Vol.2, 1002-1009
Goldberger, J., Roweis, S., Hinton, G., Salakhutdinov, R., 2005. Neighbourhood components analysis. Advances in Neural Information Processing Systems, 17, 103- 110
Grubinger, M., 2007. Analysis and evaluation of visual information systems performance. PhD thesis, Victoria University, Melbourne, Australia
Guillaumin, M., Mensink, T., Verbeek, J., Schmid, C., 2009. Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation, IEEE 12th International Conference on Computer Vision. 309-316
Jin, C., Guo, J.L., 2014. Image semantic annotation approach based on the feature matching. Springer, Advances in Intelligent Systems and Computing, Vol.250, 281-288
Jin, C. Jin, S.W., 2015. Automatic image annotation using feature selection based on improving quantum particle swarm optimization. Signal Processing, 109, 172-181
Jin, C., Liu, J.A., Guo, J.L., 2015. A hybrid model based on mutual information and support vector machine for automatic image annotation. Artificial Intelligence Perspectives and Applications. Springer, 347, 29-38
Lasmar, N.E., Berthoumieu, Y., 2014. Gaussian copula multivariate modeling for texture image retrieval using wavelet transforms. IEEE Transactions on Image Processing, 23(5), 2246-2261
Liu, S., Yan, S.C., Zhang, T.Z., Xu, C.S., Liu, J., Lu, H.Q., 2012. Weakly supervised graph propagation towards collective image parsing, IEEE Transactions on Multimedia, 14(2), 361-373
Makadia, A., Pavlovic, V., Kumar, S., 2008. A new baseline for image annotation. Computer VisionECCV 2008. Springer Berlin Heidelberg, 316-329
Nakayama, H., 2011. Linear distance metric learning for large-scale generic image recognition. PhD thesis, The University of Tokyo, Japan
Nguyen, C.T., Kaothanthong, N., Tokuyama, T., Phan, X.H., 2013. A feature-word-topic model for image annotation and retrieval. ACM Transactions on the Web, 7(3), 1-12
Rahmani, R., Goldman, S., 2006. Missl: Multiple-instance semi-supervised learning, International Conference on Machine Learning, 705-712
Shi, J., Malik, J., 2000. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8), 888-905
Von Ahn, L., Dabbis, L., 2004. Labeling images with a computer game. SIGCHI Conference on Human Factors in Computing Systems. ACM, 319-326
Wang, C., Zhang, L., Zhang, H.J., 2008. Learning to reduce the semantic gap in web image retrieval and annotation. The 31st International ACM SIGIR Conference on Research and Development in Information Retrieval, Singapore, 355-362
Watcharapinchai, N., Aramvith, S., Siddhichai, S., 2011. Two-probabilistic latent semantic model for image annotation and retrieval, Lecture Notes in Computer Science, vol.6468, 359-369
Yashaswi, V., Jawahar, C.V., 2012. Image annotation using metric learning in semantic neighbourhoods. ECCV(3), 836-849
Zhou, D., Bousquet, O., Lal, T.N., et al., 2004. Learning with local and global consistency. Advances in Neural Information Processing Systems, 16(16), 321- 328
Zhuang, Y., Liu, X., Pan, Y., 1999. Apply semantic template to support content-based image retrieval. Lecture Notes in Computer Science. 3972, 442-449

Download

Paper Citation

in Harvard Style

Jin C. and Jin S. (2016). Image Semantic Distance Metric Learning Approach for Large-scale Automatic Image Annotation . In Proceedings of the International Conference on Internet of Things and Big Data - Volume 1: IoTBD, ISBN 978-989-758-183-0, pages 277-283. DOI: 10.5220/0005729902770283

in Bibtex Style

@conference{iotbd16,
author={Cong Jin and Shu-Wei Jin},
title={Image Semantic Distance Metric Learning Approach for Large-scale Automatic Image Annotation},
booktitle={Proceedings of the International Conference on Internet of Things and Big Data - Volume 1: IoTBD,},
year={2016},
pages={277-283},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005729902770283},
isbn={978-989-758-183-0},
}

in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Internet of Things and Big Data - Volume 1: IoTBD,
TI - Image Semantic Distance Metric Learning Approach for Large-scale Automatic Image Annotation
SN - 978-989-758-183-0
AU - Jin C.
AU - Jin S.
PY - 2016
SP - 277
EP - 283
DO - 10.5220/0005729902770283