set with a near-duplicate ground truth is identified.
We are currently working with the CoPHiR collec-
tion (Bolettieri et al., 2009) (10
8
images) to establish
whether the figures produced here are consistent.
The variation among the different distance metrics
is a novel observation. Characterisations are normally
used with either L
1
or L
2
distance, whereas in the
majority of cases either Cosine or SED/JSD performs
best. These metrics give a closer match according to
the correlation of values within the characterisations,
rather than differences in their absolute magnitude.
However the differences among all the characterisa-
tions do not seem to suggest any general rules about
the best metric to use in different contexts, which re-
quires further investigation.
ACKNOWLEDGEMENTS
We would like to thank Richard Martin and Karina
Kubiak-Ossowska of the University of Strathclyde for
help with access to the ARCHIE-WeSt HPC facilities
necessary to achieve some of the analysis.
Franco Alberto Cardillo was supported by the Na-
tional Research Council of Italy (CNR) for a Short-
term Mobility Fellowship (STM), which funded a
stay at the University of Strathclyde in Glasgow (UK)
where part of this work was done.
Richard Connor was supported by a symmet-
ric National Research Council of Italy (CNR) for a
Short-term Mobility Fellowship (STM), no. 33313,
13/05/2015, which funded a stay at the Consiglio
Nazionale delle Ricerche, Pisa, where the work was
further progressed.
REFERENCES
Bober, M. (2001). Mpeg-7 visual shape descriptors. IEEE
Transactions on circuits and systems for video tech-
nology, 11(6):716–719.
Bolettieri, P., Esuli, A., Falchi, F., Lucchese, C., Perego,
R., Piccioli, T., and Rabitti, F. (2009). Cophir: a test
collection for content-based image retrieval. CoRR,
abs/0905.4627.
Chum, O., Philbin, J., Isard, M., and Zisserman, A. (2007).
Scalable near identical image and shot detection. In
Proceedings of the 6th ACM international conference
on Image and video retrieval, pages 549–556. ACM.
Connor, R. (2015). Mir-flickr near-duplicate data. mir-
flickr-near-duplicates.appspot.com.
Connor, R., Cardillo, F., MacKenzie-Leigh, S., and Moss,
R. (2015). Identification of mir-flickr near-duplicate
images. In 10th International Conference on Com-
puter Vision Theory and Applications.
Connor, R. and Moss, R. (2012). A multivariate correla-
tion distance for vector spaces. In Navarro, G. and
Pestov, V., editors, Similarity Search and Applica-
tions, volume 7404 of Lecture Notes in Computer Sci-
ence, pages 209–225. Springer Berlin Heidelberg.
Connor, R., Simeoni, F., Iakovos, M., and Moss, R. (2011).
A bounded distance metric for comparing tree struc-
ture. Inf. Syst., 36(4):748–764.
Foo, J., Sinha, R., and Zobel, J. (2006). Discovery of image
versions in large collections. In Cham, T.-J., Cai, J.,
Dorai, C., Rajan, D., Chua, T.-S., and Chia, L.-T., edi-
tors, Advances in Multimedia Modeling, volume 4352
of Lecture Notes in Computer Science, pages 433–
442. Springer Berlin Heidelberg.
Huiskes, M. J. and Lew, M. S. (2008). The MIR Flickr
retrieval evaluation. In MIR ’08: Proceedings of the
2008 ACM International Conference on Multimedia
Information Retrieval, New York, NY, USA. ACM.
Huiskes, M. J., Thomee, B., and Lew, M. S. (2010). New
trends and ideas in visual concept detection: The MIR
Flickr retrieval evaluation initiative. In MIR ’10: Pro-
ceedings of the 2010 ACM International Conference
on Multimedia Information Retrieval, pages 527–536,
New York, NY, USA. ACM.
ISO-15938. Mpeg-7 multimedia content description inter-
face.
Jinda-Apiraksa, A., Vonikakis, V., and Winkler, S. (2013).
California-nd: An annotated dataset for near-duplicate
detection in personal photo collections. In Quality of
Multimedia Experience (QoMEX), 2013 Fifth Interna-
tional Workshop on, pages 142–147. IEEE.
Lin, J. (1991). Divergence measures based on the shannon
entropy. Information Theory, IEEE Transactions on,
37(1):145–151.
Niu, X.-m. and Jiao, Y.-h. (2008). An overview of percep-
tual hashing. Acta Electronica Sinica, 36(7):1405–
1411.
Oliva, A. and Torralba, A. (2001). Modeling the shape
of the scene: A holistic representation of the spatial
envelope. International Journal of Computer Vision,
42(3):145–175.
Ventura Royo, C. (2010). Image-based query by example
using mpeg-7 visual descriptors.
Vonikakis, V., Jinda-Apiraksa, A., and Winkler, S. (2014).
Photocluster - a multi-clustering technique for near-
duplicate detection in personal photo collections. In
Proc. of the 9th International Conference on Com-
puter Vision Theory and Applications, pages 153–161.
Won, C. S., Park, D. K., and Park, S.-J. (2002). Efficient use
of mpeg-7 edge histogram descriptor. Etri Journal,
24(1):23–30.
VISAPP 2016 - International Conference on Computer Vision Theory and Applications
654