Author:
Simone Santini
Affiliation:
Universidad Autónoma de Madrid, Spain
Keyword(s):
Novelty, Diversity, Redundancy in Query Results, Evaluation.
Related Ontology Subjects/Areas/Topics:
Multimedia; Multimedia Databases, Indexing, Recognition and Retrieval; Multimedia Systems and Applications; Telecommunications
Abstract:
This paper studies the formalization and the use of the concepts of novelty and diversity to diversify the result set of a multimedia query, avoiding the presence of uninformative results. First, we review and adapt several diversity measures proposed in the information retrieval literature. The problem of maximizing diversity being NP-complete, we propose a general greedy algorithm (dependent on a scoring function) for finding an approximate solution, and instantiate it using three scenarios: a probabilistic one, a fuzzy one, and a geometric one. Finally, we perform tests on two data sets, one in which retrieval is based on annotations and the other in which retrieval is purely visual.
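The greedy scheme mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm, only a generic greedy selection loop parameterized by a scoring function, under the assumption that the score of a candidate depends on the items already selected. The names greedy_diversify and make_geometric_score, the trade_off parameter, and the sample points are all hypothetical; the geometric score used here (relevance blended with the distance to the nearest selected item) is one plausible choice, not necessarily the one the paper uses.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

Point = Tuple[float, float]


def greedy_diversify(
    candidates: Sequence[Point],
    k: int,
    score: Callable[[Point, List[Point]], float],
) -> List[Point]:
    """Pick up to k items, each time adding the candidate whose score,
    given the items already selected, is highest."""
    selected: List[Point] = []
    remaining = list(candidates)
    while remaining and len(selected) < k:
        best = max(remaining, key=lambda c: score(c, selected))
        selected.append(best)
        remaining.remove(best)
    return selected


def make_geometric_score(
    relevance: Dict[Point, float], trade_off: float = 0.5
) -> Callable[[Point, List[Point]], float]:
    """Hypothetical geometric score: blend relevance with the Euclidean
    distance to the closest already-selected item (assumed novelty proxy)."""

    def score(item: Point, selected: List[Point]) -> float:
        if not selected:
            # Nothing selected yet: rank by relevance alone.
            return relevance[item]
        novelty = min(math.dist(item, s) for s in selected)
        return trade_off * relevance[item] + (1.0 - trade_off) * novelty

    return score


# Illustrative data (made up): two near-duplicate results and one distant one.
points = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0)]
relevance = {(0.0, 0.0): 1.0, (0.1, 0.0): 0.9, (5.0, 5.0): 0.5}
result = greedy_diversify(points, 2, make_geometric_score(relevance))
```

With these sample values, the most relevant point is picked first, and the second pick is the distant point rather than the near-duplicate, even though the near-duplicate has higher relevance. Swapping in a different score function (for instance a probabilistic or fuzzy one) leaves the selection loop unchanged, which is the appeal of keeping the greedy step dependent only on a scoring function.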