fications to the BT or to ignore the suggestion.
The last use case we envision, “suggesting cluster
merge”, is complementary to the previous one. This
time, the user is offered the possibility of merging
clusters containing images that are semantically sim-
ilar but visually different (and are therefore scattered
in different points of the original BT).
Figure 8: Suggesting cluster merging.
For example, consider again Fig. 2, where the
system suggested the user alternative browsing di-
rections: the system could also offer the user the
option to persistently merge suggested clusters with
the current one. Clearly, such re-organization would
move images in suggested clusters into a single clus-
ter, that may be subsequently divided into several sub-
clusters. In the specific case of Fig. 8, the system
allows the user to merge the currently opened clus-
ter (brown bear) with clusters containing polar bear
and/or black bear images, respectively.
3 CONCLUSIONS
In this position paper we advocate the combined use
of visual content and semantics as a critical binomial
for effectiveand efficient browsingof large image col-
lections, so as to satisfy users’ expectations in quickly
locating images of interest. We have elaborated our
reasoning through a set of relevant use cases on a real
browsing system, namely PIBE. Such use cases tes-
tify how semantics can help visual content and vice
versa, both in assisting the user during her exploration
sessions and in improving image organization. We fi-
nally note that, although use cases presented in this
paper all used simple labels (free tags), the underly-
ing model exploited in the improved version of PIBE
allows the use of semantic tags, i.e., paths extracted
from a variety of existing taxonomies (semantic di-
mensions); this is known to solve problems of ambi-
guity, polysemy, etc. that plague solutions based on
free tags (Hearst, 2006; Bartolini, 2009).
In the future, we plan to provide a thorough exper-
imental analysis and comparison evaluation of PIBE
on real users with large image benchmarks.
REFERENCES
Bartolini, I. (2009). Multi-faceted Browsing Interface for
Digital Photo Collections. In Proc. of CBMI 2009,
pages 65–72, Chania, Crete.
Bartolini, I. and Ciaccia, P. (2008). Imagination: Exploiting
Link Analysis for Accurate Image Annotation. Adap-
tive Multimedia Retrieval: Retrieval, User, and Se-
mantics (LNCS), 4918:322–44.
Bartolini, I., Ciaccia, P., and Patella, M. (2004). The PIBE
Personalizable image Browsing Engine. In Proc. of
CVDB 2004, pages 43–50, Paris, France.
Bartolini, I., Ciaccia, P., and Patella, M. (2006). Adaptively
Browsing Image Databases with PIBE. Multimedia
Tools Appl., 31(3):269–286.
Dakka, W., Ipeirotis, P. G., and Wood, K. R. (2005). Au-
tomatic Construction of Multifaceted Browsing Inter-
faces. In Proc. of CIKM 2005, pages 768–775, Bre-
men, Germany.
Gao, B., Liu, T.-Y., Qin, T., Zheng, X., Cheng, Q., and
Ma, W.-Y. (2005). Web Image Clustering by Con-
sistent Utilization of Visual Features and Surrounding
Texts. In Proc. of ACM MM 2005, pages 112–121,
New York, USA.
Hearst, M. A. (2006). Clustering Versus Faceted Cate-
gories for Information Exploration. Commun. ACM,
49(4):59–61.
Heesch, D. (2008). A Survey of browsing Models for
Content-based Image Retrieval. Multimedia Tools
Appl., 40(2):261–284.
Miller, G. A. (1995). WordNet: A Lexical Database for
English. Commun. ACM, 38(11):39–41.
Pan, J.-Y., Yang, H.-J., Faloutsos, C., and Duygulu, P.
(2004). Automatic Multimedia Cross-modal Corre-
lation Discovery. In Proc. of SIGKDD 2004, pages
653–658, Seattle, USA.
Santini, S. and Jain, R. (2000). Integrated Browsing and
Querying for Image Databases. IEEE MultiMedia,
7(3):26–39.
Schaefer, G. (2010). A Next Generation Browsing Environ-
ment for Large Image Repositories. Multimedia Tools
Appl., 47(1):105–120.
Smeulders, A. W. M., Worring, M., Santini, S., Gupta,
A., and Jain, R. (2000). Content-based Image Re-
trieval at the End of the Early Years. IEEE TPAMI,
22(12):1349–1380.
Yee, K.-P., Swearingen, K., Li, K., and Hearst, M. A.
(2003). Faceted Metadata for Image Search and
Browsing. In Proc. of CHI 2003, pages 401–408, Ft.
Lauderdale, Florida, USA.
ContentMeetsSemantics:SmarterExplorationofImageCollections-PresentationofRelevantUseCases
191