for our framework and then proposed a preliminary
experimental evaluation both in terms of efficiency
and effectiveness, considering as case study a corpus
of job announcements in Italy.
The experimental evaluation showed that our ap-
proach offers good results in terms of retrieval pre-
cision and recall. Furthermore, we evaluated the ex-
ecution time of the core data structure - the kd-tree-
both in its sequential version and in a distributed one,
showing that the proposed approach is easily scalable.
REFERENCES
Albanese, M., Capasso, P., Picariello, A., and Rinaldi,
A. M. (2005). Information retrieval from the web: an
interactive paradigm. Advances in Multimedia Infor-
mation Systems, pages 17–32.
Amato, F., De Santo, A., Gargiulo, F., Moscato, V., Persia,
F., Picariello, A., and Poccia, S. (2015a). Semindex:
an index for supporting semantic retrieval of docu-
ments. In Proceedings of the IEEE DESWeb ICDE
2015.
Amato, F., De Santo, A., Moscato, V., Picariello, A.,
Serpico, D., and Sperli, G. (2015b). A lexicon-
grammar based methodology for ontology population
in e-health applications. In The 9-th International
Conference on Complex, Intelligent, and Software In-
tensive Systems (CISIS-2015). Blumenau, Brazil.
Amato, F., Mazzeo, A., Moscato, V., and Picariello, A.
(2009). A system for semantic retrieval and long-
term preservation of multimedia documents in the e-
government domain. International Journal of Web
and Grid Services, 5(4):323–338.
Amato, F., Mazzeo, A., Moscato, V., and Picariello, A.
(2014). Exploiting cloud technologies and context in-
formation for recommending touristic paths. In In-
telligent Distributed Computing VII, pages 281–287.
Springer.
Amato, F., Mazzeo, A., Penta, A., and Picariello, A. (2008).
Knowledge representation and management for e-
government documents. IFIP International Federa-
tion for Information Processing, 280:31–40.
Bouchoucha, A., Liu, X., and Nie, J.-Y. (2014). Integrat-
ing multiple resources for diversified query expansion.
In Advances in Information Retrieval, pages 437–442.
Springer.
Buey, M. G., Garrido,
´
A. L., and Ilarri, S. (2014). An ap-
proach for automatic query expansion based on nlp
and semantics. In Database and Expert Systems Ap-
plications, pages 349–356. Springer.
Carpineto, C. and Romano, G. (2012). A survey of auto-
matic query expansion in information retrieval. ACM
Computing Surveys (CSUR), 44(1):1.
Castells, P., Fernandez, M., and Vallet, D. (2007). An adap-
tation of the vector-space model for ontology-based
information retrieval. Knowledge and Data Engineer-
ing, IEEE Transactions on, 19(2):261–272.
Colace, F., De Santo, M., Greco, L., and Napoletano, P.
(2015). Weighted word pairs for query expansion.
Information Processing & Management, 51(1):179–
193.
Dalton, J., Dietz, L., and Allan, J. (2014). Entity query fea-
ture expansion using knowledge base links. In Pro-
ceedings of the 37th international ACM SIGIR confer-
ence on Research & development in information re-
trieval, pages 365–374. ACM.
Ermakova, L., Mothe, J., and Ovchinnikova, I. (2014).
Query expansion in information retrieval: What can
we learn from a deep analysis of queries? In Inter-
national Conference on Computational Linguistics-
Dialogue 2014, volume 20, pages pp–162.
Faloutsos, C. and Lin, K.-I. (1995). FastMap: A fast al-
gorithm for indexing, data-mining and visualization
of traditional and multimedia datasets, volume 24.
ACM.
Fern
´
andez, M., Cantador, I., L
´
opez, V., Vallet, D., Castells,
P., and Motta, E. (2011). Semantically enhanced in-
formation retrieval: an ontology-based approach. Web
Semantics: Science, Services and Agents on the World
Wide Web, 9(4):434–452.
Furnas, G. W., Landauer, T. K., Gomez, L. M., and Du-
mais, S. T. (1987). The vocabulary problem in human-
system communication. Communications of the ACM,
30(11):964–971.
Huang, J. X., Miao, J., and He, B. (2013). High per-
formance query expansion using adaptive co-training.
Information Processing & Management, 49(2):441–
453.
Jain, V. and Singh, M. (2013). Ontology based informa-
tion retrieval in semantic web: A survey. International
Journal of Information Technology and Computer Sci-
ence (IJITCS), 5(10):62.
Maron, M. E. and Kuhns, J. L. (1960). On relevance, proba-
bilistic indexing and information retrieval. Journal of
the ACM (JACM), 7(3):216–244.
Moscato, V., Picariello, A., and Rinaldi, A. M. (2010a).
A combined relevance feedback approach for user
recommendation in e-commerce applications. In
Advances in Computer-Human Interactions, 2010.
ACHI’10. Third International Conference on, pages
209–214. IEEE.
Moscato, V., Picariello, A., and Rinaldi, A. M. (2010b).
A recommendation strategy based on user behavior
in digital ecosystems. In Proceedings of the Interna-
tional Conference on Management of Emergent Digi-
tal EcoSystems, pages 25–32. ACM.
Pal, D., Mitra, M., and Datta, K. (2014). Improving
query expansion using wordnet. Journal of the As-
sociation for Information Science and Technology,
65(12):2469–2478.
Resnik, P. (1999). Semantic similarity in a taxonomy:
An information-based measure and its application to
problems of ambiguity in natural language.
Resnik, P. (2011). Semantic similarity in a taxonomy:
An information-based measure and its application to
problems of ambiguity in natural language. CoRR,
abs/1105.5444.
DATA2015-4thInternationalConferenceonDataManagementTechnologiesandApplications
352