Authors: Fabio Clarizia ; Francesco Colace ; Massimo De Santo ; Luca Greco and Paolo Napoletano

Affiliation: University of Salerno, Italy

ISBN: 978-989-8425-79-9

Keyword(s): Text retrieval, Query expansion, Term extraction, Probabilistic topic model, Relevance feedback.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Computational Intelligence ; Concept Mining ; Evolutionary Computing ; Information Extraction ; Interactive and Online Data Mining ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Mining Text and Semi-Structured Data ; Soft Computing ; Symbolic Systems ; Web Mining

Abstract: It is well known that one way to improve the accuracy of a text retrieval system is to expand the original query with additional knowledge coded through topic-related terms. In an interactive environment, the expansion, which is usually represented as a list of words, is extracted from documents whose relevance is known thanks to user feedback. In this paper we argue that the accuracy of a text retrieval system can be improved by employing a query expansion method based on a mixed Graph of Terms representation instead of a method based on a simple list of words. The graph, which is composed of a directed and an undirected subgraph, can be automatically extracted from a small set of only relevant documents (namely the user feedback) using a method for term extraction based on the probabilistic Topic Model. The proposed method has been evaluated by comparing it with two less complex structures: one represented as a set of pairs of words and another that is a simple list of words.
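
The idea of expanding a query from a graph of terms rather than from a flat word list can be illustrated with a minimal sketch. It is not the authors' implementation: the class name `MixedTermGraph`, the method names, the `min_weight` threshold and all edge weights below are hypothetical, and the sketch simply assumes that every edge carries a numeric weight and that expansion adds the direct neighbours of each query term. Extracting the graph with a probabilistic topic model is not shown.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class MixedTermGraph:
    """Hypothetical container for weighted directed and undirected term edges."""
    directed: dict = field(default_factory=lambda: defaultdict(dict))
    undirected: dict = field(default_factory=lambda: defaultdict(dict))

    def add_directed(self, src, dst, weight):
        self.directed[src][dst] = weight

    def add_undirected(self, a, b, weight):
        # Store both directions so that lookups are symmetric.
        self.undirected[a][b] = weight
        self.undirected[b][a] = weight

    def expand(self, query_terms, min_weight=0.0):
        """Return {term: weight} for the query terms plus their direct neighbours."""
        expanded = {t: 1.0 for t in query_terms}  # original terms keep weight 1.0
        for term in query_terms:
            neighbours = {**self.undirected.get(term, {}),
                          **self.directed.get(term, {})}
            for other, w in neighbours.items():
                if w >= min_weight and other not in query_terms:
                    expanded[other] = max(expanded.get(other, 0.0), w)
        return expanded


# Illustrative weights only; they are not taken from the paper.
graph = MixedTermGraph()
graph.add_undirected("retrieval", "query", 0.8)
graph.add_directed("retrieval", "ranking", 0.6)
graph.add_directed("query", "expansion", 0.3)

result = graph.expand(["retrieval"], min_weight=0.5)
print(result)
```

In this sketch a flat word list corresponds to dropping the edges and keeping only the term weights, which is the simplest of the structures the abstract compares against.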

CC BY-NC-ND 4.0

Paper citation in several formats:
Clarizia, F.; Colace, F.; De Santo, M.; Greco, L. and Napoletano, P. (2011). A NOVEL QUERY EXPANSION TECHNIQUE BASED ON A MIXED GRAPH OF TERMS. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2011) ISBN 978-989-8425-79-9, pages 84-93. DOI: 10.5220/0003660500840093

@conference{kdir11,
author={Fabio Clarizia and Francesco Colace and Massimo {De Santo} and Luca Greco and Paolo Napoletano},
title={A NOVEL QUERY EXPANSION TECHNIQUE BASED ON A MIXED GRAPH OF TERMS},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2011)},
year={2011},
pages={84-93},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003660500840093},
isbn={978-989-8425-79-9},
}

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2011)
TI - A NOVEL QUERY EXPANSION TECHNIQUE BASED ON A MIXED GRAPH OF TERMS
SN - 978-989-8425-79-9
AU - Clarizia, F.
AU - Colace, F.
AU - De Santo, M.
AU - Greco, L.
AU - Napoletano, P.
PY - 2011
SP - 84
EP - 93
DO - 10.5220/0003660500840093
ER -