A SEMANTIC CLUSTERING APPROACH FOR INDEXING DOCUMENTS

Daniel Osuna-Ontiveros; Ivan Lopez-Arevalo; Victor Sosa-Sosa

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

A SEMANTIC CLUSTERING APPROACH FOR INDEXING DOCUMENTS

Topics: Clustering and Classification Methods; Information Extraction; Process Mining

In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 0IC3K, 280-285, 2011 , Paris, France

Authors: Daniel Osuna-Ontiveros ; Ivan Lopez-Arevalo and Victor Sosa-Sosa

Affiliation: CINVESTAV - IPN, Mexico

Keyword(s): Indexing models, Information retrieval, Semantic clustering, Semantic search.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Process Mining ; Symbolic Systems

Abstract: Information retrieval (IR) models process documents for preparing them for search by humans or computers. In the early models, the general idea was making a lexico-syntactic processing of documents, where the importance of the documents retrieved by a query is based on the frequency of its terms in the document. Another approach is return predefined documents based on the type of query the user make. Recently, some researchers have combined text mining techniques to enhance the document retrieval. This paper proposes a semantic clustering approach to improve traditional information retrieval models by representing topics associated to documents. This proposal combines text mining algorithms and natural language processing. The approach does not use a priori queries, instead clusters terms, where each cluster is a set of related words according to the content of documents. As result, a document-topic matrix representation is obtained denoting the importance of topics inside documents. For query processing, each query is represented as a set of clusters considering its terms. Thus, a similarity measure (e.g. cosine similarity) can be applied over this array and the matrix of documents to retrieve the most relevant documents. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.59

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Osuna-Ontiveros, D., Lopez-Arevalo, I. and Sosa-Sosa, V. (2011). A SEMANTIC CLUSTERING APPROACH FOR INDEXING DOCUMENTS. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR; ISBN 978-989-8425-79-9; ISSN 2184-3228, SciTePress, pages 280-285. DOI: 10.5220/0003663802880293

@conference{kdir11,
author={Daniel Osuna{-}Ontiveros and Ivan Lopez{-}Arevalo and Victor Sosa{-}Sosa},
title={A SEMANTIC CLUSTERING APPROACH FOR INDEXING DOCUMENTS},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR},
year={2011},
pages={280-285},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003663802880293},
isbn={978-989-8425-79-9},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR
TI - A SEMANTIC CLUSTERING APPROACH FOR INDEXING DOCUMENTS
SN - 978-989-8425-79-9
IS - 2184-3228
AU - Osuna-Ontiveros, D.
AU - Lopez-Arevalo, I.
AU - Sosa-Sosa, V.
PY - 2011
SP - 280
EP - 285
DO - 10.5220/0003663802880293
PB - SciTePress