AUTOMATIC ESTIMATION OF THE LSA DIMENSION

Jorge Fernandes; Andreia Artífice; Manuel J. Fonseca

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

AUTOMATIC ESTIMATION OF THE LSA DIMENSION

Topics: Clustering and Classification Methods; Information Extraction; Machine Learning

In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 0IC3K, 301-305, 2011 , Paris, France

Authors: Jorge Fernandes ; Andreia Artífice and Manuel J. Fonseca

Affiliation: INESC-ID/ IST/ Technical University of Lisbon, Portugal

Keyword(s): LSA, LSA dimension, Unsupervised text classification, Bootstrapping.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Computational Intelligence ; Evolutionary Computing ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Soft Computing ; Symbolic Systems

Abstract: Nowadays the size of collections of information achieved considerable sizes, making the finding and exploration of a particular subject hard to achieve. One way to solve this problem is through text classification, where a theme or category is assigned to a text based on the analysis of its content. However, existing approaches to text classification require some effort and a high level of knowledge on this subject by the users, making them inaccessible to the common user. Another problem of current approaches is that they are optimized for a specific problem and can not easily be adapted to another context. In particular, unsupervised methods based on the LSA algorithm require users to define the dimension to use in the algorithm. In this paper we describe an approach to make the use of text classification more accessible to common users, by providing a formula to estimate the dimension of the LSA based on the number of texts used during the bootstrapping process. Experimental resul ts show that our formula for estimation of the LSA dimension allows us to create unsupervised solutions able to achieve results similar to supervised approaches. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.84

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Fernandes, J., Artífice, A. and J. Fonseca, M. (2011). AUTOMATIC ESTIMATION OF THE LSA DIMENSION. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR; ISBN 978-989-8425-79-9; ISSN 2184-3228, SciTePress, pages 301-305. DOI: 10.5220/0003666103090313

@conference{kdir11,
author={Jorge Fernandes and Andreia Artífice and Manuel {J. Fonseca}},
title={AUTOMATIC ESTIMATION OF THE LSA DIMENSION},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR},
year={2011},
pages={301-305},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003666103090313},
isbn={978-989-8425-79-9},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR
TI - AUTOMATIC ESTIMATION OF THE LSA DIMENSION
SN - 978-989-8425-79-9
IS - 2184-3228
AU - Fernandes, J.
AU - Artífice, A.
AU - J. Fonseca, M.
PY - 2011
SP - 301
EP - 305
DO - 10.5220/0003666103090313
PB - SciTePress