loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Elias de Oliveira ; Henrique Gomes Basoni ; Marcos Rodrigues Saúde and Patrick Marques Ciarelli

Affiliation: Universidade Federal do Espírito Santo, Brazil

ISBN: 978-989-758-048-2

ISSN: 2184-3228

Keyword(s): Text Classification, Social Network, Textmining.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Mining Text and Semi-Structured Data ; Process Mining ; Symbolic Systems

Abstract: The classification problem has got a new importance dimension with the growing aggregated value which has been given to the Social Media such as Twitter. The huge number of small documents to be organized into subjects is challenging the previous resources and techniques that have been using so far. Futhermore, today more than ever, personalization is the most important feature that a system needs to exhibit. The goal of many online systems, which are available in many areas, is to address the needs or desires of each individual user. To achieve this goal, these systems need to be more flexible and faster in order to adapt to the user’s needs. In this work, we explore a variety of techniques with the aim of better classify a large Twitter data set accordingly to a user goal. We propose a methodology where we cascade an unsupervised following by supervised technique. For the unsupervised technique we use standard clustering algorithms, and for the supervised technique we propose the us e of a kNN algorithm and a Centroid Based Classifier to perform the experiments. The results are promising because we reduced the amount of work to be done by the specialists and, in addition, we were able to mimic the human assessment decisions 0.7907 of the time, according to the F1-measure. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.232.51.69

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
de Oliveira, E.; Gomes Basoni, H.; Rodrigues Saúde, M. and Marques Ciarelli , P. (2014). Combining Clustering and Classification Approaches for Reducing the Effort of Automatic Tweets Classification.In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014) ISBN 978-989-758-048-2, ISSN 2184-3228, pages 465-472. DOI: 10.5220/0005159304650472

@conference{kdir14,
author={Elias de Oliveira. and Henrique Gomes Basoni. and Marcos Rodrigues Saúde. and Patrick Marques Ciarelli .},
title={Combining Clustering and Classification Approaches for Reducing the Effort of Automatic Tweets Classification},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014)},
year={2014},
pages={465-472},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005159304650472},
isbn={978-989-758-048-2},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014)
TI - Combining Clustering and Classification Approaches for Reducing the Effort of Automatic Tweets Classification
SN - 978-989-758-048-2
AU - de Oliveira, E.
AU - Gomes Basoni, H.
AU - Rodrigues Saúde, M.
AU - Marques Ciarelli , P.
PY - 2014
SP - 465
EP - 472
DO - 10.5220/0005159304650472

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.