Combining Clustering and Classification Approaches for Reducing the Effort of Automatic Tweets Classification

Elias de Oliveira; Henrique Gomes Basoni; Marcos Rodrigues Saúde; Patrick Marques Ciarelli

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Combining Clustering and Classification Approaches for Reducing the Effort of Automatic Tweets Classification

Topics: Clustering and Classification Methods; Mining Text and Semi-Structured Data; Process Mining

In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 0IC3K, 465-472, 2014 , Rome, Italy

Authors: Elias de Oliveira ; Henrique Gomes Basoni ; Marcos Rodrigues Saúde and Patrick Marques Ciarelli

Affiliation: Universidade Federal do Espírito Santo, Brazil

Keyword(s): Text Classification, Social Network, Textmining.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Mining Text and Semi-Structured Data ; Process Mining ; Symbolic Systems

Abstract: The classification problem has got a new importance dimension with the growing aggregated value which has been given to the Social Media such as Twitter. The huge number of small documents to be organized into subjects is challenging the previous resources and techniques that have been using so far. Futhermore, today more than ever, personalization is the most important feature that a system needs to exhibit. The goal of many online systems, which are available in many areas, is to address the needs or desires of each individual user. To achieve this goal, these systems need to be more flexible and faster in order to adapt to the user’s needs. In this work, we explore a variety of techniques with the aim of better classify a large Twitter data set accordingly to a user goal. We propose a methodology where we cascade an unsupervised following by supervised technique. For the unsupervised technique we use standard clustering algorithms, and for the supervised technique we propose the u se of a kNN algorithm and a Centroid Based Classifier to perform the experiments. The results are promising because we reduced the amount of work to be done by the specialists and, in addition, we were able to mimic the human assessment decisions 0.7907 of the time, according to the F1-measure. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.84

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

de Oliveira, E., Gomes Basoni, H., Rodrigues Saúde, M. and Marques Ciarelli, P. (2014). Combining Clustering and Classification Approaches for Reducing the Effort of Automatic Tweets Classification. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2014) - KDIR; ISBN 978-989-758-048-2; ISSN 2184-3228, SciTePress, pages 465-472. DOI: 10.5220/0005159304650472

@conference{kdir14,
author={Elias {de Oliveira} and Henrique {Gomes Basoni} and Marcos {Rodrigues Saúde} and Patrick {Marques Ciarelli}},
title={Combining Clustering and Classification Approaches for Reducing the Effort of Automatic Tweets Classification},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2014) - KDIR},
year={2014},
pages={465-472},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005159304650472},
isbn={978-989-758-048-2},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2014) - KDIR
TI - Combining Clustering and Classification Approaches for Reducing the Effort of Automatic Tweets Classification
SN - 978-989-758-048-2
IS - 2184-3228
AU - de Oliveira, E.
AU - Gomes Basoni, H.
AU - Rodrigues Saúde, M.
AU - Marques Ciarelli, P.
PY - 2014
SP - 465
EP - 472
DO - 10.5220/0005159304650472
PB - SciTePress