loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Luis G. Moreno-Sandoval 1 ; Joan Felipe Mendoza-Molina 1 ; Edwin Alexander Puertas 1 ; Arturo Duque-Marín 1 ; Alexandra Pomares-Quimbaya 2 and Jorge A. Alvarado-Valencia 2

Affiliations: 1 Colombian Center of Excellence and Appropriation on Big Data and Data Analytics (CAOBA), Colombia ; 2 Pontificia Universidad Javeriana, Colombia

Keyword(s): SVM, SGD, Classification Problem, Age Classification, Twitter, Spanish.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Information Systems Analysis and Specification ; Sensor Networks ; Signal Processing ; Soft Computing ; Software Engineering

Abstract: Text classification or text categorization in social networks such as Twitter has taken great importance with the growth of applications of this process in diverse domains of society. Literature about text classifiers is significantly wide especially in languages such as English; however, this is not the case for age classification whose studies have been mainly focused on image recognition and analysis. This paper presents the results of testing linear classifiers performance in the task of identifying Twitter users age from their profile descriptions and tweets. For this purpose, a Spanish Lexicon of 45 words around the concept “cumpleaños” was created and the Gold Standard of 1541 users with age correctly identified was obtained. The experiments are presented with the description of the algorithms used to finally obtain the best seven models that permit to identify the user's age with accuracy results between 66% and 69 %. Considering the information-retrieval layer, the new results showed that accuracy was increased from 69,09% to 72,96%. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.118.32.224

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Moreno-Sandoval, L. ; Mendoza-Molina, J. ; Puertas, E. ; Duque-Marín, A. ; Pomares-Quimbaya, A. and Alvarado-Valencia, J. (2018). Age Classification from Spanish Tweets - The Variable Age Analyzed by using Linear Classifiers. In Proceedings of the 20th International Conference on Enterprise Information Systems - Volume 1: ICEIS; ISBN 978-989-758-298-1; ISSN 2184-4992, SciTePress, pages 275-281. DOI: 10.5220/0006811102750281

@conference{iceis18,
author={Luis G. Moreno{-}Sandoval and Joan Felipe Mendoza{-}Molina and Edwin Alexander Puertas and Arturo Duque{-}Marín and Alexandra Pomares{-}Quimbaya and Jorge A. Alvarado{-}Valencia},
title={Age Classification from Spanish Tweets - The Variable Age Analyzed by using Linear Classifiers},
booktitle={Proceedings of the 20th International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2018},
pages={275-281},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006811102750281},
isbn={978-989-758-298-1},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the 20th International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - Age Classification from Spanish Tweets - The Variable Age Analyzed by using Linear Classifiers
SN - 978-989-758-298-1
IS - 2184-4992
AU - Moreno-Sandoval, L.
AU - Mendoza-Molina, J.
AU - Puertas, E.
AU - Duque-Marín, A.
AU - Pomares-Quimbaya, A.
AU - Alvarado-Valencia, J.
PY - 2018
SP - 275
EP - 281
DO - 10.5220/0006811102750281
PB - SciTePress