loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Rui Portocarrero Sarmento 1 ; Mário Cordeiro 1 ; Pavel Brazdil 2 and João Gama 2

Affiliations: 1 University of Porto, Portugal ; 2 LIAAD-INESC TEC, Portugal

Keyword(s): Automatic Keyword Extraction, Incremental PageRank, Data Streams, Text Mining, Incremental TextRank.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Artificial Intelligence and Decision Support Systems ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Evolutionary Programming ; Information Systems Analysis and Specification ; Natural Language Interfaces to Intelligent Systems ; Performance Evaluation and Benchmarking ; Sensor Networks ; Signal Processing ; Soft Computing ; Software Engineering ; Strategic Decision Support Systems

Abstract: Text Mining and NLP techniques are a hot topic nowadays. Researchers thrive to develop new and faster algorithms to cope with larger amounts of data. Particularly, text data analysis has been increasing in interest due to the growth of social networks media. Given this, the development of new algorithms and/or the upgrade of existing ones is now a crucial task to deal with text mining problems under this new scenario. In this paper, we present an update to TextRank, a well-known implementation used to do automatic keyword extraction from text, adapted to deal with streams of text. In addition, we present results for this implementation and compare them with the batch version. Major improvements are lowest computation times for the processing of the same text data, in a streaming environment, both in sliding window and incremental setups. The speedups obtained in the experimental results are significant. Therefore the approach was considered valid and useful to the research c ommunity. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 52.87.200.112

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Sarmento, R.; Cordeiro, M.; Brazdil, P. and Gama, J. (2018). Incremental TextRank - Automatic Keyword Extraction for Text Streams. In Proceedings of the 20th International Conference on Enterprise Information Systems - Volume 1: ICEIS; ISBN 978-989-758-298-1; ISSN 2184-4992, SciTePress, pages 363-370. DOI: 10.5220/0006639703630370

@conference{iceis18,
author={Rui Portocarrero Sarmento. and Mário Cordeiro. and Pavel Brazdil. and João Gama.},
title={Incremental TextRank - Automatic Keyword Extraction for Text Streams},
booktitle={Proceedings of the 20th International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2018},
pages={363-370},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006639703630370},
isbn={978-989-758-298-1},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the 20th International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - Incremental TextRank - Automatic Keyword Extraction for Text Streams
SN - 978-989-758-298-1
IS - 2184-4992
AU - Sarmento, R.
AU - Cordeiro, M.
AU - Brazdil, P.
AU - Gama, J.
PY - 2018
SP - 363
EP - 370
DO - 10.5220/0006639703630370
PB - SciTePress