Authors: Martin Leginus 1 ; Leon Derczynski 2 and Peter Dolog 1

Affiliations: 1 Aalborg University, Denmark ; 2 University of Sheffield, United Kingdom

Keyword(s): Word Clouds, Recognized Named Entities, User Evaluation, Social Streams Access.

Related Ontology Subjects/Areas/Topics: Multimedia and User Interfaces ; Searching and Browsing ; Social Media Analytics ; Social Networks and Organizational Culture ; Society, e-Business and e-Government ; Web Information Systems and Technologies ; Web Interfaces and Applications

Abstract: Intuitive and effective access to large volumes of information is increasingly important. As social media explodes as a useful source of information, so are methods required to access these large volumes of user-generated content. Word clouds are an effective information access tool. However, those generated over social media data often depict redundant and mis-ranked entries. This limits the users’ ability to browse and explore datasets. This paper proposes a method for improving word cloud generation over social streams. Named entity expressions in tweets are detected, disambiguated and aggregated into entity clusters. A word cloud is generated from terms that represent the most relevant entity clusters. We find that word clouds with grouped named entities attain significantly broader coverage and significantly decreased content duplication. Further, access to relevant entries in the collection is improved. An extrinsic crowdsourced user evaluation of generated word clouds was performed. Word clouds with grouped named entities are rated as significantly more relevant and more diverse with respect to the baseline. In addition, we found that word clouds with higher levels of Mean Average Precision (MAP) are more likely to be rated by users as being relevant to the concepts reflected. Critically, this supports MAP as a tool for predicting word cloud quality without requiring a human in the loop.
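
The abstract relies on Mean Average Precision (MAP) as an automatic quality signal. As a rough illustration, the sketch below computes the standard textbook form of MAP over ranked lists. The abstract does not say how the paper defines queries or relevance for a word cloud, so the function names, the example terms, and the assumption that each ranked list contains no duplicates are hypothetical and not taken from the paper.

```python
from typing import Hashable, Iterable, Sequence, Set, Tuple


def average_precision(ranked: Sequence[Hashable], relevant: Set[Hashable]) -> float:
    """Average precision of one ranked list against a set of relevant items.

    Assumes `ranked` has no duplicate items (an assumption of this sketch).
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            # Precision at this rank, counted only where a relevant item occurs.
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Iterable[Tuple[Sequence[Hashable], Set[Hashable]]]
) -> float:
    """Mean of the average precision over several (ranked list, relevant set) pairs."""
    runs = list(runs)
    if not runs:
        return 0.0
    return sum(average_precision(ranked, rel) for ranked, rel in runs) / len(runs)


# Hypothetical example data, not from the paper.
example_runs = [
    (["obama", "nyc", "spam", "nasa"], {"obama", "nasa"}),
    (["a", "b"], {"b"}),
]
map_score = mean_average_precision(example_runs)
```

In the first example list the relevant terms sit at ranks 1 and 4, giving an average precision of (1/1 + 2/4) / 2 = 0.75. The second list gives 0.5, so the mean over both is 0.625.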

CC BY-NC-ND 4.0

Paper citation in several formats:
Leginus, M.; Derczynski, L. and Dolog, P. (2015). Enhanced Information Access to Social Streams Through Word Clouds with Entity Grouping. In Proceedings of the 11th International Conference on Web Information Systems and Technologies - WEBIST; ISBN 978-989-758-106-9; ISSN 2184-3252, SciTePress, pages 183-193. DOI: 10.5220/0005403101830193

@conference{webist15,
author={Martin Leginus and Leon Derczynski and Peter Dolog},
title={Enhanced Information Access to Social Streams Through Word Clouds with Entity Grouping},
booktitle={Proceedings of the 11th International Conference on Web Information Systems and Technologies - WEBIST},
year={2015},
pages={183--193},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005403101830193},
isbn={978-989-758-106-9},
issn={2184-3252},
}

TY - CONF

JO - Proceedings of the 11th International Conference on Web Information Systems and Technologies - WEBIST
TI - Enhanced Information Access to Social Streams Through Word Clouds with Entity Grouping
SN - 978-989-758-106-9
IS - 2184-3252
AU - Leginus, M.
AU - Derczynski, L.
AU - Dolog, P.
PY - 2015
SP - 183
EP - 193
DO - 10.5220/0005403101830193
PB - SciTePress