loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Gabriela Bosetti ; Előd Egyed-Zsigmond and Lucas Okumura Ono

Affiliation: Université de Lyon, LIRIS UMR 5205 CNRS, Bâtiment Blaise Pascal, 20 avenue Albert Einstein, 69621 Villeurbanne and France

Keyword(s): Active Learning, Human-computer Interaction, User-centric Systems, Web Information Filtering and Retrieval.

Abstract: Today, there are plenty of tools and techniques to perform text- or image-based classification of large datasets, targeting different levels of user expertise and abstraction. Specialists usually collaborate in projects by creating ground truth datasets and do not always have deep knowledge in Information Retrieval. This article presents a full platform for assisted binary classification of very large textual and text and image composed documents. Our goal is to enable human users to classify collections of several hundred thousand documents in an assisted way, within a humanly acceptable number of clicks. We propose a graphical user interface, based on several classification assistants: text- and image-based event detection, Active Learning (AL), search engine and rich visual metaphors to visualize the results. We also propose a novel query strategy in the context of Active Learning, considering the top unlabeled bi-grams and duplicated (e.g. re-tweeted) content in the target corpus to classify. These contributions are supported not only by a tool whose code is freely accessible but also by an evaluation of the impact of using the aforementioned methods on the number of clicks needed to reach a stable level of accuracy. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 52.14.7.53

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Bosetti, G., Egyed-Zsigmond, E. and Ono, L. O. (2019). CATI: An Active Learning System for Event Detection on Mibroblogs’ Large Datasets. In Proceedings of the 15th International Conference on Web Information Systems and Technologies - WEBIST; ISBN 978-989-758-386-5; ISSN 2184-3252, SciTePress, pages 151-160. DOI: 10.5220/0008355301510160

@conference{webist19,
author={Gabriela Bosetti and Előd Egyed{-}Zsigmond and Lucas Okumura Ono},
title={CATI: An Active Learning System for Event Detection on Mibroblogs’ Large Datasets},
booktitle={Proceedings of the 15th International Conference on Web Information Systems and Technologies - WEBIST},
year={2019},
pages={151-160},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008355301510160},
isbn={978-989-758-386-5},
issn={2184-3252},
}

TY - CONF

JO - Proceedings of the 15th International Conference on Web Information Systems and Technologies - WEBIST
TI - CATI: An Active Learning System for Event Detection on Mibroblogs’ Large Datasets
SN - 978-989-758-386-5
IS - 2184-3252
AU - Bosetti, G.
AU - Egyed-Zsigmond, E.
AU - Ono, L.
PY - 2019
SP - 151
EP - 160
DO - 10.5220/0008355301510160
PB - SciTePress