loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Pedro Curto 1 ; Nuno Mamede 1 and Jorge Baptista 2

Affiliations: 1 Universidade de Lisboa and INESC-ID Lisboa/L2F – Spoken Language Lab, Portugal ; 2 Universidade de Lisboa and Universidade do Algarve, Portugal

Keyword(s): Readability, Readability Assessment Metrics, Automatic Readability Classifier, Linguistic Features Extraction, Portuguese.

Related Ontology Subjects/Areas/Topics: Computer-Supported Education ; Information Technologies Supporting Learning ; Learning/Teaching Methodologies and Assessment ; Metrics and Performance Measurement

Abstract: This paper describes a system to assist the selection of adequate reading materials to support European Portuguese teaching, especially as second language, while highlighting the key challenges on the selection of linguistic features for text difficulty (readability) classification. The system uses existing Natural Language Processing (NLP) tools to extract linguistic features from texts, which are then used by an automatic readability classifier. Currently, 52 features are extracted: parts-of-speech (POS), syllables, words, chunks and phrases, averages and frequencies, and some extra features. A classifier was created using these features and a corpus, previously annotated by readability level, using a five-levels language classification official standard for Portuguese as Second Language. In a five-levels (from A1 to C1) scenario, the best-performing learning algorithm (LogitBoost) achieved an accuracy of 75.11% with a root mean square error (RMSE) of 0.269. In a three-level s (A, B and C) scenario, the best-performing learning algorithm (C4.5 grafted) achieved 81.44% accuracy with a RMSE of 0.346. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.117.105.230

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Curto, P.; Mamede, N. and Baptista, J. (2015). Automatic Text Difficulty Classifier - Assisting the Selection Of Adequate Reading Materials For European Portuguese Teaching. In Proceedings of the 7th International Conference on Computer Supported Education - CSEDU; ISBN 978-989-758-107-6; ISSN 2184-5026, SciTePress, pages 36-44. DOI: 10.5220/0005428300360044

@conference{csedu15,
author={Pedro Curto. and Nuno Mamede. and Jorge Baptista.},
title={Automatic Text Difficulty Classifier - Assisting the Selection Of Adequate Reading Materials For European Portuguese Teaching},
booktitle={Proceedings of the 7th International Conference on Computer Supported Education - CSEDU},
year={2015},
pages={36-44},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005428300360044},
isbn={978-989-758-107-6},
issn={2184-5026},
}

TY - CONF

JO - Proceedings of the 7th International Conference on Computer Supported Education - CSEDU
TI - Automatic Text Difficulty Classifier - Assisting the Selection Of Adequate Reading Materials For European Portuguese Teaching
SN - 978-989-758-107-6
IS - 2184-5026
AU - Curto, P.
AU - Mamede, N.
AU - Baptista, J.
PY - 2015
SP - 36
EP - 44
DO - 10.5220/0005428300360044
PB - SciTePress