Comparative Analysis of Hate Speech Detection Models on Brazilian Portuguese Data: Modified BERT vs. BERT vs. Standard Machine Learning Algorithms

Thiago Mei Chu, Leila Weitzel, Paulo Quaresma

2024

Abstract

The Internet became the platform for debates and expression of personal opinions on various subjects. Social media have assumed an important role as a tool for interaction and communication between people. To understand this phenomenon, it is indispensable to detect and assess what characterizes hate speech and how harmful it can be to society. In this paper we present a comprehensive evaluation of Portuguese-BR hate speech identification based on BERT model and ML models as baseline. The BERT model achieves higher scores compared to the machine learning algorithms, indicating better overall performance in distinguishing between classes.

Download


Paper Citation


in Harvard Style

Mei Chu T., Weitzel L. and Quaresma P. (2024). Comparative Analysis of Hate Speech Detection Models on Brazilian Portuguese Data: Modified BERT vs. BERT vs. Standard Machine Learning Algorithms. In Proceedings of the 13th International Conference on Data Science, Technology and Applications - Volume 1: DATA; ISBN 978-989-758-707-8, SciTePress, pages 392-400. DOI: 10.5220/0012770600003756


in Bibtex Style

@conference{data24,
author={Thiago Mei Chu and Leila Weitzel and Paulo Quaresma},
title={Comparative Analysis of Hate Speech Detection Models on Brazilian Portuguese Data: Modified BERT vs. BERT vs. Standard Machine Learning Algorithms},
booktitle={Proceedings of the 13th International Conference on Data Science, Technology and Applications - Volume 1: DATA},
year={2024},
pages={392-400},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012770600003756},
isbn={978-989-758-707-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Conference on Data Science, Technology and Applications - Volume 1: DATA
TI - Comparative Analysis of Hate Speech Detection Models on Brazilian Portuguese Data: Modified BERT vs. BERT vs. Standard Machine Learning Algorithms
SN - 978-989-758-707-8
AU - Mei Chu T.
AU - Weitzel L.
AU - Quaresma P.
PY - 2024
SP - 392
EP - 400
DO - 10.5220/0012770600003756
PB - SciTePress