Hate Speech Detection Using Cross-Platform Social Media Data in English and German Language

Gautam Shahi; Tim Majchrzak

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Hate Speech Detection Using Cross-Platform Social Media Data in English and German Language

Topics: Social Information Systems; Social Media Analytics

In Proceedings of the 20th International Conference on Web Information Systems and Technologies WEBIST - Volume 1, 131-140, 2024 , Porto, Portugal

Authors: Gautam Shahi ¹ and Tim Majchrzak ²

Affiliations: ¹ University of Duisburg-Essen, Germany ; ² University of Agder, Norway

Keyword(s): Hate Speech, YouTube, User Comments, Cross-Platform, Multilingual Data.

Abstract: Hate speech has grown into a pervasive phenomenon, intensifying during times of crisis, elections, and social unrest. Multiple approaches have been developed to detect hate speech using artificial intelligence, however, a generalized model is yet unaccomplished. The challenge for hate speech detection as text classification is the cost of obtaining high-quality training data. This study focuses on detecting bilingual hate speech in YouTube comments and measuring the impact of using additional data from other platforms in the performance of the classification model. We examine the value of additional training datasets from cross-platforms for improving the performance of classification models. We also included factors such as content similarity, definition similarity, and common hate words to measure the impact of datasets on performance. Our findings show that adding more similar datasets based on content similarity, hate words, and definitions improves the performance of classificat ion models. The best performance was obtained by combining datasets from YouTube comments, Twitter, and Gab with an F1-score of 0.74 and 0.68 for English and German YouTube comments. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.129.70.104

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Shahi, G. and Majchrzak, T. (2024). Hate Speech Detection Using Cross-Platform Social Media Data in English and German Language. In Proceedings of the 20th International Conference on Web Information Systems and Technologies - WEBIST; ISBN 978-989-758-718-4; ISSN 2184-3252, SciTePress, pages 131-140. DOI: 10.5220/0013070000003825

@conference{webist24,
author={Gautam Shahi and Tim Majchrzak},
title={Hate Speech Detection Using Cross-Platform Social Media Data in English and German Language},
booktitle={Proceedings of the 20th International Conference on Web Information Systems and Technologies - WEBIST},
year={2024},
pages={131-140},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013070000003825},
isbn={978-989-758-718-4},
issn={2184-3252},
}

TY - CONF

JO - Proceedings of the 20th International Conference on Web Information Systems and Technologies - WEBIST
TI - Hate Speech Detection Using Cross-Platform Social Media Data in English and German Language
SN - 978-989-758-718-4
IS - 2184-3252
AU - Shahi, G.
AU - Majchrzak, T.
PY - 2024
SP - 131
EP - 140
DO - 10.5220/0013070000003825
PB - SciTePress