Exploratory Analysis of Chat-based Black Market Profiles with Natural Language Processing
André Büsgen, Lars Klöser, Philipp Kohl, Oliver Schmidts, Bodo Kraft, Albert Zündorf
2022
Abstract
Messenger apps like WhatsApp or Telegram are an integral part of daily communication. Besides the various positive effects, those services extend the operating range of criminals. Open trading groups with many thousand participants emerged on Telegram. Law enforcement agencies monitor suspicious users in such chat rooms. This research shows that text analysis, based on natural language processing, facilitates this through a meaningful domain overview and detailed investigations. We crawled a corpus from such self-proclaimed black markets and annotated five attribute types products, money, payment methods, user names, and locations. Based on each message a user sends, we extract and group these attributes to build profiles. Then, we build features to cluster the profiles. Pretrained word vectors yield better unsupervised clustering results than current state-of-the-art transformer models. The result is a semantically meaningful high-level overview of the user landscape of black market chatrooms. Additionally, the extracted structured information serves as a foundation for further data exploration, for example, the most active users or preferred payment methods.
DownloadPaper Citation
in Harvard Style
Büsgen A., Klöser L., Kohl P., Schmidts O., Kraft B. and Zündorf A. (2022). Exploratory Analysis of Chat-based Black Market Profiles with Natural Language Processing. In Proceedings of the 11th International Conference on Data Science, Technology and Applications - Volume 1: DATA, ISBN 978-989-758-583-8, pages 83-94. DOI: 10.5220/0011271400003269
in Bibtex Style
@conference{data22,
author={André Büsgen and Lars Klöser and Philipp Kohl and Oliver Schmidts and Bodo Kraft and Albert Zündorf},
title={Exploratory Analysis of Chat-based Black Market Profiles with Natural Language Processing},
booktitle={Proceedings of the 11th International Conference on Data Science, Technology and Applications - Volume 1: DATA,},
year={2022},
pages={83-94},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011271400003269},
isbn={978-989-758-583-8},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 11th International Conference on Data Science, Technology and Applications - Volume 1: DATA,
TI - Exploratory Analysis of Chat-based Black Market Profiles with Natural Language Processing
SN - 978-989-758-583-8
AU - Büsgen A.
AU - Klöser L.
AU - Kohl P.
AU - Schmidts O.
AU - Kraft B.
AU - Zündorf A.
PY - 2022
SP - 83
EP - 94
DO - 10.5220/0011271400003269