3.3 Conclusion
In this research, we developed a methodology for pat-
tern exploration and event detection in news articles.
We used topic modeling to infer 1,271 topics from
a total of 236,029 articles and identified 725 unique
clusters for the chosen dataset topic. By examin-
ing shared content and contextual information within
each cluster, 1,455 events were detected. The pro-
posed methodology demonstrated promising potential
for analysis and event detection across various do-
mains, offering valuable insights for researchers and
practitioners. Further refinement and enhancements
will advance the field of event detection in news ar-
ticles and the application of automated methods to
solve relevant everyday issues.
ACKNOWLEDGEMENTS
This research is supported by the University of West
Attica.
REFERENCES
Angelov, D. (2020). Top2vec: Distributed representations
of topics. CoRR, abs/2008.09470.
Aslan Ozgul, B. and Veneti, A. (2021). The Different Orga-
nizational Structures of Alternative Media: Through
the Perspective of Alternative Media Journalists in
Turkey and Greece. Digital Journalism, 0(0):1–20.
Bouras, C. and Tsogkas, V. (2013). Enhancing news ar-
ticles clustering using word N-grams. DATA 2013 -
Proceedings of the 2nd International Conference on
Data Technologies and Applications, (1994):53–60.
Budiarto, A., Rahutomo, R., Putra, H. N., Cenggoro, T. W.,
Kacamarga, M. F., and Pardamean, B. (2021). Un-
supervised News Topic Modelling with Doc2Vec and
Spherical Clustering. Procedia Computer Science,
179(2020):40–46.
Devlin, J., Chang, M. W., Lee, K., and Toutanova, K.
(2019). BERT: Pre-training of deep bidirectional
transformers for language understanding. NAACL
HLT 2019 - 2019 Conference of the North American
Chapter of the Association for Computational Lin-
guistics: Human Language Technologies - Proceed-
ings of the Conference, 1(Mlm):4171–4186.
Flynn, C. and Dunnion, J. (2004). Event clustering in
the news domain. Lecture Notes in Artificial Intelli-
gence (Subseries of Lecture Notes in Computer Sci-
ence), 3206:65–72.
Jacobi, C., Van Atteveldt, W., and Welbers, K. (2016).
Quantitative analysis of large amounts of journalis-
tic texts using topic modelling. Digital Journalism,
4(1):89–106.
Kapellas, N. and Kapidakis, S. (2022). A Text Similarity
Study: Understanding How Differently Greek News
Media Describe News Events. International Joint
Conference on Knowledge Discovery, Knowledge En-
gineering and Knowledge Management, IC3K - Pro-
ceedings, 2(Ic3k):245–252.
Kirill, Y., Mihail, I. G., Sanzhar, M., Rustam, M., Olga,
F., and Ravil, M. (2020). Propaganda Identification
Using Topic Modelling. Procedia Computer Science,
178(2019):205–212.
Koutsikakis, J., Chalkidis, I., Malakasiotis, P., and Androut-
sopoulos, I. (2020). GREEK-BERT: The greeks vis-
iting sesame street. ACM International Conference
Proceeding Series, pages 110–117.
Kumaran, G. and Allan, J. (2005). Using names and topics
for new event detection. HLT/EMNLP 2005 - Human
Language Technology Conference and Conference on
Empirical Methods in Natural Language Processing,
Proceedings of the Conference, pages 121–128.
Liu, Z., Zhang, Y., Li, Y., and Chaomurilige (2023).
Key News Event Detection and Event Context Using
Graphic Convolution, Clustering, and Summarizing
Methods. Applied Sciences (Switzerland), 13(9).
Papadopoulou, L., Kavoulakos, K., and Avramidis, C.
(2021). Intermedia Agenda Setting and Grassroots
Collectives: Assessing Global Media’s Influence on
Greek News Outlets. Studies in Media and Communi-
cation, 9(2):12.
PATEL, V. and PATEL, A. (2018). Clustering News Articles
for Topic Detection. Iconic Research And Engineering
Journals, 1(11):57–61.
Piskorski, J., Tanev, H., Atkinson, M., and Van Der Goot,
E. (2008). Cluster-centric approach to news event ex-
traction. Frontiers in Artificial Intelligence and Appli-
cations, 181(1):276–290.
Rafea, A. and Gaballah, N. A. (2018). Topic Detection Ap-
proaches in Identifying Topics and Events from Ara-
bic Corpora. Procedia Computer Science, 142:270–
277.
Shah, N. A. and ElBahesh, E. M. (2004). Topic-based clus-
tering of news articles. Proceedings of the Annual
Southeast Conference, pages 412–413.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J.,
Jones, L., Gomez, A. N., Kaiser, L., and Polo-
sukhin, I. (2017). Attention is all you need. CoRR,
abs/1706.03762.
Xu, D. and Tian, Y. (2015). A Comprehensive Survey
of Clustering Algorithms. Annals of Data Science,
2(2):165–193.
Zhang, Y., Guo, F., Shen, J., and Han, J. (2022). Unsuper-
vised Key Event Detection from Massive Text Cor-
pora. Proceedings of the ACM SIGKDD International
Conference on Knowledge Discovery and Data Min-
ing, pages 2535–2544.
Event Detection in News Articles: A Hybrid Approach Combining Topic Modeling, Clustering, and Named Entity Recognition
279