addressing the details of this event is important for the
correct classification of this tweet. The anchor text
(“Kandel”) provides information on the crime of the
young offender and his conviction. The close
proximity of the fact to the author’s negative affective
state reveals her or his repudiation of the conviction.
We may take this affective state as a special indicator
that has a negative impact on its surrounding, which
can be toxic statements or facts from the anchor text
or the immediate statements from the other debaters.
Phase 6: The final measurement of the toxicity
combines the evaluations obtained from individual
statements with related affective states.
The measurement of the toxicity depends on the
quantity and quality of aggressive terms in the
statement. Here, our System differentiates between
oppositional opinion, offensive statement, threat
against something or somebody, or inciting
statement. In some cases, qualification is
straightforward. For example, if the author of the
statement uses outright aggressive terms like in “Ich
bin dafür, dass wir die Gaskammern wieder öffnen
und die ganz Brut da reinstecken.- I’m in favor of
opening the gas chambers again and put in the whole
offspring.”, we can immediately classify this
statement as hate speech. In all other cases, we
combine the levels of toxicity assigned to that
statement. The overall scenario, for instance, may
simply be an oppositional opinion. However,
combined with a strong negative affective state
(similar to one of Statement 1) the statement as whole
qualifies as offensive statement. For the time being,
our system evaluates each statement independently.
However, in the near future it will try to capture the
latent prevailing mood or opinion of the author along
her or his narratives.
5 CONCLUSIONS
This paper presented the state of work of a
prototypical system to produce and apply context-
aware information retrieval and classification on
different levels on granularity. Named entity
recognition (accompanied by analysis of N-grams)
helps to identify context information.
The paper presents application of recursive NER
in the area of economic analysis and hate speech
detection. Once the context descriptions are created,
retrieval and classification processes operate on these
data. It enables a smoother navigation over texts and
zooming in to text passages that hit the interest of the
users. It supports also the contextualization across a
series of statements along their discourse storyline in
social media. Text analysis along the storyline of
discourses supports hate speech detection.
The long-term objective of the system design as
discussed here is a stronger involvement of humans
in the development of context information and on the
behavior of the system concerning context inference.
This involvement results in a more active role of the
users in designing, controlling, and adapting of the
learning process that feeds the automatic detection of
context information.
REFERENCES
Aydugan Baydar G. and Arslan S. (2019). FOCA: A
System for Classification, Digitalization and
Information Retrieval of Trial Balance Documents.In
Proceedings of the 8th International Conference on
Data Science, Technology and Applications – DATA,
pp. 174-181.
Bedathur, S., Berberich, K., Dittrich, J., Mamoulis, N.,
Weikum, G., 2010. Interesting-phrase mining for ad-
hoc text analytics. In: Proceedings of the VLDB
Endowment, vol. 3, no. 1-2, 1348-1357.
Bollegala, D., Atanasov, V., Maehara, T., Kawarabayashi,
K.-I., 2018. ClassiNet—Predicting Missing Features
for Short-Text Classification. ACM Transactions on
Knowledge Discovery from Data (TKDD) 12(5): 1–29.
Brown, P.J., Jones, G.J.F., 2001. Context-aware Retrieval:
Exploring a New Environment for Information
Retrieval and Information Filtering. In: Personal and
Ubiquitous Computing 5(4): 253–263
Chatzakou, D., Leontiadis, I., Blackburn, J., Cristofaro, E.
de, Stringhini, G., Vakali, A., Kourtellis, N. (2019).
Detecting Cyberbulling and Cyberaggression in Social
Media. ACM Transactions on the Web (TWEB) 13(3),
pp. 1–51.
Cowie, J., Lehnert, W., 1996. Information Extraction.
Communications of the ACM, vol. 39, no. 1, 80-91.
Fan, W., Wallace, L., Rich, S., Zhang, Z., 2006. Tapping
the power of text mining. Communications of the ACM,
vol. 49, no. 9, 76-82.
Freitag, D., McCallum, A., 2000. Information Extraction
with HMM Structures Learned by Stochastic
Optimization. Proceedings of the Seventeenth National
Conference on Artificial Intelligence and Twelfth
Conference on Innovative Applications of Artificial
Intelligence, pp. 584-589.
Georgakopoulos, S.V., Tasoulis, S.K., Vrahatis, A.G.,
Plagianakos, V.P., 2018. Convolutional Neural
Networks for Toxic Comment Classification. In:
Proceedings of the 10th Hellenic Conference on
Artificial Intelligence, pp. 1–6.
Jancsary, J., Neubarth, F., Schreitter, S., Trost, H., 2011.
Towards a context-sensitive online newspaper. In:
Proceedings of the 2011 Workshop on Context-
awareness in Retrieval and Recommendation, pp 2-9.
Laborde, A., 2020. Wall Street cae un 6,3% en una sesión