unseen German legal documents and analyse their
contents.
The judgement corpus can be used for writ-
ing style detection in free text. Furthermore, it
is possible to look into the inner workings of the
judgement. Which conclusion is connected to
which definition and which definition belongs
to which subsumption. Additionally, judgements
from different courts could be compared in order to
answer whether there are big differences between the
writing styles of the courts.
Based on the work presented in Mitrovi
´
c et
al. (Mitrovi
´
c et al., 2017) writing style components
can be represented ontologically, and their persua-
siveness assessed based on the rhetorical elements
contained therein.
6 CONCLUSION
This paper presents two novel German legal corpora
based on Bavarian Court decisions between 2015 and
2020.
The first one contains 32,748 decisions from 131
German courts. A model that predicts the type of de-
cision was trained on this corpus. Resulting in a pre-
cision of 0.97.
The second corpus is a subset of the first one.
200 judgements were randomly chosen and annotated
with conclusion, definition, subsumption and
other, components of the Urteilsstil. On this cor-
pus several models were trained to predict to which
component a sentence of a judgement belongs. The
baseline is always outperformed, however no clear
best approach could be determined. LR performs well
with unigrams and SCV performs the same with tf-
idf.
Both corpora are published on the open science
platform zenodo.
In future work different legal experts will inspect
the existing labels to ensure the label quality.
ACKNOWLEDGEMENTS
The annotation of the judgement corpus was only pos-
sible due to the help and funding from the research
centre FREDI
13
.
The project on which this report is based was
funded by the German Federal Ministry of Educa-
tion and Research (BMBF) under the funding code
01—S20049. The author is responsible for the con-
tent of this publication.
REFERENCES
Alexy, R. (1983). Theorie der juristischen Argumentation.
Suhrkamp Verlag AG.
Danger, D. J. (2005). Urteil und urteilsstil in der zivil-
rechtlichen assessorklausur: eine praktische hilfestel-
lung. Juristische Arbeitsbl
¨
atter, (07).
Gamper, J. (2000). A parallel corpus of italian/german legal
texts. In LREC.
Mitrovi
´
c, J., O’Reilly, C., Mladenovi
´
c, M., and Handschuh,
S. (2017). Ontological representations of rhetorical
figures for argument mining. Argument & Computa-
tion, 8(3):267–287.
Mitrovi
´
c, J., Pease, A., and Granitzer, M. (2019). Modeling
legal terminology in sumo.
Mochales, R. and Moens, M.-F. (2011). Argumentation
mining. Artificial Intelligence and Law, 19(1):1–22.
Palau, R. M. and Moens, M.-F. (2009). Argumentation
mining: the detection, classification and structure of
arguments in text. In Proceedings of the 12th inter-
national conference on artificial intelligence and law,
pages 98–107. ACM.
Reed, C. (2006). Preliminary results from an argument cor-
pus. Linguistics in the twenty-first century, pages 185–
196.
Rizzo, C. R. and P
´
erez, M. J. M. (2012). Structure and
design of the british law report corpus (blrc): a legal
corpus of judicial decisions from the uk. Journal of
English Studies, 10:131–145.
Rossini Favretti, R., Tamburini, F., and Martelli, E. (2001).
Words from bononia legal corpus. International jour-
nal of corpus linguistics, 6(1-12):13–34.
Sugathadasa, K., Ayesha, B., de Silva, N., Perera, A. S.,
Jayawardana, V., Lakmal, D., and Perera, M. (2017).
Synergistic union of word2vec and lexicon for do-
main specific semantic similarity. IEEE International
Conference on Industrial and Information Systems
(ICIIS).
Urchs, S. (2020). Extracting definition and subsumption
from german law. Master’s thesis.
Weber, D. (2018). Methodik der Fallbearbeitung. Nomos
Verlagsges.MBH + Co.
Wissler, L., Almashraee, M., D
´
ıaz, D. M., and Paschke, A.
(2014). The gold standard in corpus annotation. In
IEEE GSC.
13
https://www.jura.uni-passau.de/fakultaet/
forschungseinrichtungen/forschungsstelle-fredi/
Design and Implementation of German Legal Decision Corpora
521