Out of Sesame Street: A Study of Portuguese Legal Named Entity Recognition Through In-Context Learning

Rafael Oleques Nunes, Andre Spritzer, Carla Dal Sasso Freitas, Dennis Balreira

2024

Abstract

This paper explores the application of the In-Context Learning (ICL) paradigm for Named Entity Recognition (NER) within the Portuguese language legal domain. Identifying named entities in legal documents is complex due to the intricate nature of legal language and the specificity of legal terms. This task is important for a range of applications, from legal information retrieval to automated summarization and analysis. However, the manual annotation of these entities is costly due to the specialized knowledge required from legal experts and the large volume of documents. Recent advancements in Large Language Models (LLMs) have led to studies exploring the use of ICL to improve the performance of Generative Language Models (GLMs). In this work, we used Sabiá, a Portuguese language LLM, to extract named entities within the legal domain. Our goal was to evaluate the consistency of these extractions and derive insights from the results. Our methodology involved using a legal-domain NER corpus as input and selecting specific samples for a prompting task. We then instructed the GLM to catalog its own NER corpus, which we compared with the original test examples. Our study examined various aspects, including context examples, selection strategies, heuristic methodologies, post-processing techniques, and quantitative and qualitative analyses across specific domain classes. Our results indicate promising directions for future research and applications in specialized domains.
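The workflow outlined in the abstract (pick annotated samples as context, prompt the model, post-process its output, compare against the gold annotations) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the prompt wording, the `text | LABEL` output format, the label names, and the helper names `build_prompt`, `parse_entities`, and `precision_recall_f1` are hypothetical and are not taken from the paper. The call to the language model itself is left out.

```python
from typing import List, Tuple

Entity = Tuple[str, str]  # (surface text, label); label names are hypothetical


def build_prompt(examples: List[Tuple[str, List[Entity]]], query: str) -> str:
    """Assemble a few-shot prompt from annotated context examples (assumed format)."""
    parts = ["Extract the named entities from each sentence as 'text | LABEL' lines."]
    for sentence, entities in examples:
        parts.append(f"Sentence: {sentence}")
        parts.append("Entities:")
        parts.extend(f"{text} | {label}" for text, label in entities)
        parts.append("")  # blank line between context examples
    parts.append(f"Sentence: {query}")
    parts.append("Entities:")
    return "\n".join(parts)


def parse_entities(completion: str) -> List[Entity]:
    """Post-process a model completion, keeping only well-formed 'text | LABEL' lines."""
    entities = []
    for line in completion.splitlines():
        if "|" not in line:
            continue  # skip anything that does not follow the assumed format
        text, label = line.rsplit("|", 1)
        text, label = text.strip(), label.strip().upper()
        if text and label:
            entities.append((text, label))
    return entities


def precision_recall_f1(predicted: List[Entity], gold: List[Entity]) -> Tuple[float, float, float]:
    """Exact-match, entity-level scores for one sentence."""
    pred_set, gold_set = set(predicted), set(gold)
    true_positives = len(pred_set & gold_set)
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1
```

In this sketch, the string returned by `build_prompt` would be sent to the model, the completion passed through `parse_entities`, and the result scored against the test annotation with `precision_recall_f1`. Exact matching on both text and label is only one possible criterion; the paper's own evaluation and post-processing may differ.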

Paper Citation


in Harvard Style

Oleques Nunes R., Spritzer A., Dal Sasso Freitas C. and Balreira D. (2024). Out of Sesame Street: A Study of Portuguese Legal Named Entity Recognition Through In-Context Learning. In Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS; ISBN 978-989-758-692-7, SciTePress, pages 477-489. DOI: 10.5220/0012624700003690


in Bibtex Style

@conference{iceis24,
author={Rafael Oleques Nunes and Andre Spritzer and Carla Dal Sasso Freitas and Dennis Balreira},
title={Out of Sesame Street: A Study of Portuguese Legal Named Entity Recognition Through In-Context Learning},
booktitle={Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2024},
pages={477-489},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012624700003690},
isbn={978-989-758-692-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - Out of Sesame Street: A Study of Portuguese Legal Named Entity Recognition Through In-Context Learning
SN - 978-989-758-692-7
AU - Oleques Nunes R.
AU - Spritzer A.
AU - Dal Sasso Freitas C.
AU - Balreira D.
PY - 2024
SP - 477
EP - 489
DO - 10.5220/0012624700003690
PB - SciTePress