Information Extraction in the Legal Domain: Traditional Supervised Learning vs. ChatGPT

Gustavo Coelho; Alimed Celecia; Jefferson de Sousa; Melissa Lemos; Maria Lima; Ana Mangeth; Isabella Frajhof; Marco Casanova

Topics: Deep Learning; Industrial Applications of Artificial Intelligence; Intelligent Agents; Neural Network Software and Applications

In Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS, pages 579-586, 2024, Angers, France

Authors: Gustavo Coelho¹; Alimed Celecia¹; Jefferson de Sousa¹; Melissa Lemos¹; Maria Lima¹; Ana Mangeth²; Isabella Frajhof² and Marco Casanova¹

Affiliations: ¹ Tecgraf - PUC-Rio, Rio de Janeiro, Brazil; ² LES - PUC-Rio, Rio de Janeiro, Brazil

Keyword(s): Natural Language Processing, Information Extraction, Text Classification, Named Entity Recognition, Large Language Models, Prompt Engineering.

Abstract: Information extraction is an important task in the legal domain. While structured, machine-processable data is scarce, unstructured data in the form of legal documents, such as legal opinions, is widely available. If properly processed, such documents can provide valuable information about past lawsuits, allowing better assessment by legal professionals and supporting data-driven applications. This paper addresses information extraction in the Brazilian legal domain by extracting structured features from legal opinions related to consumer complaints. It explores two approaches. The first uses traditional supervised learning, treating the extraction of categorical features as text classification and the extraction of numerical features as named entity recognition. The second takes advantage of the recent popularization of Large Language Models (LLMs) to extract categorical and numerical features using ChatGPT and prompt engineering techniques. The paper shows that, while both approaches reach similar overall performance on traditional evaluation metrics, ChatGPT substantially reduces the complexity and time required throughout the process.
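
The two framings described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the feature names (`outcome`, `moral_damages`), the label set, and the prompt wording are hypothetical, and a regular expression stands in for a trained named entity recognition model. No LLM is called here; `build_prompt` only composes the request text and `parse_response` validates a JSON reply against the assumed schema.

```python
import json
import re

# Hypothetical schema: the abstract does not list the paper's actual features.
CATEGORICAL = {"outcome": ["granted", "partially granted", "denied"]}
NUMERICAL = ["moral_damages"]

# Stand-in for a trained NER model: matches amounts such as "R$ 5.000,00".
AMOUNT_RE = re.compile(r"R\$\s*(\d+(?:\.\d{3})*(?:,\d{2})?)")


def find_amounts(text: str) -> list:
    """Return monetary amounts in the text, converted from Brazilian notation."""
    return [float(m.replace(".", "").replace(",", ".")) for m in AMOUNT_RE.findall(text)]


def build_prompt(opinion_text: str) -> str:
    """Compose a prompt asking an LLM to return all features as one JSON object."""
    lines = ["Extract the fields below from the legal opinion. Answer with JSON only."]
    for name, labels in CATEGORICAL.items():
        lines.append(f'- "{name}": one of {labels}')
    for name in NUMERICAL:
        lines.append(f'- "{name}": a number, or null if absent')
    lines.append("Opinion:")
    lines.append(opinion_text)
    return "\n".join(lines)


def parse_response(raw: str) -> dict:
    """Check a JSON reply against the schema; invalid or missing values become None."""
    data = json.loads(raw)
    result = {}
    for name, labels in CATEGORICAL.items():
        value = data.get(name)
        result[name] = value if value in labels else None
    for name in NUMERICAL:
        value = data.get(name)
        is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
        result[name] = float(value) if is_number else None
    return result
```

In this sketch the supervised route needs one component per feature type (a classifier for categorical features, an entity extractor for numerical ones), whereas the prompt-based route asks for every field in a single request and validates the reply afterwards.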

CC BY-NC-ND 4.0

Paper citation in several formats:

Coelho, G., Celecia, A., de Sousa, J., Lemos, M., Lima, M., Mangeth, A., Frajhof, I. and Casanova, M. (2024). Information Extraction in the Legal Domain: Traditional Supervised Learning vs. ChatGPT. In Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS; ISBN 978-989-758-692-7; ISSN 2184-4992, SciTePress, pages 579-586. DOI: 10.5220/0012499800003690

@conference{iceis24,
author={Gustavo Coelho and Alimed Celecia and Jefferson {de Sousa} and Melissa Lemos and Maria Lima and Ana Mangeth and Isabella Frajhof and Marco Casanova},
title={Information Extraction in the Legal Domain: Traditional Supervised Learning vs. ChatGPT},
booktitle={Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2024},
pages={579-586},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012499800003690},
isbn={978-989-758-692-7},
issn={2184-4992},
}

TY - CONF
JO - Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - Information Extraction in the Legal Domain: Traditional Supervised Learning vs. ChatGPT
SN - 978-989-758-692-7
IS - 2184-4992
AU - Coelho, G.
AU - Celecia, A.
AU - de Sousa, J.
AU - Lemos, M.
AU - Lima, M.
AU - Mangeth, A.
AU - Frajhof, I.
AU - Casanova, M.
PY - 2024
SP - 579
EP - 586
DO - 10.5220/0012499800003690
PB - SciTePress