Towards Developing an Agent-Based Framework for Validating the Trustworthiness of Large Language Models

Johannes Bubeck, Janick Greinacher, Yannik Langer, Tobias Roth, Carsten Lanquillon

2024

Abstract

Large language models (LLMs) have revolutionized the field of generative artificial intelligence and strongly affect human-computer interaction based on natural language. Yet, it is difficult for users to understand how trustful LLM outputs are. Therefore, this paper develops an agent-based framework by exploring approaches, methods, and the integration of external data sources. The framework contributes to AI reasearch and usage by enabling future users to consider LLM outputs more efficiently and critically.

Download


Paper Citation


in Harvard Style

Bubeck J., Greinacher J., Langer Y., Roth T. and Lanquillon C. (2024). Towards Developing an Agent-Based Framework for Validating the Trustworthiness of Large Language Models. In Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART; ISBN 978-989-758-680-4, SciTePress, pages 527-534. DOI: 10.5220/0012364000003636


in Bibtex Style

@conference{icaart24,
author={Johannes Bubeck and Janick Greinacher and Yannik Langer and Tobias Roth and Carsten Lanquillon},
title={Towards Developing an Agent-Based Framework for Validating the Trustworthiness of Large Language Models},
booktitle={Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
year={2024},
pages={527-534},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012364000003636},
isbn={978-989-758-680-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART
TI - Towards Developing an Agent-Based Framework for Validating the Trustworthiness of Large Language Models
SN - 978-989-758-680-4
AU - Bubeck J.
AU - Greinacher J.
AU - Langer Y.
AU - Roth T.
AU - Lanquillon C.
PY - 2024
SP - 527
EP - 534
DO - 10.5220/0012364000003636
PB - SciTePress