Towards Developing an Agent-Based Framework for Validating the Trustworthiness of Large Language Models
Johannes Bubeck, Janick Greinacher, Yannik Langer, Tobias Roth, Carsten Lanquillon
2024
Abstract
Large language models (LLMs) have revolutionized the field of generative artificial intelligence and strongly affect human-computer interaction based on natural language. Yet, it is difficult for users to understand how trustful LLM outputs are. Therefore, this paper develops an agent-based framework by exploring approaches, methods, and the integration of external data sources. The framework contributes to AI reasearch and usage by enabling future users to consider LLM outputs more efficiently and critically.
DownloadPaper Citation
in Harvard Style
Bubeck J., Greinacher J., Langer Y., Roth T. and Lanquillon C. (2024). Towards Developing an Agent-Based Framework for Validating the Trustworthiness of Large Language Models. In Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART; ISBN 978-989-758-680-4, SciTePress, pages 527-534. DOI: 10.5220/0012364000003636
in Bibtex Style
@conference{icaart24,
author={Johannes Bubeck and Janick Greinacher and Yannik Langer and Tobias Roth and Carsten Lanquillon},
title={Towards Developing an Agent-Based Framework for Validating the Trustworthiness of Large Language Models},
booktitle={Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
year={2024},
pages={527-534},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012364000003636},
isbn={978-989-758-680-4},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART
TI - Towards Developing an Agent-Based Framework for Validating the Trustworthiness of Large Language Models
SN - 978-989-758-680-4
AU - Bubeck J.
AU - Greinacher J.
AU - Langer Y.
AU - Roth T.
AU - Lanquillon C.
PY - 2024
SP - 527
EP - 534
DO - 10.5220/0012364000003636
PB - SciTePress