Knowledge Graph Extraction from Retrieval-Augmented Generator: An Application in Aluminium Die Casting
Florian Rötzer, Kai Göbel, Maximilian Liebetreu, Stephan Strommer
2024
Abstract
We present a novel, efficient, and scalable approach for generating knowledge graphs (KGs) tailored to specific competency questions, leveraging large language model (LLM)-based retrieval-augmented generation (RAG) as a source of high-quality text data. Our method utilises a predefined ontology and defines two agents: The first agent extracts entities and triplets from the text corpus maintained by the RAG, while the second agent merges similar entities based on labels and descriptions, using embedding functions and LLM reasoning. This approach does not require fine-tuning or additional AI training, and relies solely on off-the-shelf technologies. Additionally, due to the use of RAG, the method can be used with a text corpus of arbitrary size. We applied our method to the high-pressure die casting domain, focusing on defects and their causes. In the absence of annotated datasets, manual evaluation of the resulting KGs showed over 90% precision in entity extraction and around 70% precision in triplet extraction, the main source of error being the RAG itself. Our findings suggest that this method can significantly aid in the rapid generation of customised KGs for specific applications.
DownloadPaper Citation
in Harvard Style
Rötzer F., Göbel K., Liebetreu M. and Strommer S. (2024). Knowledge Graph Extraction from Retrieval-Augmented Generator: An Application in Aluminium Die Casting. In Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO; ISBN 978-989-758-717-7, SciTePress, pages 365-376. DOI: 10.5220/0012951000003822
in Bibtex Style
@conference{icinco24,
author={Florian Rötzer and Kai Göbel and Maximilian Liebetreu and Stephan Strommer},
title={Knowledge Graph Extraction from Retrieval-Augmented Generator: An Application in Aluminium Die Casting},
booktitle={Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO},
year={2024},
pages={365-376},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012951000003822},
isbn={978-989-758-717-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO
TI - Knowledge Graph Extraction from Retrieval-Augmented Generator: An Application in Aluminium Die Casting
SN - 978-989-758-717-7
AU - Rötzer F.
AU - Göbel K.
AU - Liebetreu M.
AU - Strommer S.
PY - 2024
SP - 365
EP - 376
DO - 10.5220/0012951000003822
PB - SciTePress