Knowledge Graph Extraction from Retrieval-Augmented Generator: An Application in Aluminium Die Casting

Florian Rötzer, Kai Göbel, Maximilian Liebetreu, Stephan Strommer

2024

Abstract

We present a novel, efficient, and scalable approach for generating knowledge graphs (KGs) tailored to specific competency questions, leveraging large language model (LLM)-based retrieval-augmented generation (RAG) as a source of high-quality text data. Our method utilises a predefined ontology and defines two agents: The first agent extracts entities and triplets from the text corpus maintained by the RAG, while the second agent merges similar entities based on labels and descriptions, using embedding functions and LLM reasoning. This approach does not require fine-tuning or additional AI training, and relies solely on off-the-shelf technologies. Additionally, due to the use of RAG, the method can be used with a text corpus of arbitrary size. We applied our method to the high-pressure die casting domain, focusing on defects and their causes. In the absence of annotated datasets, manual evaluation of the resulting KGs showed over 90% precision in entity extraction and around 70% precision in triplet extraction, the main source of error being the RAG itself. Our findings suggest that this method can significantly aid in the rapid generation of customised KGs for specific applications.

Download


Paper Citation


in Harvard Style

Rötzer F., Göbel K., Liebetreu M. and Strommer S. (2024). Knowledge Graph Extraction from Retrieval-Augmented Generator: An Application in Aluminium Die Casting. In Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO; ISBN 978-989-758-717-7, SciTePress, pages 365-376. DOI: 10.5220/0012951000003822


in Bibtex Style

@conference{icinco24,
author={Florian Rötzer and Kai Göbel and Maximilian Liebetreu and Stephan Strommer},
title={Knowledge Graph Extraction from Retrieval-Augmented Generator: An Application in Aluminium Die Casting},
booktitle={Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO},
year={2024},
pages={365-376},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012951000003822},
isbn={978-989-758-717-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO
TI - Knowledge Graph Extraction from Retrieval-Augmented Generator: An Application in Aluminium Die Casting
SN - 978-989-758-717-7
AU - Rötzer F.
AU - Göbel K.
AU - Liebetreu M.
AU - Strommer S.
PY - 2024
SP - 365
EP - 376
DO - 10.5220/0012951000003822
PB - SciTePress