
– Implementing automated systems for continu-
ous evaluation and monitoring of the system’s
correctness over time.
By focusing on these areas, we aim to significantly
enhance the correctness and reliability of our RAG
system, making it an even more valuable tool for em-
ployee coaching and support in the network operator
environment.
Integration. Workstation desktops of service cen-
ter employees usually contain significantly more than
one open window. Depending on the environment and
the complexity of the respective task, three, four or
more applications fill the screen. Thus, an integration
into one single application (e.g., an ERP System like
SAP) is likely to be incomplete. For now, the solution
to provide the assistant as a web-based application is
a bearable compromise. As the whole application is
tailored in an API-first manner, switching UIs or in-
tegrating it into another UI is possible and straight-
forward.
11 CONCLUSIONS
This paper introduced a locally deployed RAG system
for employee coaching in the energy sector, focusing
on network operator processes. Our key contributions
include the following:
• Development and implementation of an on-
premise AI coaching system for the energy sector
• Evaluation of system performance across correct-
ness, performance, integration, as well as data and
runtime locality
Our findings show that local embedding models
for retrieval, combined with LLMs for answer genera-
tion, can effectively support individualized employee
coaching. The system achieved a 90.8% precision
rate for the retrieval of relevant documents from our
knowledge base. While promising, these results need
further improvement to be used in a productive envi-
ronment. The potential impact of this work extends
beyond immediate performance metrics. As AI tech-
nologies evolve, such systems could significantly en-
hance knowledge management and employee train-
ing in the energy sector. However, challenges remain
in data preprocessing, model choices and configura-
tions. In conclusion, our work represents a step to-
wards practical AI application for customer service in
the energy sector. While the results are encouraging,
further research and refinement are necessary to reach
a production-ready quality for our assistant.
ACKNOWLEDGEMENTS
This work was funded by regiocom SE, Germany.
The authors would like to express their gratitude to
regiocom for their financial support and for providing
access to the necessary data and resources that made
this research possible.
REFERENCES
Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J.,
Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G.,
Askell, A., Agarwal, S., Herbert-Voss, A., Krueger,
G., Henighan, T., Child, R., Ramesh, A., Ziegler,
D. M., Wu, J., Winter, C., Hesse, C., Chen, M., Sigler,
E., Litwin, M., Gray, S., Chess, B., Clark, J., Berner,
C., McCandlish, S., Radford, A., Sutskever, I., and
Amodei, D. (2020). Language models are few-shot
learners.
Bundesnetzagentur (2019). Marktstammdatenregister.
https://www.marktstammdatenregister.de. © Bun-
desnetzagentur f
¨
ur Elektrizit
¨
at, Gas, Telekommunika-
tion, Post und Eisenbahnen — DL-DE-BY-2.0.
Chase, H. (2022). LangChain.
Chroma (2023). Chromadb. Available at: https://www.
trychroma.com/.
Corporation, N. (2024). NVIDIA Multi-Instance GPU User
Guide. Accessed: 2024-08-16.
Fu, Z., Lam, W., Yu, Q., So, A. M.-C., Hu, S., Liu, Z., and
Collier, N. (2023). Decoder-only or encoder-decoder?
interpreting language model as a regularized encoder-
decoder. https://arxiv.org/abs/2304.04052.
Guu, K., Lee, K., Tung, Z., Pasupat, P., and Chang, M.-W.
(2020). Realm: Retrieval-augmented language model
pre-training.
Harries, J. P. https://huggingface.co/jphme/
Llama-2-13b-chat-german. Accessed on 2024-
08-05.
Joshi, V., He, W., sun Seo, J., and Rajendran, B. (2021). Hy-
brid in-memory computing architecture for the train-
ing of deep neural networks.
Langchain (2024). https://python.langchain.com/v0.2/docs/
integrations/vectorstores/. Accessed on 2024-08-03.
Levy, A., Shalom, B. R., and Chalamish, M. (2024). A
guide to similarity measures.
Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin,
V., Goyal, N., K
¨
uttler, H., Lewis, M., tau
Yih, W., Rockt
¨
aschel, T., Riedel, S., and Kiela,
D. (2021). Retrieval-augmented generation for
knowledge-intensive nlp tasks.
Liu, J. (2022). LlamaIndex.
lm-sys (2024). FastChat: An open platform for training,
serving, and evaluating large language model based
chatbots.
Meta (2024). https://llamaimodel.com/commercial-use/.
Accessed on 2024-08-01.
ICAART 2025 - 17th International Conference on Agents and Artificial Intelligence
704