loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Luca Strano 1 ; Claudia Bonanno 1 ; Francesco Ragusa 2 ; 1 ; Giovanni Farinella 3 ; 2 ; 1 and Antonino Furnari 1 ; 2

Affiliations: 1 FPV@IPLAB, DMI - University of Catania, Italy ; 2 Next Vision s.r.l. - Spinoff of the University of Catania, Italy ; 3 Cognitive Robotics and Social Sensing Laboratory, ICAR-CNR, Palermo, Italy

Keyword(s): Virtual Assistants, Visual Question Answering, Large Language Models.

Abstract: We introduce HERO-GPT, a Multi-Modal Virtual Assistant built on a Multi-Agent System designed to swiftly adapt to any procedural context minimizing the need for training on context-specific data. In contrast to traditional approaches to conversational agents, HERO-GPT utilizes a series of dynamically interchangeable documents instead of datasets, hand-written rules, or conversational examples, to provide information on the given scenario. This paper presents the system’s capability to adapt to an industrial domain scenario through the integration of a GPT-based Large Language Model and an object detector to support Visual Question Answering. HERO-GPT is capable of offering conversational guidance on various aspects of industrial contexts, including information on Personal Protective Equipment (PPE), machinery, procedures, and best practices. Experiments performed in an industrial laboratory with real users demonstrate HERO-GPT’s effectiveness. Results indicate that users clearly pref er the proposed virtual assistant over traditional supporting materials such as paper-based manuals in the considered scenario. Moreover, the performance of the proposed system are shown to be comparable or superior to those of traditional approaches, while requiring little domain-specific data for the setup of the system. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.116.15.22

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Strano, L.; Bonanno, C.; Ragusa, F.; Farinella, G. and Furnari, A. (2024). HERO-GPT: Zero-Shot Conversational Assistance in Industrial Domains Exploiting Large Language Models. In Proceedings of the 4th International Conference on Image Processing and Vision Engineering - IMPROVE; ISBN 978-989-758-693-4; ISSN 2795-4943, SciTePress, pages 74-82. DOI: 10.5220/0012688900003720

@conference{improve24,
author={Luca Strano. and Claudia Bonanno. and Francesco Ragusa. and Giovanni Farinella. and Antonino Furnari.},
title={HERO-GPT: Zero-Shot Conversational Assistance in Industrial Domains Exploiting Large Language Models},
booktitle={Proceedings of the 4th International Conference on Image Processing and Vision Engineering - IMPROVE},
year={2024},
pages={74-82},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012688900003720},
isbn={978-989-758-693-4},
issn={2795-4943},
}

TY - CONF

JO - Proceedings of the 4th International Conference on Image Processing and Vision Engineering - IMPROVE
TI - HERO-GPT: Zero-Shot Conversational Assistance in Industrial Domains Exploiting Large Language Models
SN - 978-989-758-693-4
IS - 2795-4943
AU - Strano, L.
AU - Bonanno, C.
AU - Ragusa, F.
AU - Farinella, G.
AU - Furnari, A.
PY - 2024
SP - 74
EP - 82
DO - 10.5220/0012688900003720
PB - SciTePress