loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Christian Hofmann ; Christopher May ; Patrick Ziegler ; Iliya Ghotbiravandi ; Jörg Franke and Sebastian Reitelshöfer

Affiliation: Institute for Factory Automation and Production Systems, Friedrich-Alexander-Universität Erlangen-Nürnberg, Egerlandstraße 7, 91058 Erlangen, Germany

Keyword(s): Open Vocabulary Object Detection, Pseudo-Labeling, Large Language Model, Vision Language Model.

Abstract: Large Language Models (LLMs) and Vision Language Models (VLMs) enable robots to perform complex tasks. However, many of today’s mobile robots cannot carry the computing hardware required to run these models on board. Furthermore, access via communication systems to external computers running these models is often impractical. Therefore, lightweight object detection models are often utilized to enable mobile robots to semantically perceive their environment. In addition, mobile robots are used in different environments, which also change regularly. Thus, an automated adaptation of object detectors would simplify the deployment of mobile robots. In this paper, we present a method for automated environment-specific individualization and adaptation of lightweight object detectors using LLMs and VLMs, which includes the automated identification of relevant object classes. We comprehensively evaluate our method and show its successful application in principle, while also pointing out short comings regarding semantic ambiguities and the application of VLMs for pseudo-labeling datasets with bounding box annotations. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.225.34.193

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Hofmann, C., May, C., Ziegler, P., Ghotbiravandi, I., Franke, J. and Reitelshöfer, S. (2025). Automated Individualization of Object Detectors for the Semantic Environment Perception of Mobile Robots. In Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP; ISBN 978-989-758-728-3; ISSN 2184-4321, SciTePress, pages 851-862. DOI: 10.5220/0013162500003912

@conference{visapp25,
author={Christian Hofmann and Christopher May and Patrick Ziegler and Iliya Ghotbiravandi and Jörg Franke and Sebastian Reitelshöfer},
title={Automated Individualization of Object Detectors for the Semantic Environment Perception of Mobile Robots},
booktitle={Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP},
year={2025},
pages={851-862},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013162500003912},
isbn={978-989-758-728-3},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP
TI - Automated Individualization of Object Detectors for the Semantic Environment Perception of Mobile Robots
SN - 978-989-758-728-3
IS - 2184-4321
AU - Hofmann, C.
AU - May, C.
AU - Ziegler, P.
AU - Ghotbiravandi, I.
AU - Franke, J.
AU - Reitelshöfer, S.
PY - 2025
SP - 851
EP - 862
DO - 10.5220/0013162500003912
PB - SciTePress