Conditioned Generative AI for Synthetic Training of 6D Object Pose Detection
Mathijs Lens, Aaron Van Campenhout, Toon Goedemé
2025
Abstract
In this paper, we propose a method to generate synthetic training images for a more complex computer vision task compared to image classification, specifically 6D object pose detection. We demonstrate that conditioned diffusion models can generate unlimited training images for training an object pose detection model for a custom object type. Moreover, we investigate the potential of (automatically) filtering out ill-produced images in the dataset, which increases the quality of the image dataset, and show the importance of finetuning the trained model with a limited amount of real-world images to bridge the remaining sim2real domain gap. We demonstrate our pipeline in the use case of parcel box detection for the automation of delivery vans. All code is publicly available on our GitLab https://gitlab.com/EAVISE/avc/generative-ai-synthetic-training-pose-detection.
DownloadPaper Citation
in Harvard Style
Lens M., Van Campenhout A. and Goedemé T. (2025). Conditioned Generative AI for Synthetic Training of 6D Object Pose Detection. In Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP; ISBN 978-989-758-728-3, SciTePress, pages 324-331. DOI: 10.5220/0013130600003912
in Bibtex Style
@conference{visapp25,
author={Mathijs Lens and Aaron Van Campenhout and Toon Goedemé},
title={Conditioned Generative AI for Synthetic Training of 6D Object Pose Detection},
booktitle={Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP},
year={2025},
pages={324-331},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013130600003912},
isbn={978-989-758-728-3},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP
TI - Conditioned Generative AI for Synthetic Training of 6D Object Pose Detection
SN - 978-989-758-728-3
AU - Lens M.
AU - Van Campenhout A.
AU - Goedemé T.
PY - 2025
SP - 324
EP - 331
DO - 10.5220/0013130600003912
PB - SciTePress