images and annotations. A pipeline is presented,
which can be applied to various objects and environ-
ment settings and is extremely facile to use to any-
one for synthesising training data from 3D source
data. Annotated data, 3D source data, scripts and
tutorials are published at https://github.com/LJMP/
synthetic-industrial-dataset.
We show in multiple experiments, directly learning
safety classes end-to-end on our dataset, instead of
using a lookup table, substantially increases the predic-
tion scores and quality on real-world data. This con-
firms our hypothesis that small datasets benefit from
training abstracted similarities in different objects.
The results from these experiments prove the bene-
fit of using our dataset for logistic-relevant industrial
tasks. Expanding experiments with our dataset on
different architectures can consolidate the effect of
grouping on network performance. Additionally, our
pipeline allows synthesising new datasets for novel
scenes and environments, like consumer stores, ware-
houses and groceries stores.
Further work on this research focuses on creating a
ROS-compatible (Garage and Laboratory, 2021) drop-
in replacement for the commonly used 2D costmap
generator (Marder-Eppstein et al., 2021). This will
allow a robot’s trajectory planning to take a more op-
timal approach. The safety-class-adjusted margin to
an obstacle in the semantic costmap can now be con-
sidered, instead of a blunt arbitrary margin around
obstacles, thus enabling a risk-aware navigation, as
previously described in fig. 2.
ACKNOWLEDGEMENTS
This work is partially supported by a grant of the
BMWi ZIM program, no. ZF4029424HB9
REFERENCES
BlendFab (2021). 5000+ Different free assets available for
download.
CGTrader (2021). Search thousands of 3D models on sale.
Cywka, H. A. (2021). HDRi maps for CG artists.
Foundation, B. (v2.79b). Blender free and open source
complete 3D creation pipeline.
Garage, W. and Laboratory, S. A. I. (2021). The Robot
Operating System (ROS).
Geiger, A., Lenz, P., and Urtasun, R. (2012). Are we ready
for autonomous driving? The KITTI vision benchmark
suite. In 2012 IEEE Conference on Computer Vision
and Pattern Recognition, Providence, RI, USA, June 16-
21, 2012, pages 3354–3361. IEEE Computer Society.
He, K., Gkioxari, G., Doll
´
ar, P., and Girshick, R. B. (2017).
Mask R-CNN. CoRR, abs/1703.06870.
Li, W., Saeedi, S., McCormac, J., Clark, R., Tzoumanikas,
D., Ye, Q., Huang, Y., Tang, R., and Leuteneg-
ger, S. (2018). InteriorNet: Mega-scale Multi-
sensor Photo-realistic Indoor Scenes Dataset. CoRR,
abs/1809.00716.
Lin, T., Maire, M., Belongie, S. J., Bourdev, L. D., Girshick,
R. B., Hays, J., Perona, P., Ramanan, D., Doll
´
ar, P.,
and Zitnick, C. L. (2014). Microsoft COCO: Common
Objects in Context. CoRR, abs/1405.0312.
Marder-Eppstein, E., Lu!!, D. V., and Hershberger, D. (2021).
costmap 2d.
Menze, M. and Geiger, A. (2015). Object scene flow for au-
tonomous vehicles. In IEEE Conference on Computer
Vision and Pattern Recognition, CVPR 2015, Boston,
MA, USA, June 7-12, 2015, pages 3061–3070. IEEE
Computer Society.
Models, B. D. (2021). The Original Blender 3D Model
Repository.
Reimer, A. (2021). royalty free assets for your cgi produc-
tions.
Richter, S. R., Vineet, V., Roth, S., and Koltun, V. (2016).
Playing for Data: Ground Truth from Computer
Games. In Leibe, B., Matas, J., Sebe, N., and Welling,
M., editors, European Conference on Computer Vi-
sion (ECCV), volume 9906 of LNCS, pages 102–118.
Springer International Publishing.
Ros, G., Sellart, L., Materzynska, J., V
´
azquez, D., and L
´
opez,
A. M. (2016). The SYNTHIA Dataset: A Large Collec-
tion of Synthetic Images for Semantic Segmentation of
Urban Scenes. In 2016 IEEE Conference on Computer
Vision and Pattern Recognition, CVPR 2016, Las Vegas,
NV, USA, June 27-30, 2016, pages 3234–3243. IEEE
Computer Society.
Sketchfab (2021). Publish & find 3D models online.
Song, S., Lichtenberg, S. P., and Xiao, J. (2015). SUN
RGB-D: A RGB-D scene understanding benchmark
suite. In 2015 IEEE Conference on Computer Vision
and Pattern Recognition (CVPR), pages 567–576.
Song, S., Yu, F., Zeng, A., Chang, A. X., Savva, M., and
Funkhouser, T. A. (2016). Semantic Scene Completion
from a Single Depth Image. CoRR, abs/1611.08974.
Tobin, J., Fong, R., Ray, A., Schneider, J., Zaremba, W., and
Abbeel, P. (2017). Domain randomization for transfer-
ring deep neural networks from simulation to the real
world. In 2017 IEEE/RSJ International Conference on
Intelligent Robots and Systems (IROS), pages 23–30.
TurboSquid, I. (2021). Free 3D Models and Commercial
Use 3D Models.
waspinator (2018). pycococreator is a set of tools to help
create coco datasets.
Wrenninge, M. and Unger, J. (2018). Synscapes: A Pho-
torealistic Synthetic Dataset for Street Scene Parsing.
CoRR, abs/1810.08705.
Zaal, G. (2021). high quality HDRIs for free, no catch.
Zhang, Y., Song, S., Yumer, E., Savva, M., Lee, J., Jin,
H., and Funkhouser, T. A. (2016). Physically-Based
Rendering for Indoor Scene Understanding Using Con-
volutional Neural Networks. CoRR, abs/1612.07429.
VISAPP 2022 - 17th International Conference on Computer Vision Theory and Applications
990