4.4.3 Actor
The TD3 actor, initially trained in simulation and then
in the real world, performed better in the simpler
HAW environment compared to Knuffingen. Real-
world training, supported by the simulation expe-
rience buffer, prevented catastrophic forgetting but
still underperformed relative to simulation. While
the supervised-trained actor achieved reasonable real-
world results, the TD3 actor struggled, likely due to
the sim-to-real gap or limitations in lane segmentation
input. Testing with raw camera data did not yield per-
formance improvements, indicating a need for more
robust models or alternative learning algorithms for
real-world applications.
5 CONCLUSION
This paper presented the development of a compre-
hensive system for testing and training machine learn-
ing (ML) and reinforcement learning (RL) models for
autonomous driving within a miniature environment.
The system features a digital twin of a 1:87 scaled city
model at HAW Hamburg, enabling simulated tests
and real-world training for RL methods.
The system employs overhead cameras for track-
ing the position and orientation of a 1:87 scale au-
tonomous vehicle, the tinycar, in real time. The tiny-
car, equipped with a front-facing camera for envi-
ronmental perception, demonstrates effective control
via a low-latency wireless camera stream, supporting
real-world RL experiments.
Initial results show that the system is capable of
training neural networks to autonomously navigate
and handle intersections. The high-precision track-
ing system, combined with automatic repositioning,
achieves an 84% episode reset success rate without
human intervention. The encoder successfully re-
duces the input data dimensionality, minimizing the
sim-to-real gap and shortening the training time re-
quired in real-world environments. Models trained
with supervised learning demonstrate effective per-
formance, particularly in handling intersections.
However, RL models trained in simulation exhibit
challenges when transferred to more complex real-
world environments. While intersection handling in
simulation is reliable, real-world performance, partic-
ularly in the Knuffingen environment, reveals issues
such as oscillations and difficulties in lane selection
at intersections. The real-world transfer of models
shows limited generalization and marginal improve-
ment after additional training, indicating the need for
further refinement.
Overall, this work provides a functional frame-
work for investigating RL in real-world settings and
addressing the sim-to-real gap. Future research will
focus on improving the tracking system, extending the
gym environment to encompass more complex sce-
narios, and exploring advanced perception methods,
such as autoencoders, to further reduce the sim-to-
real gap and enhance the real-world applicability of
simulation-trained models.
REFERENCES
Fujimoto, S., Hoof, H., and Meger, D. (2018). Addressing
function approximation error in actor-critic methods.
In Proceedings of the 35th International Conference
on Machine Learning, pages 1587–1596. PMLR.
Lillicrap, T. P., Hunt, J. J., Pritzel, A., Heess, N., Erez, T.,
Tassa, Y., Silver, D., and Wierstra, D. (2016). Contin-
uous control with deep reinforcement learning. In In-
ternational Conference on Learning Representations
(ICLR).
Miniatur Wunderland Hamburg (2024).
Knuffingen: Miniatur wunderland.
https://www.miniatur-wunderland.de/wunderland-
entdecken/welten/knuffingen/. Accessed: 2024-09-
29.
Pareigis, S. and Maaß, F. L. (2023). Improved robust neural
network for sim2real gap in system dynamics for end-
to-end autonomous driving. In Gini, G., Nijmeijer,
H., Burgard, W., and Filev, D., editors, Informatics
in Control, Automation and Robotics, volume 836 of
Lecture Notes in Networks and Systems, pages 1–21.
Springer, Cham.
Pareigis, S., Tiedemann, T., Kasten, M., Stehr, M., Schnir-
pel, T., Schwalb, L., and Burau, H. (2021). K
¨
unstliche
Intelligenz in der Miniaturautonomie. In Fachtagung
des gemeinsamen Fachausschusses Echtzeitsysteme
von Gesellschaft f
¨
ur Informatik e.V. (GI), VDI/VDE-
Gesellschaft f
¨
ur Mess- und Automatisierungstechnik
(GMA) und Informationstechnischer Gesellschaft im
VDE (ITG) 2020, Informatik aktuell, pages 41–50.
Paull, L., Tani, J., Knepper, R., Rus, D., and Leonard, J.
(2017). Duckietown: an open, inexpensive and flex-
ible platform for autonomy education and research.
Proceedings of the 2017 IEEE International Con-
ference on Robotics and Automation (ICRA), pages
1497–1504.
Tiedemann, T., Fuhrmann, J., Paulsen, S., Schnirpel, T.,
Sch
¨
onherr, N., Buth, B., and Pareigis, S. (2019).
Miniature autonomy as one important testing means in
the development of machine learning methods for au-
tonomous driving : How ml-based autonomous driv-
ing could be realized on a 1:87 scale. In Interna-
tional Conference on Informatics in Control, Automa-
tion and Robotics 2019, ICINCO 2019 : proceedings
of the 16th International Conference on Informatics in
Control, Automation and Robotics, pages 483–488.
Tiedemann, T., Schwalb, L., Kasten, M., Grotkasten, R.,
ICINCO 2024 - 21st International Conference on Informatics in Control, Automation and Robotics
316