encouraging and confirms the generalization poten-
tial that RL offers as a framework, even when using a
relatively simple off-the-shelf algorithm. We believe
these factors suggest that reinforcement learning ap-
proaches to critical driving events deserve further in-
vestigations.
5 CONCLUSIONS AND FUTURE
WORK
This paper proposes a deep reinforcement learning
approach for self-driving in pre-accident scenarios,
with the aim of investigating its effectiveness and
safeness specifically in critical circumstances, when
damage minimization should be the only priority.
The control system is based on an end-to-end de-
sign that directly maps raw sensory data, coming from
a single RGB camera, to vehicle commands. The im-
ages gathered by the camera are subjected to feature
extraction by means of a pre-trained ResNet and then
fed into the RL algorithm for the learning process
that takes place in a virtual environment. Model up-
dates are driven by an hand-crafted reward function,
specifically designed to encourage emergency maneu-
vers in critical situations, taking into account colli-
sion damage and other minor factors, namely speed
control, road following and covered distance. Experi-
ments were carried out on several typical pre-accident
scenario recreated in the CARLA simulated world,
where the autonomous vehicle showed promising per-
formance, managing, in the vast majority of cases, to
avoid collisions with the other vehicle in the scene.
Despite these encouraging results, the work leaves
room for future improvements. First, as previously
stated, our model is designed to be embedded in a
broader system involving a module that takes care of
driving in ordinary circumstances. To this aim, a neu-
ral network could be trained to identify critical scenar-
ios and act as a switch between the two driving sys-
tems, passing the control to the emergency one when
danger is detected.
Furthermore, the use of a more advanced rein-
forcement learning algorithm is also worth consider-
ing. Algorithms like Deep Deterministic Policy Gra-
dient (DDPG) (Lillicrap et al., 2015) or Soft Actor-
Critic (SAC) (Haarnoja et al., 2018) have recently
shown good performances and their ability to deal
with continuous action spaces could favour a more ac-
curate and smooth driving style.
ACKNOWLEDGEMENTS
This work was partially funded by the H2020 project
AI4Media “A European Excellence Centre for Media,
Society and Democracy” under GA 951911.
REFERENCES
Badrinarayanan, V., Kendall, A., and Cipolla, R. (2017).
Segnet: A deep convolutional encoder-decoder ar-
chitecture for image segmentation. IEEE transac-
tions on pattern analysis and machine intelligence,
39(12):2481–2495.
Bojarski, M., Del Testa, D., Dworakowski, D., Firner,
B., Flepp, B., Goyal, P., Jackel, L. D., Monfort,
M., Muller, U., Zhang, J., et al. (2016). End to
end learning for self-driving cars. arXiv preprint
arXiv:1604.07316.
Broggi, A., Buzzoni, M., Debattisti, S., Grisleri, P., Laghi,
M. C., Medici, P., and Versari, P. (2013). Exten-
sive tests of autonomous driving technologies. IEEE
Transactions on Intelligent Transportation Systems,
14(3):1403–1415.
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-
Fei, L. (2009). Imagenet: A large-scale hierarchical
image database. In 2009 IEEE conference on com-
puter vision and pattern recognition, pages 248–255.
Ieee.
Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., and
Koltun, V. (2017). Carla: An open urban driving sim-
ulator. In Conference on robot learning, pages 1–16.
PMLR.
Geiger, A., Lenz, P., and Urtasun, R. (2012). Are we ready
for autonomous driving? the kitti vision benchmark
suite. In 2012 IEEE conference on computer vision
and pattern recognition, pages 3354–3361. IEEE.
Gonz
´
alez, A., V
´
azquez, D., L
´
opez, A. M., and Amores, J.
(2016). On-board object detection: Multicue, multi-
modal, and multiview random forest of local experts.
IEEE transactions on cybernetics, 47(11):3980–3990.
Haarnoja, T., Zhou, A., Abbeel, P., and Levine, S. (2018).
Soft actor-critic: Off-policy maximum entropy deep
reinforcement learning with a stochastic actor. In
International conference on machine learning, pages
1861–1870. PMLR.
He, K., Zhang, X., Ren, S., and Sun, J. (2016). Deep resid-
ual learning for image recognition. In Proceedings of
the IEEE conference on computer vision and pattern
recognition, pages 770–778.
Kendall, A., Hawke, J., Janz, D., Mazur, P., Reda, D., Allen,
J.-M., Lam, V.-D., Bewley, A., and Shah, A. (2019).
Learning to drive in a day. In 2019 International Con-
ference on Robotics and Automation (ICRA), pages
8248–8254. IEEE.
Levinson, J., Askeland, J., Becker, J., Dolson, J., Held, D.,
Kammel, S., Kolter, J. Z., Langer, D., Pink, O., Pratt,
V., et al. (2011). Towards fully autonomous driving:
Reinforced Damage Minimization in Critical Events for Self-driving Vehicles
265