
Coordination for Complex Road Conditions at Unsignalized
Intersections: A MADDPG Method with Enhanced Data Processing
Ruo Chen
1
, Yang Zhu
1,2,∗
and Hongye Su
1,2
1
Ningbo Innovation Center, Zhejiang University, Ningbo, 315100, China
2
College of Control Science and Engineering, Zhejiang University, Hangzhou, 310027, China
{22360415, zhuyang88}@zju.edu.cn, hysu@iipc.zju.edu.cn
Keywords:
Deep Reinforcement Learning, MADDPG, Data Processing, Sliding Control.
Abstract:
In this paper, we use deep reinforcement learning to enable connected and automated vehicles (CAVs) to drive
in a intersection with human-driven vehicles. The multi-agent deep deterministic policy gradient (MADDPG)
algorithm is improved to be more efficient for data processing, so that it can solve the problem of learning bot-
tlenecks in complex environments, and use sliding control to execute control strategies. Finally, the feasibility
of the method is verified in the simulation environment of CARLA.
1 INTRODUCTION
Intersections are often considered to be the main
source of traffic congestion and energy waste (Shi-
razi and Morris, 2017). Based on the existing traffic
system and human driving environment, many meth-
ods have been proposed to increase the operation effi-
ciency of traffic lights, like (Feng and Head, 2015).
Besides, traffic flow fundamental diagram (FD) is
viewed as the basis of traffic flow theory and been
used in many cases, like (Zhou and Zhu, 2020). In ad-
dition, deep learning methods (Zhang and Ge, 2024),
(Zhang and Li, 2023) and tree search based algorithm
(Li and Wang, 2006), (Xu and Zhang, 2020) have
been gradually applied in the transportation field. In
recent years, the rapid development of autonomous
vehicles is expected to solve the problem of intersec-
tion congestion. The intersection management system
based on global information can better deal with the
data of automatic driving vehicles and the environ-
ment, so as to replace the traditional traffic light sys-
tem to improve traffic efficiency. Autonomous inter-
section management (AIM) is tailored for CAVs, aim-
ing at replacing the conventional traffic control strate-
gies (Wu and Chen, 2019). Graph-based modeling is
often used to solve traffic congestion (Chen and Xu,
2022). Meanwhile, some methods focus in reducing
energy consumption (Malikopoulos and Cassandras,
2018).
At present, most methods can only deal with the
∗
Corresponding author.
control strategy problem in a single environment,
such as all vehicles are driven by humans, and control
facilities are traffic lights (Shobana and Shakunthala,
2023), or all vehicles are autonomous vehicles with-
out traffic lights (Wang and Gong, 2024). However,
real-world environments are often more complex and
diverse, such as pedestrians and human-driven vehi-
cles at intersections, which can only be detected but
not controlled by AI systems. Secondly, a method
that simply generates an overall policy for passing
through an intersection have difficulties to cope with
sudden disturbances, and errors in the control process
can greatly affect the robustness of this policy. Due
to the high dynamics and randomness of unsignalized
control intersections, how to design efficient and safe
multi-vehicle cooperative motion planning methods is
still a challenging problem.
As a deep reinforcement learning method (Lowe
et al., 2020), MADDPG can adapt to the input
changes and make corresponding responses to the
changing environment. And the action of the agent
is given by the neural network, which can ensure the
real-time action. Compared to traditional exhaustive
methods, MADDPG is less complex, but it also has a
performance impact (Xu and Liu, 2024).
294
Chen, R., Zhu, Y. and Su, H.
Coordination for Complex Road Conditions at Unsignalized Intersections: A MADDPG Method with Enhanced Data Processing.
DOI: 10.5220/0013133100003941
Paper published under CC license (CC BY-NC-ND 4.0)
In Proceedings of the 11th International Conference on Vehicle Technology and Intelligent Transport Systems (VEHITS 2025), pages 294-300
ISBN: 978-989-758-745-0; ISSN: 2184-495X
Proceedings Copyright © 2025 by SCITEPRESS – Science and Technology Publications, Lda.