
(Lopez et al., 2018) traffic simulator, which simulates
real traffic scenarios. SUMO offers a designated ve-
hicle class known as emergency, facilitating the sim-
ulation of emergency vehicles and their unique priv-
ileges. Vehicles classified as emergency vehicles are
automatically assigned default shapes and sizes suit-
able for rescue operations. They possess special pre-
vileges, such as the ability to overtake on the right side
in all traffic scenarios. Additionally, these vehicles are
permitted to traverse lanes specifically designated for
”emergency” use, which may restrict normal passen-
ger traffic. This functionality within SUMO enables
an accurate modeling of emergency vehicle behaviors
and traffic dynamics. We showcase that RLLS can re-
duce the EV travel time significantly as compared to
this default (SUMO emergency) baseline.
Our second contribution is the introduction of ex-
perimental settings which can help with evaluating the
worst case performance of algorithms. To evaluate
the worst case performance of algorithms, we intro-
duce slowing down vehicles at regular time intervals
into the traffic. These slowing down vehicles block
the traffic and introduce more congestion in the traffic
network. This simulation setting helps with evalua-
tion of the robustness of algorithms. Using a wide
range of experiments, we showcase that our RLLS
model trained using normal traffic scenarios can gen-
eralize well to these worst case settings and we do not
need to train separate models for the different traffic
scenarios.
In addition, we also evaluate the performance of
our approaches on real word dataset by using real
time speed data from New York City traffic (NYD,
2022) to calibrate traffic in a simulation. In this set-
ting as well, RLLS model outperforms the existing
approaches. In all the settings, for purposes of realis-
tic modeling, we also allow the EV to communicate
with other vehicles within a communication distance
c
d
. This is equivalent to communication done by an
EV using a siren in real world scenarios.
2 RELATED WORK
The first thread of research focuses on finding strate-
gies for the assignment of EV to incoming requests
(emergency calls) (Ghosh and Varakantham, 2018;
Haghani et al., 2003; Joe et al., 2022; Schmid, 2012).
(Schmid, 2012), formulates the problem of finding
the optimal dispatch strategy as an approximate dy-
namic programming problem and uses value function
approximation strategies to find the assignment of EV
to emergency calls at each timestep. (Ghosh and
Varakantham, 2018) formulate the problem as an in-
teger optimization problem and use Benders decom-
position to find a solution to the integer optimization
problem.
The second thread of research focuses on finding
the best route for EV to travel from base station to the
disaster location and from disaster location to hospi-
tal(s) (Giri et al., 2022; Su et al., 2022; Jotshi et al.,
2009). A sub-thread of this line of work, is the co-
ordination of traffic signal control to mitigate traffic
congestion and as a result to allow EV to reach the
destination quickly (Asaduzzaman and Vidyasankar,
2017; Chen et al., 2020; Chu et al., 2019; Van der Pol
and Oliehoek, 2016).
The last thread of research which is most rele-
vant to this paper is related to improving the lane
level dynamics of Emergency vehicles (Agarwal and
Paruchuri, 2016; Ismath et al., 2019; Cao and Zhao,
2022). In this thread of work, focus is on under-
standing and computing the value of each lane so to
pick the best feasible lane to optimize on the travel
time. (Zhang et al., 2022) focuses on safe lane-
changing trajectories for autonomous driving in ur-
ban environments to enhance the efficiency as well
as safety. (Maleki et al., 2023) studies a real-time
optimal cooperative lane change strategy leverag-
ing V2V communication, prioritizing safety and effi-
ciency through constrained optimization. While these
approaches primarily address normal traffic scenar-
ios using heuristic methods, our strategy formulates
the challenge of minimizing EV traversal time as an
MDP and employs RL techniques. Additionally, we
introduce scenarios with random slowing vehicles to
add complexity and enhance the realism of the simu-
lation. There are different assumptions and aspects of
the problem e.g., nature of communication, range of
communication, communication protocol to use, priv-
ileges of EV etc. that can affect the lane level decision
making process. Please note that while traffic simu-
lators will need to handle vehicle routing which in-
volves changing of lanes, decisions to switch lanes are
myopic in nature in general even though route plan-
ning tends to get optimized in a global sense.
3 BACKGROUND
The handling of lane level dynamics in EV traversal
consists of picking the best lane for EV to travel while
traversing a multi lane stretch of road. Similar to ex-
isting work, we assume the presence of a V2V single
hop communication model where EV can obtain the
position and speed of a vehicle in any lane up to a
fixed communication distance, c
d
via V2V commu-
nication. It can also send lane change requests to
Improving Lane Level Dynamics for EV Traversal: A Reinforcement Learning Approach
135