One of the specificities of our work is that we fo-
cused on very fluctuating data due to the COVID-19
pandemic. The main contributions of the paper are
the study of the existing situation at Paris-CDG, the
needs, and then the identification of three main tasks
real-time prediction of parking delays, forecasting,
and explicability of predictions. Concerning the mod-
eling, we have identified five categories of data for our
problem: flight data, data on the progress of passen-
ger processes (security, boarding, Etc.), weather data,
current delay status data, Etc. Finally, we have con-
ducted an empirical study to perform the data, feature,
and model selection, and we provide an overview of
the results obtained.
2 FLIGHT DELAYS: STATE OF
THE ART
The problem of aircraft delays is a well-known prob-
lem. Different models among random forests, sup-
port vector machines and logistic regression have
been studied (Natarajan et al., 2018) to predict
whether a flight will be delayed. A logistic regres-
sion model to predict a class of departing flights
is proposed in (Nigam and Govinda, 2017). In
(Venkatesh et al., 2017), the authors study arrival
delays and propose different approaches to predict
whether a specific flight will be delayed. In (Ibrahem
et al., 2021) the authors compared different machine-
learning approaches (random forest, logistic regres-
sion, Bayesian naive classifier, and decision trees)
for delay prediction on arrival. In (Tang, 2021), a
comparison of seven binary models is performed. In
(Yi et al., 2021), the authors have proposed several
stacked approaches for the Boston Logan Interna-
tional Airport flight dataset from January to Decem-
ber 2019.
We also find different studies on the impact of
different factors on flight delays. For example, the
authors of (Wang et al., 2003) studied the impact
of flight connections on delay. In (Markovic et al.,
2008), a statistical study on the impact of weather
at Frankfurt airport is proposed. In (Yogita Borse
et al., 2020), the authors focused on weather data
as the main feature to predict the delay class. In
(Esmaeilzadeh and Mokhtarimousavi, 2020), a sup-
port vector machine (SVM) model is used. Based on
20 days, this latter study examines some causes of
air traffic delays at the three major airports in New
York City. In the study (Cai et al., 2021), a deep
learning approach for flight delay prediction consid-
ering a multi-airport scenario is proposed. Regarding
regression-based approaches, the authors (Rebollo
and Balakrishnan, 2014) have proposed approaches
based on classification and regression with random
forests for US airports.
To our knowledge, there is only one study that at-
tempts to predict takeoff delay, but it only attempts
to predict one hour before the estimated takeoff time,
and it is intended for the Maastricht (Dalmau-Codina
et al., 2019).
3 Paris-CDG AIRPORT
In this section, we provide factual information about
Paris-CDG airport. Paris-CDG airport is the most
important airport in France. It was opened in 1974
to cope with the saturation of Paris Orly airport (the
main Parisian airport before the opening of Paris-
CDG). It is located north of Paris and is the hub of
Air France. This company represents 50% of the traf-
fic at Paris-CDG. Three main terminals numbered 1
to 3 makeup Paris-CDG.
At Paris-CDG, there is more than one flight de-
parture per minute. There are about 720, 000 flights
per year, or about 2, 000 flights per day. On aver-
age, there are 145 passengers per flight. At Paris-
CDG, resources are currently planned using solutions
powered by constraint solvers. Among the critical
resources are the parking lots (or ”stands”) assigned
to the flights. When a parking lot is released late,
this can lead to complex scheduling changes and cas-
cading delays. It is, therefore, essential to anticipate
and predict these delays as accurately as possible, ex-
plain them, and propose actions to limit them. Before
presenting the problem, we will first introduce some
terms. We call rotation the set composed of an ar-
rival flight and a departure flight. This set generally
consists of two flights, but there may be only one, in
which case the flight begins a new rotation. The ro-
tation period is the time between the arrival of an air-
craft (landing) and its departure (takeoff). A flight has
a scheduled departure time, called SOBT (Scheduled
Off-Block Time), at which it is supposed to leave its
parking area. The moment when the flight leaves its
parking position is called AOBT. The delay is then the
period between AOBT and SOBT.
3.1 Milestones Before Off-Block
We present here the main milestones preceding the
pushback of an aircraft. These milestone stands are
presented in Figure. 1. Before the flight arrives at
the airport, the flight estimates its arrival time at the
block (EIBT: Estimated In-Block Time). The AIBT
(Actual In Block Time) is the right time when the
Predicting Off-Block Delays: A Case Study at Paris - Charles de Gaulle International Airport
181