these challenges, losses in global stock markets
appeared to have been notably mitigated (Le et al.,
2021). Wang et al. argued that in extreme scenarios,
there exists a robust causal relationship between
investor sentiment and the crude oil futures market
(Wang et al., 2021).
Therefore, how to accurately predict WTI crude oil
prices has become a top priority, and many complex
and innovative models have been built to predict WTI
crude oil prices. Traditional econometric models play
a crucial role across diverse economic domains,
including the prediction of WTI crude oil prices.
Herrera employed RiskMetrics and GARCH models
for short-term forecasts, Exponential GARCH
(EGARCH) for medium-term horizons, and Markov-
switching GARCH (MS-GARCH) for long-term
predictions (Wang and Liu, 2016 & Herrera, Hu and
Pastor, 2018). Indeed, machine learning methods and
hybrid models have gained significant traction in the
realm of crude oil price prediction. A multitude of
scholars have conducted extensive research in this
area. Wu et al. leveraged a Convolutional Neural
Network (CNN) model to extract text features from
news media texts and Google Trends data, assessing
their efficacy in explaining crude oil price predictions
(Wu et al., 2021). Li et al. investigated the enduring
impacts of global crude oil production and economic
activities on crude oil prices. They devised a hybrid
model incorporating Genetic Algorithm Optimized
Support Vector Machine (GASVM) and Back
Propagation Neural Network (BPNN) to analyze
monthly oil price data for predictive purposes (Li,
Zhu and Wu, 2019). Wang fused a multi-layer
perceptron with a neural network to develop an
Elman Recurrent Neural Network (ERNN) model for
empirical crude oil price forecasting (Wang and
Wang, 2016).
Overall, these studies highlight the need for
accurate prediction of the global price of WTI crude.
The econometric model can predict short-term crude
oil prices more accurately, but the nonlinear, complex
and non-stationary characteristics of crude oil prices
make the model have certain flaws. The machine
learning model uses linear and nonlinear models to
enrich the experimental process and set up various
scenarios to improve the applicability of the model.
Then, more complex deep learning models and
numerous neural network algorithms were added to
forecasts, which can extract effective information and
focus on trends and changes in time series. This paper
will primarily concentrate on utilizing the ARIMA
and GARCH models for crude oil price prediction.
The aim is to offer a valuable reference for crude oil
futures investors, aiding them in making informed
decisions and conducting risk mitigation transactions.
By leveraging these models, investors can potentially
reduce their losses to a considerable extent.
2 METHODOLOGY
2.1 Data Source
Fred's global WTI crude oil price index is the source
of the data used in this investigation. This dataset
comprises monthly average prices of crude oil in U.S.
dollars. It is meticulously documented, with no
instances of missing values or outliers. For analysis,
the paper has selected price data spanning from
January 2000 to April 2024, amounting to 292
observations. The first 276 observations, covering the
period from January 2000 to December 2022,
constitute the training set, while the remaining 16
observations, spanning from January 2023 to April
2024, are designated as the test set. Ultimately, a
rolling forecast approach is adopted to predict the
WTI crude oil price for May, June, and July 2024.
The internationally recognized crude oil
benchmark prices are WTI and Bren crude oil prices.
This paper selects the price of WTI crude oil, which
occupies the leading position in terms of global
commodity futures trading volume because of its
advantages of transparent quotations and high
liquidity, as well as the status of U.S. super crude oil
buyers and the world influence of the New York
Stock Exchange. At the same time, this paper selected
prices rather than yields, average prices rather than
closing prices, and monthly data over the past decade
rather than all data.
2.2 Variable Selection
Crude oil prices are obviously volatile and cyclical.
Indeed, the prices of crude oil can experience
substantial fluctuations, with the potential for
significant rises or falls within relatively brief time
frames, and they can experience rising and falling
cycles over a span of several years or even ten years.
Changes in oil prices are often affected by
geopolitics, technological progress, and the
macroeconomic environment, as illustrated in Figure
1: