features can help in high prediction performance and
thus, due care should be taken to select a set of
relevant and non-redundant features However,
conventional feature selection methods require
number of features to be extracted or a strict
assumption of conditional independence, and still
couldn't provide the minimal set of features that are
most relevant and non-redundant for the study. The
basic assumption of conditional independence of
feature selection methods degrades the performance
of model if features are strongly inter-connected.
Most of the real world problems contain features
that are strongly inter-related to each other. Due to
above mentioned research gaps; there is lack of
robust feature selection method to select relevant
and non-redundant factors for oil price forecasting
which can incorporate complexities of crude oil
prices. Hence, to overcome the limitations of
existing pool of methods, this study used I
2
MI
2
feature selection algorithm when features are
strongly dependent on each other and are non-linear.
2 I
2
MI
2
ALGORITHM FOR
FEATURE SELECTION
The novel three stage feature selection method
called I
2
MI
2
algorithm is an extended version of MI
3
Algorithm (Sehgal and Pandey, 2014) build on
pillars of interaction information and mutual
information. It is used for selecting relevant and
non-redundant features that drive oil price. The
proposed algorithm consists of three stages. In the
first stage, mutual information is computed between
target variable and candidate inputs. The variables
are ranked based on normalized mutual information
value and the irrelevant features are filtered out
based on a threshold value. The selected variables
are the list of irrelevant but redundant features. To
overcome redundancy, in stage two, three-variable
interaction information is computed among the
selected features in stage one. The set of selected
features having negative interaction information are
used to filter out the redundant features.
The study incorporates the concept of interaction
information so as to filter redundant input variables
instead of correlation analysis or partial correlation
analysis. Interaction information is favoured over
correlation analysis as it measures non-linear
dependency. This stage provides list of features that
are relevant and non-redundant in nature. Further, in
the third stage, mutual information is computed
between the selected features from stage two and
ranked according to normalized mutual information
value. Depending on a threshold value, redundant
features in stage three are filtered according to
relevance rank in stage one. The selected features
are used to build neural networks for oil price
prediction. The performance of proposed feature
selection algorithm is compared with Correlation
based Feature Selection (CFS), Modified Relief
(MR) and Modified Relief + Mutual Information
(MR + MI) (Amjady and Daraeepour, 2009) feature
selection methods. The performance criterions used
for comparing I
2
MI
2
algorithm with other algorithms
are RMSE, MAE and MAPE.
The proposed algorithm I
2
MI
2
with GRNN as
forecasting engine has performed the best among all
other feature selection methods. I
2
MI
2
algorithm has
lowest RMSE, MAE and MAPE as 1.29, 0.96 and
2.51 respectively. The reason for the best
performance lies in the fact that the final selected
features from proposed algorithm are 100% non-
redundant and relevant for the study. Two stage (MR
+ MI) with CNN as forecasting engine as proposed
by Amjady and Daraeepour (Amjady and
Daraeepour, 2009) has not performed better than
proposed algorithm. I
2
MI
2
algorithm is fully
automatic algorithm and doesn’t require user to
specify the number of features to be selected. I
2
MI
2
algorithm can provide the minimal representative set
of features for regression problems in business,
biostatistics, applied energy and many more
disciplines.
3 NUMERICAL RESULTS
For analysing the different mechanism in the falling
and rising period of oil prices, two sub-periods are
considered: January 2004-July 2008 and August
2008-December 2012, before and after 2008
financial crisis, respectively. The data collected for
factors driving oil prices are classified into eight
major classes: Speculations (2), Supply (3-4),
Demand (5-8), Reserves (9-15), Inventory (16-18),
Exchange Market (19-22), Stock Market (23) and
Economy (24-26) as shown in Table 1. The features
are selected on the basis of extensive literature
review. For each sub-period, I
2
MI
2
algorithm is
applied to select minimal set of relevant and non-
redundant factors that leads to high prediction
performance for oil prices. General Regression
Neural Network model is used as forecasting
engines to analyse the explanatory power of selected
features and their contribution in driving oil prices.
The proposed methodology is used to forecast the