performing model was the one created with the feed-
forward trained by a multi-layer perceptron back-
propagation Neural Network algorithm with an f-
measure of 75%.
3.2.2 Zone Prediction Models
The four zone prediction models were trained by
considering fault historical data from the IOU grid
and weather data. Of the 1725 records with faults
and weather data, 70% were used for training and
30% for testing the trained models. The output of
these models predict in what zone (AMZ, UMZ,
PMZ) on the IOU grid the fault occurred. The best-
performing model was the one created training a
Neural Network algorithm. The model contains one
hidden layer with 20 nodes, and produces an
accuracy of 66%, an average precision of 69%, an
average recall of 68%, and an f-measure of 68%.
3.2.3 Substation Prediction Models
The four substation prediction models were trained
by considering fault historical data from the IOU
grid and weather data. Of the 1725 records with
faults and weather data, 70% were used for training
and 30% for testing the trained models. The output
of these models predicts the IOU substation ID
where the fault occurred. The best performing model
was the one created with the recursive partitioning
algorithm and produces an accuracy of 59%, an
average precision of 66%, an average recall of 54%,
and an f-measure of 59%.
3.2.4 Infrastructure Prediction Models
The four infrastructure prediction models were
trained by considering fault historical data from the
IOU grid and weather data. Of the 1725 records with
faults and weather data, 70% were used for training
and 30% for testing the trained models. The output
of these models predicts the type of infrastructure
(overhead or underground) on the section of the IOU
grid where the fault occurred. The best-performing
model was the one created training a Neural
Network algorithm with an f-measure of 57%.
3.2.5 Feeder Prediction Models
The four feeder prediction models were trained by
considering fault historical data from the IOU grid
and weather data. Of the 1725 records with faults
and weather data, 70% were used for training and
30% for testing the trained models. The output of
these models predicts the IOU Feeder where the
fault occurred. The best-performing model is the one
created with the recursive partitioning algorithm
with an f-measure of 74%.
3.3 Machine Learning for Forecasting
Fault Events in Assets
Many utilities have deployed diverse types of
sensors in their mission critical and expensive assets
such as power transformers. When monitoring
power transformers two types of on-line
measurements can be collected: (a) operational
information such as voltage, load, current, oil
temperature, winding temperatures, pump status, fan
status, cooling system status, etc; (b) condition
information, such as oil quality, gassing, dielectric
properties, aging, etc. Utilities use a variety of
sensors in their transformers and such sensors have
different monitoring capabilities, especially in terms
of the types and concentrations of gasses in the oil of
the transformers. Some time utilities supplement the
monitored concentration of gasses by conducting a
dissolved gas analysis (DGA) test periodically. A
study has been completed with the objective of
developing analytical models based on data mining
to identify patterns in gas concentrations, to identify
trends of gas concentrations that may lead to
catastrophic failures of equipment, and in general to
identify correlations between observations that
would result in new knowledge or confirm existing
heuristic knowledge about power transformers. The
example presented below does not identify the name
of the utility with which this study was conducted.
Hence, we refer our customer as Utility A. In our
example, Utility A had a fleet of over 300 power
transformers and had historical data collected for a
period of ten years. The historical data collected
included DGA analysis tests for all transformers
(concentration of H2, CH4, C2H6, C2H4, C2H2,
CO, CO2, O2, N2, and moisture), ID transformer
data (transformer name, type, age, pump type,
construction type, and conservator type), oil
temperature, winding temperature, and fluid quality
(metal particles present in oil). Utility A installed
sensors in its transformers fleet and recently
installed a fibre-optic network that helped to
transmit the monitored data into a central repository.
The objective was to develop a profile of
potential “hot spots” in power transformers where
the concentration of CO, CO2, and O2 are high (CO
> 571 parts per million, CO2 > 4001 ppm, and O2 >
10,000 ppm) and oil temperatures need to be
monitored so they do not exceed values > 150 C.
These conditions can show deterioration of the
KDIR2012-InternationalConferenceonKnowledgeDiscoveryandInformationRetrieval
368