The established monitoring stations can collect on-
line water quality data sets for FRC, temperature
(T), electrical conductivity (EC), pH and turbidity
(Turb). Pressure (P) and water flow rates (Q) are
also being recorded on-line and continuously for the
monitoring stations: Bogacay Pumping Station and
Reservoir, ON 67, ON 68, ON 70, ON 71, ON 72,
ON 73 , ON 74 and Hurma Balancing Reservoir
(Figure 1). These parameters are expected to
influence FRC levels and its decay kinetics in WDS.
Data sets from the analyzers were recorded and
stored at five-minute intervals at SCADA center. For
this study, these data sets were converted into
quarterly averaged values. We have utilized the
water quality and operational data provided by
ASAT for the months September 2009 and October
2009 for the above mentioned monitoring stations.
There was some missing data within time series for
some of the parameters due to short time power
failures and due to unavoidable operational
problems. Missing data points were generated
synthetically by averaging the earlier and following
values. The number of missing information was
relatively few.
2.2 Selection of Input Variables
The purpose of ANN modeling in WDS field of
water resources engineering is in general to obtain a
mathematical tool to calculate the necessary input
levels at control points to satisfy the desired levels of
chlorine in future at the critical points of WDS. In
this study, the control point is Bogacay Pumping
Station and Reservoir and the critical points are the
other monitoring stations namely ON 67, ON 68,
ON 70, ON 71, ON 72, ON 73, ON 74 and Hurma
Balancing Reservoir (Figure 1). An earlier study
(TUBITAK, 2009) has suggested that in cases where
supplied water quality is high and where the chlorine
decay rates are very low, tools other than classical
ANN methodology with multiple inputs should be
investigated and adopted for more simplified
modeling. Because of this observation, in this
research only single controllable input, namely Free
Residual Chlorine at Bogacay Pumping Station and
Reservoir (FRC
Bogacay
), has been utilized. Further,
the collection of data for model building has been
realized in such a way that the specifically created
FRC perturbations within control point has created
FRC time series at monitoring points that are
suitable to develop SISO-TS-ANN models for
control purposes as consistent with (TUBITAK,
2009).
2.3 Model Building
Through this research we wanted to investigate the
applicability of SISO-TS-ANN models developed as
a FRC control tool and as a forecaster for a fairly
complex WDS with high quality raw water input.
The selected single input was FRC
Bogacay
. In order to
initiate ANN modeling one has to decide about
selecting NSUAI and H. In order to establish criteria
to select the best combinations of NSUAI and H for
different monitoring stations, we have adopted
response surface methodology for R
2
and MAE. The
creation of response surfaces required selection of
experimental matrix that covers reasonable and
practical combinations of (NSUAI,H), including
(8,2), (8,4), (8,8), (12,2), (12,4), (12,8), (24,2),
(24,4), (24,8), (48,2), (48,4), (48,8), (72,2), (72,4),
(72,8), (96,2), (96,4), (96,8). The tested NSUAI
values were 8, 12, 24, 48, 72 and 96. The tested H
values were 2, 4 and 8. The R
2
and MAE values have
been estimated for each combination of NSUAI and
H; then the contour plots were prepared by using
STATISTICA (SANN, 2008). SISO-TS-ANN
models for forecasting FRC levels at monitoring
stations have been developed utilizing the software
SANN ANS (Statistica Automated Neural Network-
Automated Network Search) package (SANN,
2008). Modeling constraints were as follows: 1)
Minimum and maximum numbers of hidden neurons
were 3 and 11 respectively, 2) 20 candidate
structures have been tested for each data set and only
the one with best performance and precision was
requested to be retained. MLP approach has been
instructed to be utilized.
The best SISO-TS- ANN for each monitoring
station has been found by examining the response
surfaces developed for that station. Several different
combinations of types of hidden and output
activation functions, number of neurons, number of
steps used as inputs (NSUAI) and number of steps
ahead to predict (H) have been tested and the one
that minimizes MAE and maximizes R
2
has been
selected and listed in Table 1.
3 RESULTS AND DISCUSSIONS
The developed response surfaces as contour plots for
R
2
and MAE are given in Figures 2 and 3
respectively. These figures cover response surfaces
for monitoring stations ON69, ON71 and Hurma
balancing reservoir only. Similar response surfaces
are obtained for other monitoring stations as well. In
order to save space, only the response surfaces of
IJCCI2012-InternationalJointConferenceonComputationalIntelligence
590