SWR is a statistical technique that allows us to
build a prediction model representing the
relationship between independent and dependent
variables. The estimation model is obtained by
adding, at each stage, the independent variable with
the highest association to the dependent variable,
taking into account all variables currently in the
model. SWR aims to find the set of independent
variables that best explains the variation in the
dependent variable.
To select the variables to be added in the model a
Manual SWR (MSWR) can be applied, using the
technique proposed by Kitchenham (1998). The idea
underlying this procedure is to select the important
independent variables, and then to use linear
regression to obtain the final model.
CBR is a branch of Artificial Intelligence where
knowledge of similar past cases is used to solve new
cases. The idea is to predict the effort of a new
project by considering similar projects previously
developed. We applied CBR by employing ANGEL
(Shepperd and Schofield, 2000). ANGEL
implements the Euclidean distance as similarity
function and uses features normalised between 0 and
1. We used 1, 2, and 3 analogies employing as
adaptation strategies the mean of k analogies (simple
average), the inverse distance weighted mean and
the inverse rank weighted mean (Shepperd and
Schofield, 2000). We used the feature subset
selection of ANGEL in order to let the tool to
automatically choose, among all the variables, the
ones to employ as set of key features in the analogy
based estimation.
To have a better visual insight on the
effectiveness of the estimation models, we compared
the prediction accuracies taking into account both
the summary statistics and the boxplots of absolute
residuals, where residuals are calculated as (EFreal –
EFpred). Boxplots are widely employed in
exploratory data analysis since they provide a quick
visual representation to summarize the data, using
five values: median, upper and lower quartiles,
minimum and maximum values, and outliers
(Kitchenham et al., 2001). In development effort
estimation, boxplots are used to visually represent
the amount of the error for a given prediction
technique. In particular, we used boxplot to
graphically render the spread of the absolute
residuals.
In order to verify whether the estimates obtained
with TS are characterized by significantly better
accuracy than the considered benchmarks we
statistically analyzed the absolute residuals, as
suggested in (Kitchenham et al., 2001). Since (i) the
absolute residuals for all the analyzed estimation
methods were not normally distributed, and (ii) the
data was naturally paired, we decided to use the
Wilcoxon test (Royston, 1982). The achieved results
were intended as statistically significant at = 0.05.
We performed the empirical analysis by
exploiting two datasets: the Desharnais (Desharnais,
1989) dataset, containing 81 observations, and the
NASA (Bailey and Basili, 1981) dataset, with 18
observations. Despite of these datasets are quite old,
they have been widely and recently used to evaluate
and compare estimation methods (see e.g., (Burgess
and Lefley, 2001) (Shepperd and Schofield, 2000)).
As for Desharnais dataset, in our analysis we did not
consider the length of the code as made in (Burgess
and Lefley, 2001), and categorical variables (i.e.,
Language and YearEnd). We excluded four projects
that had missing values, as done by Shepperd and
Schofield (2000). The NASA dataset consists of two
independent variables, i.e. Developed Lines (DL) of
code and Methodology (Me). The descriptive
statistics of the selected factors for the two datasets
are shown in Tables 1 and 2.
Table 1: Descriptive statistics of Desharnais dataset.
Variable Min Max Mean Std.Dev.
TeamExp 0.00 4.00 2.30 1.33
ManagerExp 0.00 7.00 2.65 1.52
Entities 7.00 387.00 120.55 86.11
Transactions 9.00 886.00 177.47 146.08
AdjustedFPs 73.00 1127.00 298.01 182.26
RawFPs 62.00 1116.00 282.39 186.36
Envergue 5.00 52.00 27.45 10.53
EFH (m/h) 546.00 23490.00 4903.95 4188.19
Table 2: Descriptive statistics of NASA dataset.
Variable Min Max Mean Std. Dev.
Me 19.00 35.00 27.78 5.39
DL 2.10 100.80 35.26 35.10
EFH (m/m) 5.00 138.30 49.47 45.73
We exploited some parameter settings to find
suitable values for moves and iterations numbers.
Concerning the number of moves, we executed TS
using four different values, i.e. 100, 500, 1000, and
2000. The best results in terms of MMRE and
Pred(25) were achieved with 1000 moves for
Desharnais dataset and 100 moves for NASA
dataset. We also executed the algorithm with
different numbers of iterations, and the best results
were achieved using 3000 and 500 iterations on
Desharnais and NASA, respectively. We did not
consider number of moves greater than 2000 for
Desharnais dataset and 100 for NASA dataset since
we noted a decreasing in the performance.
Moreover, note that for NASA dataset, having a
ICEIS 2010 - 12th International Conference on Enterprise Information Systems
238