quality. The first task was to produce effort
estimators using GP symbolic regression. The
second task was to produce classification rules using
grammar-guided GP for if-then rules. For each
problem, two available datasets were examined. In
the regression task of effort estimation, the system
produced short and comprehensible mathematical
expressions that outperformed previous models. In
the classification task of defect prediction, the
system produced the best-so-far or otherwise
competitive results, while succeeding in deriving
small hierarchical rule trees. Although the system
has been proved capable of producing highly
accurate and meaningful results for each case, this
success is only domain dependent, e.g. it seems that
due to the different dataset compositions, there is no
ability to drive overall conclusions on each of the
two tasks (effort estimation, defect prediction).
Further research involves the application of other
computational intelligent approach in these domains,
such as neuro-fuzzy rule-based systems, as well as
the application of genetic programming into other
related software engineering issues.
(IF= (% D 0.35) 0.01 CLS1 (IF> I -0.84
(IF> I -0.84 (IF> LOC -0.98 CLS1 CLS0)
(IF> LOC -0.98 CLS1 CLS0)) (IF> I -0.85
CLS1 (IF< LBL -0.98 (IF> L -0.53 CLS0
(IF> EVG -0.83 CLS1 (IF= (% D 0.35)
0.01 CLS1 (IF> I -0.84 (IF> I -0.84
(IF> LOC -0.98 CLS1 CLS0) (IF> LOC -
0.98 CLS1 CLS0)) (IF> I -0.85 CLS1 (IF<
IVG 0.06 (IF> I -0.80 (IF> I -0.83 CLS1
(IF= (% D 0.36) 0.01 CLS1 (IF> I -0.85
CLS1 (IF> LOC -0.98 CLS1 CLS0)))) (IF>
L -0.53 CLS0 (IF> I -0.87 CLS1 (IF> I -
0.85 CLS1 (IF> TON -0.82 CLS1 (IF> LOC
-0.98 CLS1 CLS0)))))) CLS1))))))
CLS1))))
Figure 2: Hierarchical rule tree for the NASA JM1
domain.
REFERENCES
Basili V.R., Caldiera G., and Rombach H.D., 1994,
Experience Factory, in Marciniak, J.J., Ed.,
Encyclopedia of Software Engineering, Vol. 1, pp.
469-476, John Wiley & Sons.
Berlanga F.J., del Jesus M.J., Herrera F., 2005, Learning
compact fuzzy rule-based classification systems with
genetic programming, in EUSFLAT05, 4th Conf. of the
European Society for Fuzzy Logic and Technology,
Barcelona, pp. 1027-1032.
Boehm B., 1981, Software Engineering Economics,
Prentice-Hall.
Boetticher, G., Lokhandwala, N., James C. Helm, 2006,
Understanding the Human Estimator, in 2
nd
Int’l
Predictive Models in Soft. Eng.Workshop,
Philadelphia, PA, Sep. 2006.
Desharnais J.M., Analyse statistique de la productivite des
projets informatique a partie de la technique des point
des fonction, 1989, MSc thesis, Univ. of Montreal.
Dreger J., 1989, Function Point Analysis, Englewood
Cliffs, NJ, Prentice Hall.
Fenton N.E., and Pfleeger S., 1997, Software Metrics: A
Rigorous and Practical Approach, Thompson Press.
Foss T.,Stensrud E., Kitchenham B., Myrtveit I., 2003, A
Simulation Study of the Model Evaluation Criterion
MMRE, IEEE Trans. on Soft. Eng., 29:11, Nov. 2003.
Halstead M., 1977, Elements of Software Science,
Elsevier.
Koza J.R., 1992, Genetic Programming: On the
Programming of Computers by Means of Natural
Selection, Cambridge, MA, MIT Press.
Lum K., Bramble M., Hihn J., Hackney J., Khorrami M.
and Monson E., 2003, Handbook of Software Cost
Estimation, Jet Propulsion Laboratory, Pasadena, CA,
USA.
McCabe T., 1976, A Complexity Measure, IEEE Trans.
Software Eng.,2:4, pp. 308-320.
Menzies T., DiStefano J., Orrego A., and Chapman R.,
2004, Assessing Predictors of Software Defects, in
Proc. PSM-2004, Workshop in Predictive Software
Models, Chicago, IL.
Menzies T, Chen Z., Hihn J., and Lum K., 2006, Selecting
Best Practices for Effort Estimation, IEEE Trans.
Software Eng. 32:11, Nov 2006.
Menzies T., Dekhtyar A., Distefano J., Greenwald J.,
2007, Problems with Precision: A Response to
"Comments on; Data Mining Static Code Attributes to
Learn Defect Predictors, IEEE Trans. on Soft. Eng.,
33: 9, Sept. 2007 pp. 637 - 640.
Morasca S., and Ruhe G., 2000, A hybrid approach to
analyze empirical software engineering data and its
application to predict module fault-proneness in
maintenance, J. of Systems and Software, 53:3, Sep.
2000, pp. 225 – 237, Elsevier
Shepperd M. J., and Schofield C.,1997, Estimating
software project effort using analogies, IEEE Trans.
on Soft. Eng., 23, pp. 736-743.
Tsakonas A., Dounias G., Hierarchical Classification
Trees Using Type-Constrained Genetic Programming,
in Proc. of 1st Intl. IEEE Symposium in Intelligent
Systems, Varna, Bulgaria, 2002.
Tsakonas A., and Dounias G., Evolving Neural-Symbolic
Systems Guided by Adaptive Training Schemes:
Applications in Finance, App. Art. Intell., 21:7, 2007,
pp. 681-706.
ICSOFT 2008 - International Conference on Software and Data Technologies
300