
the confidence an estimator had in its estimate at
the time of estimation. The transparency of the pro-
posed visual support is likely to increase this con-
fidence, which should allow—in many cases—to
agree on more narrow estimates.
For example, the lower right project cluster of the
Desharnais 2 data set (see figure 4) seems—despite
some outliers—to increase confidence in an effort
estimate range between 2500 and 3500.
6 CONCLUSION AND FURTHER
RESEARCH
MDS provides a transparent method to visualize high-
dimensional data and to analyze analogies or similari-
ties intuitively. In this paper we propose portfolio data
preparation steps for an MDS visualization of high-
dimensional project portfolio data, we visualize sev-
eral real-world data sets and assess the achieved ap-
proximation quality, and we outline several benefits
of the approach referring to concrete portfolio prop-
erties.
Main findings are that the approximation quality is
within reasonable boundaries given in the MDS liter-
ature, and that cost estimation can indeed benefit sub-
stantially from MDS—specific benefits include better
transparency of the analogy-based approach, a better
understanding of a portfolio’s data properties, thus,
easier assessment of the validity of analogy-based ap-
proaches in specific circumstances, easier data han-
dling and project selection, and finally, higher confi-
dence in estimates.
However, many aspects have to be refined and will
be addressed in future research efforts. First, weight-
ing portfolio data dimensions using brute force could
be extended from the current appoach to fine-grained
weight levels. Second, user interface issues will be
addressed to facilitate cluster analysis, for example,
providing easy access to project cluster mean and
variance values. Finally, quantitative measures for es-
timation confidence will be defined to assess the value
of the visualization for the estimators, for instance, by
weighting estimates’ accuracies (post-project) with
the estimators’ corresponding confidence values in
these estimates (pre-project).
To sum up, this and future research aims at support-
ing decision makers in the crucial task of cost estima-
tion, by providing transparent and intuitive means to
analyze portfolio data and assess estimates’ plausibil-
ity.
REFERENCES
Agarwal, R., Kumar, M., Yogesh, Mallick, S., Bharadwaj,
R. M., and Anantwar, D. (2001). Estimating software
projects. Software Engineering Notes, 26(4):60–7.
Albrecht, A. J. and Gaffney, S. H. (1983). Software func-
tion, source lines of code and development effort pre-
diction: A software science validation. IEEE Trans-
actions of Software Engineering, 9(6):639–48.
Auer, M., Graser, B., and Biffl, S. (2003). An approach to
visualizing empirical software project portfolio data
using multidimensional scaling. In Proceedings of the
IEEE International Conference on Information Reuse
and Integration Paper Notification (IRI 2003).
Boehm, B. W. (1981). Software Engineering Economics.
Prentice Hall.
Boetticher, G. D. (2001). Using machine learning to predict
project effort: Empirical case studies in data-starved
domains. In Proceedings of the Model Based Require-
ments Workshop, pages 17–24.
Borg, I. and Groenen, P. (1996). Modern Multidimensional
Scaling: Theory and Applications. Springer.
Briand, L. C., Langley, T., and Wieczorek, I. (2000). A
replicated assessment and comparison of common
software cost modeling techniques. In Proceedings
of the 22nd International Conference on Software En-
gineering (ICSE’00), pages 4–11, Limerick, Ireland.
Clouse, D. and Cottrell, G. (1996). Discrete multi-
dimensional scaling. In Cottrell, G., editor, Proceed-
ings of the 18th Annual Conference of the Cognitive
Science Society (COGSCI’96), pages 290–4.
Conte, S. D., Dunsmore, H. E., and Shen, V. Y. (1986).
Software Engineering Metrics and Models. Ben-
jamin/Cummings.
Coxon, A. and Davies, P. (1982). Key Texts in Multidimen-
sional Scaling. Heinemann.
Desharnais, J. M. (1989). Analyse statistique de la produc-
tivitie des projets informatique a partie de la technique
des point des fonction. Master’s thesis, Univ. of Mon-
treal.
Goodhill, G., Simmen, M., and Willshaw, D. (1995). An
evaluation of the use of multidimensional scaling
for understanding brain connectivity. Philosophical
Transactions of the Royal Society, B 348:265–80.
Hihn, J. and Habib-Agahi, H. (1991). Cost estimation of
software intensive projects: A survey of current prac-
tices. In Proceedings of the 13th International Confer-
ence on Software Engineering (ICSE’91), pages 276–
87.
Jones, C. (1998). Estimating Software Costs. McGraw-Hill.
Kemerer, C. (1987). An empirical validation of software
cost estimation models. Communications of the ACM
(May), pages 416–29.
Kruskal, J. B. (1964a). Multidimensional scaling by opti-
mizing goodness of fit to a nonmetric hypothesis. Psy-
chometrika, 29(1):1–27.
VISUALIZING SOFTWARE PROJECT ANALOGIES TO SUPPORT COST ESTIMATION
67