LSA helps us to identify a few relevant components
out of a large repository. The experiments that we
performed are quite promising in that respect. However,
LSA does not support the next step in component
procurement, the assessment of the adaptations that are
likely to be required. At this moment we have no
answer to this next problem.
In the future, we plan to perform many
more case studies using varying types of documents.
It would be interesting to see how more structured
documents, such as use case descriptions and other
templates (Kamsties et al., 2001; Overhage, 2004),
which are increasingly used in industry, affect LSA.
Will such structures improve its performance, or will
they have a negative effect? The same question applies
to more formalized documents, such as requirements
containing logic and formulae. We have already seen how
UML diagrams can inhibit the text-based LSA technique.
Will the same hold for formal expressions? Another
issue that we will look at is how to extract textual
concepts from the diagrams that are used in industry
(Born et al., 2004).
REFERENCES
Alves, C. and Castro, J. (2001). CRE: A systematic method
for COTS selection. In 15th Brazilian Symposium on
Software Engineering.
Atkinson, C., Bayer, J., Bunse, C., Kamsties, E.,
Laitenberger, O., Laqua, R., Muthig, D., Paech, B.,
Wüst, J., and Zettel, J. (2002). Component-Based
Product Line Engineering with UML. Addison-Wesley.
Atkinson, C., Bunse, C., Gross, H.-G., and Peper, C., edi-
tors (2006). Component-Based Software Development
for Embedded Systems, volume 3778 of Lecture Notes
in Computer Science. Springer.
Born, M., Schieferdecker, I., Gross, H.-G., and Santos, P.
(2004). Model-driven development and testing. In
1st European Workshop on MDA with Emphasis on
Industrial Applications, Enschede, The Netherlands.
Deerwester, S., Dumais, S., Furnas, G., Landauer, T., and
Harshman, R. (1990). Indexing by latent semantic
analysis. Journal of the American Society for
Information Science, 41(6):391–407.
DeLucia, A., Fasano, F., Oliveto, R., and Tortora, G. (2004).
Enhancing an artefact management system with trace-
ability recovery features. In 20th IEEE Int. Conf. on
Software Maintenance, pages 306–350. IEEE Com-
puter Society.
DeLucia, A., Fasano, F., Oliveto, R., and Tortora, G. (2005).
A traceability recovery tool. In 9th European Con-
ference on Software Maintenance and Reengineering,
pages 32–41. IEEE Computer Society.
Gross, H.-G. (2004). Component-Based Software Testing
with UML. Springer.
Gross, H.-G., Melideo, M., and Sillitti, A. (2005). Self-
certification and trust in component procurement. Sci-
ence of Computer Programming, 56(1–2):141–156.
Kamsties, E., von Knethen, A., and Paech, B. (2001).
Structure of QUASAR requirements documents. Technical
report, Fraunhofer IESE, Kaiserslautern.
Kontio, J. (1996). A case study in applying a systematic
method for COTS selection. In 18th Intl. Conference on
Software Engineering (ICSE-1996), pages 201–209,
Berlin.
Kontio, J., Chen, S., and Limperos, K. (1995). A COTS
selection method and experiences of its use. In 20th
Annual Software Engineering Workshop, Greenbelt,
Maryland. NASA Goddard Space Flight Center.
Landauer, T., Foltz, P., and Laham, D. (1998). An
introduction to latent semantic analysis. Discourse
Processes, 25:259–284.
Lormans, M. and van Deursen, A. (2005). Reconstructing
requirements coverage views from design and test
using traceability recovery via LSI. In 3rd Intl. Workshop
on Traceability in Emerging Forms of Software
Engineering, pages 37–45, Long Beach.
Lormans, M. and van Deursen, A. (2006). Can LSI help
reconstructing requirements traceability in design and
test? In 10th IEEE Conference on Software
Maintenance and Reengineering, Bari, Italy.
Maiden, N. and Ncube, C. (2000). COTS software selection:
The need to make tradeoffs between system
requirements, architecture and COTS components. In COTS
workshop. Continuing Collaborations for Successful
COTS Development.
Maletic, J. and Valluri, N. (1999). Automatic software
clustering via latent semantic analysis. In 14th Intl.
Conference on Automated Software Engineering, page 251.
Overhage, S. (2004). Object-Oriented and Internet-Based
Technologies, Weske and Liggesmeyer (Eds), volume
3263 of Lecture Notes in Computer Science, chapter
UnSCom: A Standardized Framework for the Speci-
fication of Software Components. Springer, Heidel-
berg.
Tran, V., Lui, D., and Hummel, B. (1997). Component-
based systems development, challenges and lessons
learned. In 8th International Workshop on Software
Technology and Engineering Practice, pages 452–
462.
Zeimpekis, D. and Gallopoulos, E. (2005). Design
of a MATLAB toolbox for term-document matrix
generation. Technical report, High-Performance
Information Systems Laboratory, University of Patras.
http://scgroup.hpclab.ceid.upatras.gr/scgroup/Projects/TMG.
ICSOFT 2007 - International Conference on Software and Data Technologies