REFORMULATING COMPONENT IDENTIFICATION AS DOCUMENT ANALYSIS PROBLEM - Towards Automated Component Procurement

Hans-Gerhard Gross, Marco Lormans, Jun Zhou

2007

Abstract

One of the first steps of component procurement is the identification of required component features in large repositories of existing components. On the highest level of abstraction, component requirements as well as component descriptions are usually written in natural language. Therefore, we can reformulate component identification as a text analysis problem and apply latent semantic analysis for automatically identifying suitable existing components in large repositories, based on the descriptions of required component features. In this article, we motivate our choice of this technique for feature identification, describe how it can be applied to feature tracing problems, and discuss the results that we achieved with the application of this technique in a number of case studies.

References

  1. Alves, C. and Castro, J. (2001). Cre: A systematic method for cots selection. In 15th Brazilian Symposium on Software Engineering.
  2. Atkinson, C., Bayer, J., Bunse, C., Kamsties, E., Laitenberger, O., Laqua, R., Muthig, D., Paech, B., Wüst, J., and Zettel, J. (2002). Component-Based Product Line Engineering with UML. Addison-Wesley.
  3. Atkinson, C., Bunse, C., Gross, H.-G., and Peper, C., editors (2006). Component-Based Software Development for Embedded Systems, volume 3778 of Lecture Notes in Computer Science. Springer.
  4. Born, M., Schieferdecker, I., Gross, H.-G., and Santos, P. (2004). Model-driven development and testing. In 1st European Workshop on MDA with Emphasis on Industrial Applications, Enschede, The Netherlands.
  5. Deerwester, S., Dumais, S., Furnas, G., Landauer, T., and Harshman, R. (1990). Indexing by latent semantic analysis. Journal of the American Society of Information Science, 49(6):391-407.
  6. DeLucia, A., Fasano, F., Oliveto, R., and Tortora, G. (2004). Enhancing an artefact management system with traceability recovery features. In 20th IEEE Int. Conf. on Software Maintenance, pages 306-350. IEEE Computer Society.
  7. DeLucia, A., Fasano, F., Oliveto, R., and Tortora, G. (2005). A traceability recovery tool. In 9th European Conference on Software Maintenance and Reengineering, pages 32-41. IEEE Computer Society.
  8. Gross, H.-G. (2004). Component-Based Software Testing with UML. Springer.
  9. Gross, H.-G., Melideo, M., and Sillitti, A. (2005). Selfcertification and trust in component procurement. Science of Computer Programming, 56(1-2):141-156.
  10. Kamsties, E., von Knethen, A., and Paech, B. (2001). Structure of quasar requirements documents. Technical report, Fraunhofer IESE, Kaiserslautern.
  11. Kontio, J. (1996). A case study in applying a systematic method for cots selection. In 18th Intl Conference on Software Engineering (ISCE-1996), pages 201-209, Berlin.
  12. Kontio, J., Chen, S., and Limperos, K. (1995). A cots selection method and experiences of its use. In 20th Annual Software Engineering Workshop, Greenbelt Maryland. NASA Goddard Space Flight Center.
  13. Landauer, T., Folz, P., and Laham, D. (1998). An introduction to latent semantic analysis. Discourse Processes, 25:259-284.
  14. Lormans, M. and van Deursen, A. (2005). Reconstructing requirements coverage views from design and test using traceability recovery via lsi. In 3rd Intl. Workshop on Traceability in Emerging Forms of Software Engineering, pages 37-45, Long Beach.
  15. Lormans, M. and van Deursen, A. (2006). Can lsi help reconstructing requirements traceability in design and test? In 10th IEEE Conference on Software Maintenance and Reegineering, Bari, Italy.
  16. Maiden, N. and Cube, C. (2000). Cots software selection: The need to make tradeoffs between system requirements, architecture and cots components. In COTS workshop. Continuing Collaborations for Successful COTS Development.
  17. Maletic, J. and Valluri, N. (1999). Automatic software clustering via latent semantic analysis. In 14th Intl Conference on Automated Software Engineering, page 251.
  18. Overhage, S. (2004). Object-Oriented and Internet-Based Technologies, Weske and Liggesmeyer (Eds), volume 3263 of Lecture Notes in Computer Science, chapter UnSCom: A Standardized Framework for the Specification of Software Components. Springer, Heidelberg.
  19. Tran, V., Lui, D., and Hummel, B. (1997). Componentbased systems development, challenges and lessons learned. In 8th International Workshop on Software Technology and Engineering Practice, pages 452- 462.
  20. Zeimpekis, D. and Gallopoulos, E. (2005). Design of a matlab toolbox term-document matrix generation. Technical report, High-Performance Information Systems Laboratory, University of Patras. http://scgroup.hpclab.ceid.upatras.gr/scgroup/ Projects/TMG.
Download


Paper Citation


in Harvard Style

Gross H., Lormans M. and Zhou J. (2007). REFORMULATING COMPONENT IDENTIFICATION AS DOCUMENT ANALYSIS PROBLEM - Towards Automated Component Procurement . In Proceedings of the Second International Conference on Software and Data Technologies - Volume 2: ICSOFT, ISBN 978-989-8111-06-7, pages 111-116. DOI: 10.5220/0001326701110116


in Bibtex Style

@conference{icsoft07,
author={Hans-Gerhard Gross and Marco Lormans and Jun Zhou},
title={REFORMULATING COMPONENT IDENTIFICATION AS DOCUMENT ANALYSIS PROBLEM - Towards Automated Component Procurement},
booktitle={Proceedings of the Second International Conference on Software and Data Technologies - Volume 2: ICSOFT,},
year={2007},
pages={111-116},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001326701110116},
isbn={978-989-8111-06-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Second International Conference on Software and Data Technologies - Volume 2: ICSOFT,
TI - REFORMULATING COMPONENT IDENTIFICATION AS DOCUMENT ANALYSIS PROBLEM - Towards Automated Component Procurement
SN - 978-989-8111-06-7
AU - Gross H.
AU - Lormans M.
AU - Zhou J.
PY - 2007
SP - 111
EP - 116
DO - 10.5220/0001326701110116