A COMPARATIVE STUDY OF DOCUMENT CORRELATION TECHNIQUES FOR TRACEABILITY ANALYSIS

Anju G. Parvathy, Bintu G. Vasudevan, Rajesh Balakrishnan

Abstract

One of the important aspects of software engineering is to ensure traceability across the development lifecycle. Traceability matrix is widely used to check for completeness and to aid impact analysis. We propose that this computation of traceability can be automated by looking at the correlation between the documents. This paper describes and compares four novel approaches for traceability computation based on text similarity, term structure and inter-document correlation algorithms. These algorithms base themselves on different information retrieval techniques for establishing document correlation. Observations from our experiments are also presented. The advantages and disadvantages of each of these approaches are discussed in detail. Various scenarios where these approaches would be applicable and the future course of action are also discussed.

References

  1. Alexander, I. (2002). Towards automatic traceability in industrial practice. In Proc. of Workshop on Traceability, pages 26-31.
  2. Blei, D. M. and Lafferty, J. D. (2007). A correlated topic model of science. Appl. Stat., 1(1):17-35.
  3. Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent dirichlet allocation. J. of MLR3, pages 993-1022.
  4. Deerwester, S. C., Dumais, S. T., Landauer, T. K., Furnas, G. W., and Harshman, R. A. (1990). Indexing by latent semantic analysis. J. of ASIS, 41(6):391-407.
  5. Dekhtyar, A., Hayes, J. H., and Sundaram, S. K. (2006). Advancing candidate link generation for requirements tracing: The study of methods. In IEEE Trans. on SE, volume 32, pages 4-19.
  6. DOORS (2006). Telelogic, http://www.telelogic.com.
  7. Dumais, S. (1991). Improving the retrieval of information from external sources. Behavior Research Methods, Instruments, and Computers, 23(2):229-236.
  8. Ebner, G. and Kaindl, H. (2002). Tracing all around in reengineering. IEEE Software, 19(3):70-77.
  9. Goldin, L. and Berry, D. M. (1997). Abstfinder, a prototype natural language text abstraction finder for use in requirements elicitation. ASE, 4(4):375-412.
  10. Gotel, O. and Finkelstein, A. (1994). An analysis of the requirements traceability problem. In Proc. of the IEEE Int'l. Conf. on Req. Engg., pages 94-101.
  11. Hoffman, T. (1999). Probabilistic latent semantic analysis. In Proc. of UAI, pages 289-296.
  12. Hoffmann, T., Puzicha, J., and Jordan, M. I. (1999). Learning from dyadic data. ANIPS 11.
  13. Knethen, A. V. and Grund, M. (2003). Quatrace: A tool environment for(semi-) automatic impact analysis based on traces. In Proc. of the Int'l. Conf. on Software Maintenance, pages 246-255.
  14. Kolda, T. G. and O'Leary, D. P. (1998). A semi - discrete matrix decomposition for latent semantic indexing in information retrieval. ACM Trans. on Info. Sys., 16(4):322-346.
  15. Lin, D. and Pantel, P. (2001). Induction of semantic classes from natural language text. In Proc. of the seventh Int'l. Conf. on KDDM, pages 317-322, California.
  16. RDD-100 (2006). Holagent corporation, http://www.holagent.com/products/product1.html.
  17. RequisitePro, R. (2006). Rational software, http://www.rational.com/products/reqpro/index.jsp.
  18. Richardson, J. and Green, J. (2004). Automating traceability for generated software artifacts. In Proc. of the 19th IEEE Int'l. Conf. on ASE, pages 24-33.
  19. Spence, I. and Probasco, L. (1998). Traceability strategies for managing requirements with use cases. W. Paper.
Download


Paper Citation


in Harvard Style

G. Parvathy A., G. Vasudevan B. and Balakrishnan R. (2008). A COMPARATIVE STUDY OF DOCUMENT CORRELATION TECHNIQUES FOR TRACEABILITY ANALYSIS . In Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 6: ICEIS, ISBN 978-989-8111-38-8, pages 64-69. DOI: 10.5220/0001676100640069


in Bibtex Style

@conference{iceis08,
author={Anju G. Parvathy and Bintu G. Vasudevan and Rajesh Balakrishnan},
title={A COMPARATIVE STUDY OF DOCUMENT CORRELATION TECHNIQUES FOR TRACEABILITY ANALYSIS},
booktitle={Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 6: ICEIS,},
year={2008},
pages={64-69},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001676100640069},
isbn={978-989-8111-38-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 6: ICEIS,
TI - A COMPARATIVE STUDY OF DOCUMENT CORRELATION TECHNIQUES FOR TRACEABILITY ANALYSIS
SN - 978-989-8111-38-8
AU - G. Parvathy A.
AU - G. Vasudevan B.
AU - Balakrishnan R.
PY - 2008
SP - 64
EP - 69
DO - 10.5220/0001676100640069