Formal verification of documentation by means of self-organizing map

Algirdas Laukaitis, Olegas Vasilecas


By using background knowledge of the general and specific domains and by processing new natural language corpus experts are able to produce a conceptual model for some specific domain. In this paper we present a model that tries to capture some aspects of this conceptual modeling process. This model is functionally organized into two information processing streams: one reflects the process of formal concept lattice generation from domain conceptual model, and the another one reflects the process of formal concept lattice generation from the domain documentation. It is expected that similarity between those concept lattices reflects similarity between documentation and conceptual model.In addition to this process of documentation formal verification the set of natural language processing artifacts are created. Those artifacts then can be used for the development of information systems natural language interfaces. To demonstrate it, an experiment for the concepts identification form natural language queries is provided at the end of this paper. keywords Information systems engineering, formal concept analysis, IS documents self-organization, natural language processing.


  1. Burg, J.F.M., Riet, R.P.: Enhancing CASE Environments by Using Linguistics. International Journal of Software Engineering and Knowledge Engineering 8(4), (1998) 435-448.
  2. Cunningham, H.: GATE, a General Architecture for Text Engineering. Computers and the Humanities, 36, (2002) 223-254.
  3. Ganter B., Wille. R.: Formal Concept Analysis: Mathematical Foundations. Springer, BerlinHeidelberg, (1999).
  4. Hofmann, T.: Probabilistic latent semantic indexing. In Research and Development in Information Retrieval, (1999) 50-57.
  5. Hotho, A., Staab, S., Stumme, G.: Explaining text clustering results using semantic structures. In Principles of Data Mining and Knowledge Discovery, 7th European Conference, PKDD 2003, Croatia. LNCS. Springer (2003) 22-26.
  6. Hung, C., Wermter, S., Smith, P.: Hybrid Neural Document Clustering Using Guided Selforganisation and WordNet. Issue of IEEE Intelligent Systems, (2004) 68-77.
  7. IBM. IBM Banking Data Warehouse General Information Manual. Available from on the IBM corporate site (accessed July 2006).
  8. IBM Voice Toolkit V5.1 for WebSphere Studio. (accessed July 2006).
  9. Kaski, S., Honkela, T., Lagus, K., Kohonen, T.: WEBSOM self-organizing maps of document collections. Neurocomputing, 21, (1998) 101-117.
  10. Kohonen, T.: Self-Organizing Maps, Springer-Verlag, (2001).
  11. Lagus, K., Honkela, T., Kaski, S., Kohonen, T.: WEBSOM for textual datamining. Articial Intelligence Review, 13 (5/6) (1999) 345-364.
  12. Miller, G.A.: WordNet: A Dictionary Browser, Proc. 1st Int'l Conf. Information in Data, (1985) 25-28.
  13. Ryan, K.: The role of natural language in requirements engineering. Proceedings of IEEE International Symposium on Requirements Engineering, IEEE Computer Society Press, (1993) 240-242.
  14. Rolland, C., Proix, C.: A Natural Language Approach to Requirements Engineering. 4th International CAiSE Conference, Manchester UK, (1992) 257-277.
  15. Salton. G.: Automatic Text Processing: The Transformation, Analysis and Retrieval of Information by Computer. Addison-Wesley, (1989).
  16. Valtchev, P., Grosser, D., Roume, C., Rouane H. M.: GALICIA: an open platform for lattices. In A. de Moor B. Ganter, editor, Using Conceptual Structures: Contributions to 11th Intl. Conference on Conceptual Structures (2003) 241-254.

Paper Citation

in Harvard Style

Laukaitis A. and Vasilecas O. (2009). Formal verification of documentation by means of self-organizing map . In - ENASE, ISBN , pages 0-0

in Bibtex Style

author={Algirdas Laukaitis and Olegas Vasilecas},
title={Formal verification of documentation by means of self-organizing map},
booktitle={ - ENASE,},

in EndNote Style

TI - Formal verification of documentation by means of self-organizing map
SN -
AU - Laukaitis A.
AU - Vasilecas O.
PY - 2009
SP - 0
EP - 0
DO -