Information Retrieval in Collaborative Engineering Projects - A Vector Space Model Approach

Paulo Figueiras, Ruben Costa, Luis Paiva, Celson Lima, Ricardo Jardim-Gonçalves

Abstract

This work introduces a conceptual framework, and its current implementation, to support the classification and discovery of knowledge sources, where every knowledge source is represented by a vector (named Semantic Vector - SV). The novelty of this work lies in the enrichment of such knowledge representations: the classical vector space model is extended with ontological support, meaning that ontological concepts and their relations are used to enrich each SV. Our approach takes into account three different but complementary processes using the following inputs: (1) the statistical relevance of keywords, (2) the ontological concepts, and (3) the ontological relations. SVs are compared against each other to obtain their similarity index, which better supports end users with search and retrieval capabilities over knowledge sources. This paper presents the technical architecture (and respective implementation) supporting the conceptual framework, emphasizing the SV creation process. Moreover, it provides some examples detailing the indexing process of knowledge sources. Results achieved so far and future goals are also presented.
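As a rough illustration of the keyword-based step and the pairwise SV comparison described above, the following minimal sketch builds tf-idf weighted vectors and compares them. It rests on assumptions not stated in the abstract: tf-idf stands in for "statistical relevance of keywords", cosine similarity stands in for the "similarity index", and the function names `tf_idf_vectors` and `cosine_similarity` are hypothetical. The ontological concept and relation enrichment steps are not modeled here.

```python
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Build one sparse tf-idf vector (dict: term -> weight) per tokenized document.

    Assumption: tf-idf is used as the keyword relevance measure.
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(u, v):
    """Cosine of the angle between two sparse vectors (assumed similarity index)."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an all-zero vector is treated as similar to nothing
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    # Hypothetical toy corpus of three already-tokenized knowledge sources.
    docs = [
        ["concrete", "beam", "design"],
        ["concrete", "slab", "design"],
        ["project", "budget", "report"],
    ]
    vectors = tf_idf_vectors(docs)
    print(cosine_similarity(vectors[0], vectors[1]))  # shared keywords: above zero
    print(cosine_similarity(vectors[0], vectors[2]))  # no shared keywords: zero
```

In an ontology-enriched variant, the weights in each dictionary would additionally be adjusted using concepts and relations from the domain ontology before the comparison; how that adjustment is computed is not specified in this abstract.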



Paper Citation


in Harvard Style

Figueiras P., Costa R., Paiva L., Lima C. and Jardim-Gonçalves R. (2012). Information Retrieval in Collaborative Engineering Projects - A Vector Space Model Approach. In Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2012) ISBN 978-989-8565-30-3, pages 233-238. DOI: 10.5220/0004139302330238


in Bibtex Style

@conference{keod12,
author={Paulo Figueiras and Ruben Costa and Luis Paiva and Celson Lima and Ricardo Jardim-Gonçalves},
title={Information Retrieval in Collaborative Engineering Projects - A Vector Space Model Approach},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2012)},
year={2012},
pages={233-238},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004139302330238},
isbn={978-989-8565-30-3},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2012)
TI - Information Retrieval in Collaborative Engineering Projects - A Vector Space Model Approach
SN - 978-989-8565-30-3
AU - Figueiras P.
AU - Costa R.
AU - Paiva L.
AU - Lima C.
AU - Jardim-Gonçalves R.
PY - 2012
SP - 233
EP - 238
DO - 10.5220/0004139302330238