SEMANTIC CLUSTERING BASED ON ONTOLOGIES - An Application to the Study of Visitors in a Natural Reserve

Montserrat Batet, Aïda Valls, Karina Gibert

2011

Abstract

The development of large ontologies for general and specific domains provides new tools to improve the quality of data mining techniques such as clustering. In this paper we explain how to improve clustering results by exploiting the semantics of categorical data by means of ontologies and how this semantics can be included into a hierarchical clustering method. We want to prove that when the conceptual meaning of the values is taken into account, it is possible to find a better interpretation of the clusters. This is demonstrated with the analysis of real data collected from visitors to of a Natural Reserve. The results of our methodology are compared with the ones obtained with a classical multivariate analysis done in the same database.

References

  1. Ahmad, A., and Dey, L. (2007). A k-mean clustering algorithm for mixed numeric and categorical data. Data Knowledge Engineering, 63(2), 503-527.
  2. Anton-Clavé, S., Nel-lo, M.-G., and Orellana, A. (2007). Coastal tourism in Natural Parks. An analysis of demand profiles and recreational uses in coastal protected natural areas. Revista Turismo & Desenvolvimento, 7-8, 69-81.
  3. Batet, M., Sanchez, D., Valls, A., and Gibert, K. (2010). Exploiting Taxonomical Knowledge to Compute Semantic Similarity: An Evaluation in the Biomedical Domain. In Trends in Applied Intelligent Systems. 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, LNAI 6096 (pp. 274-283): Springer.
  4. Fellbaum, C. (1998). WordNet: An Electronic Lexical Database. Cambridge, Massachusetts: MIT Press. More information: http://wordnet.princeton.edu.
  5. Gibert, K., and Cortés, U. (1997). Weighing quantitative and qualitative variables in clustering methods. Mathware and Soft Computing, 4(3), 251-266.
  6. Leacock, C., and Chodorow, M. (1998). Combining local context and WordNet similarity for word sense identification. In WordNet: An electronic lexical database (pp. 265-283): MIT Press.
  7. Lin, D. (1998, July 24-27). An Information-Theoretic Definition of Similarity. Paper presented at the 15th International Conference on Machine Learning (ICML98), Madison, Wisconsin, USA.
  8. Rada, R., Mili, H., Bichnell, E., and Blettner, M. (1989). Development and application of a metric on semantic nets. IEEE Transactions on Systems, Man, and Cybernetics, 9(1), 17-30.
  9. Resnik, P. (1995, August 20 - 25). Using Information Content to Evalutate Semantic Similarity in a Taxonomy. Paper presented at the 14th International Joint Conference on Artificial Intelligence, IJCAI 1995, Montreal, Quebec, Canada.
  10. Studer, R., Benjamins, V. R., and Fensel, D. (1998). Knowledge Engineering: Principles and Methods. Data and Knowledge Engineering, 25(1-2)(1-2), 161- 197.
  11. Ward, J. H. (1963). Hierarchical Grouping to Optimize an Objective Function. Journal of the American Statistical Association, 58, 236-244.
Download


Paper Citation


in Harvard Style

Batet M., Valls A. and Gibert K. (2011). SEMANTIC CLUSTERING BASED ON ONTOLOGIES - An Application to the Study of Visitors in a Natural Reserve . In Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8425-40-9, pages 283-289. DOI: 10.5220/0003165602830289


in Bibtex Style

@conference{icaart11,
author={Montserrat Batet and Aïda Valls and Karina Gibert},
title={SEMANTIC CLUSTERING BASED ON ONTOLOGIES - An Application to the Study of Visitors in a Natural Reserve},
booktitle={Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2011},
pages={283-289},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003165602830289},
isbn={978-989-8425-40-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - SEMANTIC CLUSTERING BASED ON ONTOLOGIES - An Application to the Study of Visitors in a Natural Reserve
SN - 978-989-8425-40-9
AU - Batet M.
AU - Valls A.
AU - Gibert K.
PY - 2011
SP - 283
EP - 289
DO - 10.5220/0003165602830289