Two Sides of a Coin - Translate while Classify Multilanguage Annotations with Domain Ontology-driven Word Sense Disambiguation

Massimiliano Gioseffi, Angela Locoro

Abstract

In this paper we present an approach for the translation and classification of short texts in one step. Our work lays in the tradition of Domain-Driven Word Sense Disambiguation, though a major emphasis is given to domain ontologies as the right tool for sense-tagging and topic detection of short texts which, by their nature, are known to be reluctant to statistical treatment. We claim that in a scenario where users can annotate knowledge items using different languages, domain ontologies can prove very suitable for driving the word disambiguation and topic classification tasks. In this way, two tasks are gainfully collapsed in a single one. Although this study is still in its infancy, in what follows we are able to articulate motivations, design, workflow analysis, and concrete evolutions envisioned for our tool.

References

  1. Agirre, E., De Lacalle, O. L., and Soroa, A. (2009). Knowledge-based wsd on specific domains: performing better than generic supervised wsd. In Proceedings of the 21st international jont conference on Artifical intelligence, IJCAI'09, pages 1501-1506, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc.
  2. Buitelaar, P. and Sacaleanu, B. (2001). Ranking and selecting synsets by domain relevance. In Proceedings of the NAACL Workshop on WordNet and Other Lexical Resources: Applications, Extensions and Customizations.
  3. Cucchiarelli, A. and Velardi, P. (1998). Finding a domainappropriate sense inventory for semantically tagging a corpus. Nat. Lang. Eng., 4(4):325-344.
  4. Gliozzo, A. M., Magnini, B., and Strapparava, C. (2004). Unsupervised domain relevance estimation for word sense disambiguation. In EMNLP, pages 380-387. ACL.
  5. Koeling, R. and McCarthy, D. (2008). From predicting predominant senses to local context for word sense disambiguation. In Proceedings of the 2008 Conference on Semantics in Text Processing, STEP 7808, pages 129-138, Stroudsburg, PA, USA. Association for Computational Linguistics.
  6. Mahesh, K. (1996). Ontology development for machine translation: Ideology and methodology.
  7. Navigli, R. (2009). Word sense disambiguation: A survey. ACM Comput. Surv., 41(2).
  8. O'Hara, T., Mahesh, K., and Niremburg, S. (1998). Lexical acquisition with wordnet and the mikrokosmos ontology. In In Proceedings of the ACL Workshop on the Use of WordNet in NLP, pages 94-101.
  9. Philpot, A., Hovy, E., and Pantel, P. (2010). The Omega ontology. Cambridge University Press.
  10. Wenyin, L., Quan, X., Feng, M., and Qiu, B. (2010). A short text modeling method combining semantic and statistical information. Information Sciences, 180(20):4031 - 4041.
Download


Paper Citation


in Harvard Style

Gioseffi M. and Locoro A. (2013). Two Sides of a Coin - Translate while Classify Multilanguage Annotations with Domain Ontology-driven Word Sense Disambiguation . In Proceedings of the 5th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-8565-39-6, pages 358-363. DOI: 10.5220/0004330203580363


in Bibtex Style

@conference{icaart13,
author={Massimiliano Gioseffi and Angela Locoro},
title={Two Sides of a Coin - Translate while Classify Multilanguage Annotations with Domain Ontology-driven Word Sense Disambiguation},
booktitle={Proceedings of the 5th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,},
year={2013},
pages={358-363},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004330203580363},
isbn={978-989-8565-39-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 5th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,
TI - Two Sides of a Coin - Translate while Classify Multilanguage Annotations with Domain Ontology-driven Word Sense Disambiguation
SN - 978-989-8565-39-6
AU - Gioseffi M.
AU - Locoro A.
PY - 2013
SP - 358
EP - 363
DO - 10.5220/0004330203580363