Leveraging Entity Linking to Enhance Entity Recognition in Microblogs

Pikakshi Manchanda, Elisabetta Fersini, Matteo Palmonari

2015

Abstract

The Web of Data provides abundant knowledge in which objects or entities are described by means of properties and their relationships with other objects or entities. The research community uses this knowledge extensively for Information Extraction tasks such as Named Entity Recognition (NER) and Named Entity Linking (NEL) to make sense of data. Named entities can be identified in a variety of textual formats and then linked to corresponding resources in the Web of Data. In the state of the art, however, entity recognition and entity linking are cast as distinct problems, which overlooks the fact that the performance of entity recognition affects the performance of entity linking. The focus of this paper is to improve the performance of entity recognition on a particular textual format, viz., microblog posts, by disambiguating the named entities with resources in a Knowledge Base (KB). We propose an unsupervised learning approach to jointly improve the performance of entity recognition and, thus, of the whole system by leveraging the results of disambiguated entities.



Paper Citation


in Harvard Style

Manchanda P., Fersini E. and Palmonari M. (2015). Leveraging Entity Linking to Enhance Entity Recognition in Microblogs. In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015) ISBN 978-989-758-158-8, pages 147-155. DOI: 10.5220/0005640701470155


in Bibtex Style

@conference{kdir15,
author={Pikakshi Manchanda and Elisabetta Fersini and Matteo Palmonari},
title={Leveraging Entity Linking to Enhance Entity Recognition in Microblogs},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)},
year={2015},
pages={147-155},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005640701470155},
isbn={978-989-758-158-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)
TI - Leveraging Entity Linking to Enhance Entity Recognition in Microblogs
SN - 978-989-758-158-8
AU - Manchanda P.
AU - Fersini E.
AU - Palmonari M.
PY - 2015
SP - 147
EP - 155
DO - 10.5220/0005640701470155