Managing Metadata Variability within a Hierarchy of Annotation Schemas

Ionuţ Cristian Pistol, Dan Cristea

Abstract

The paper describes the theoretical basis of the ALPE model, a hierarchy of annotation formats used to guide the automatic computation of processing flows capable of performing complex linguistic processing tasks. The hierarchy is comprised of a core, which is a direct acyclic graph whose nodes represent XML annotation formats, and a halo which contains additional annotation formats. The core hierarchy also serves as a standardization hub for annotated documents. The focus of the paper is the description of the new additions to the model, allowing the integration and usage of non-XML formats in processing flows and new equivalence relations between XML formats.

References

  1. Cristea D., Forascu C., Pistol I. : Requirements-Driven Automatic Configuration of Natural Language Applications. In Bernadette Sharp (Ed.): Proceedings of the 3rd International Workshop on Natural Language Understanding and Cognitive Science - NLUCS 2006, in conjunction with ICEIS 2006, Cyprus, Paphos, May 2006. INSTICC Press, Portugal. 92006) ISBN: 972-8865-50-3. (2006).
  2. Cristea, D., Pistol, I. : Managing Language Resources and Tools Using a Hierarchy of Annotation Schemas. Proceedings of the Workshop on Sustainability of Language Resources, LREC-2008, Marakesh. (2008).
  3. Cunningham H., Maynard D., Bontcheva K., Tablan V. : GATE: A framework and graphical development environment for robust NLP tools and applications. In Proceedings of the 40th Anniversary Meeting of the ACL (ACL'02). Philadelphia, US. (2002).
  4. Ferrucci D. and Lally A. : UIMA: an architectural approach to unstructured information processing in the corporate research environment, Natural Language Engineering 10, No. 3- 4, 327-348. (2004).
  5. Romary L., Ide N. : International Standard for a Linguistic Annotation Framework, Natural Language Engineering 10, 3-4 (09/2004) 211-225 (2007).
  6. Váradi T., Krauwer S., Wittenburg P., Wynne M. and Koskenniemi K. : CLARIN: Common Language Resources and Technology Infrastructure, Proceedings of LREC-2008, Marakesh (2008).
Download


Paper Citation


in Harvard Style

Pistol I. and Cristea D. (2009). Managing Metadata Variability within a Hierarchy of Annotation Schemas . In Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2009) ISBN 978-989-8111-92-0, pages 111-116. DOI: 10.5220/0002171501110116


in Bibtex Style

@conference{nlpcs09,
author={Ionuţ Cristian Pistol and Dan Cristea},
title={Managing Metadata Variability within a Hierarchy of Annotation Schemas },
booktitle={Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2009)},
year={2009},
pages={111-116},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002171501110116},
isbn={978-989-8111-92-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2009)
TI - Managing Metadata Variability within a Hierarchy of Annotation Schemas
SN - 978-989-8111-92-0
AU - Pistol I.
AU - Cristea D.
PY - 2009
SP - 111
EP - 116
DO - 10.5220/0002171501110116