Declarative Parsing and Annotation of Electronic Dictionaries

Christian Schneiker, Dietmar Seipel, Werner Wegstein, Klaus Prätor

Abstract

We present a declarative annotation toolkit based on XML and Prolog technologies, and we apply it to annotate the Campe Dictionary to obtain an electronic version in XML (TEI). For parsing flat structures, we use a very compact grammar formalism called extended definite clause grammars (EDCGs), an extension of the DCGs that are well known from the logic programming language Prolog. For accessing and transforming XML structures, we use the XML query and transformation language FnQuery. It turned out that the declarative approach in Prolog is much more readable, reliable, flexible, and faster than an alternative implementation that we had written in Java and XSLT for the TextGrid community project.


Paper Citation


in Harvard Style

Schneiker C., Seipel D., Wegstein W. and Prätor K. (2009). Declarative Parsing and Annotation of Electronic Dictionaries. In Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2009), ISBN 978-989-8111-92-0, pages 122-132. DOI: 10.5220/0002203401220132


in BibTeX Style

@conference{nlpcs09,
author={Christian Schneiker and Dietmar Seipel and Werner Wegstein and Klaus Prätor},
title={Declarative Parsing and Annotation of Electronic Dictionaries},
booktitle={Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2009)},
year={2009},
pages={122-132},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002203401220132},
isbn={978-989-8111-92-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2009)
TI - Declarative Parsing and Annotation of Electronic Dictionaries
SN - 978-989-8111-92-0
AU - Schneiker C.
AU - Seipel D.
AU - Wegstein W.
AU - Prätor K.
PY - 2009
SP - 122
EP - 132
DO - 10.5220/0002203401220132