Declarative Parsing and Annotation of Electronic Dictionaries
Christian Schneiker, Dietmar Seipel, Werner Wegstein, Klaus Prätor
2009
Abstract
We present a declarative annotation toolkit based on XML and Prolog technologies, and we apply it to annotate the Campe Dictionary to obtain an electronic version in XML (TEI). For parsing flat structures, we use a very compact grammar formalism called extended definite clause grammars (EDCGs), an extended version of the definite clause grammars (DCGs) that are well known from the logic programming language Prolog. For accessing and transforming XML structures, we use the XML query and transformation language FnQuery. It turned out that the declarative approach in Prolog is much more readable, reliable, flexible, and faster than an alternative implementation that we had made in Java and XSLT for the TextGrid community project.
Paper Citation
in Harvard Style
Schneiker C., Seipel D., Wegstein W. and Prätor K. (2009). Declarative Parsing and Annotation of Electronic Dictionaries. In Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2009), ISBN 978-989-8111-92-0, pages 122-132. DOI: 10.5220/0002203401220132
in Bibtex Style
@conference{nlpcs09,
author={Christian Schneiker and Dietmar Seipel and Werner Wegstein and Klaus Prätor},
title={Declarative Parsing and Annotation of Electronic Dictionaries},
booktitle={Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2009)},
year={2009},
pages={122-132},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002203401220132},
isbn={978-989-8111-92-0},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2009)
TI - Declarative Parsing and Annotation of Electronic Dictionaries
SN - 978-989-8111-92-0
AU - Schneiker C.
AU - Seipel D.
AU - Wegstein W.
AU - Prätor K.
PY - 2009
SP - 122
EP - 132
DO - 10.5220/0002203401220132