Authors:
Raja Bensalem Bahloul
1
;
Kais Haddar
1
and
Philippe Blache
2
Affiliations:
1
Higher Institute of Computer Science and Multimedia, Tunisia
;
2
Université de Provence, France
Keyword(s):
Formal Modelling, Treebank Enrichment, Arabic Language, Property Grammar.
Related
Ontology
Subjects/Areas/Topics:
Applications
;
Artificial Intelligence
;
Knowledge Engineering and Ontology Development
;
Knowledge-Based Systems
;
Natural Language Processing
;
Pattern Recognition
;
Symbolic Systems
Abstract:
The enrichment of an Arabic treebank with syntactic properties can facilitate many types of parsing processes. This enrichment allows also the increase of its use in different NLP applications, the acquirement of new linguistic resources and the ease of the probabilistic parsing process by using statistics to limit the properties to the satisfied ones or to the most frequent ones. In this context, our proposed enrichment method is based on a formalization phase, a Property Grammar induction phase from a source treebank and a treebank regeneration phase with a new syntactic property-based representation. Starting with a formalization phase in our enrichment problem may succeed its resolution procedure. In fact, it limits the specification of the data sets and the interactions between them to the used ones, which avoids any duplication. The formalization allows also the anticipation of the constraints to respect in the problem. The implementation of this enrichment method is experiment
ed essentially on the Arabic treebank ATB. This experiment provides us with good and encouraging results and various properties of different types.
(More)