Learning Domain-specific Grammars from a Small Number of Examples
Herbert Lange, Peter Ljunglöf
2020
Abstract
In this paper we investigate the problem of grammar inference from a different perspective. The common approach is to try to infer a grammar directly from example sentences, which either requires a large training set or suffers from bad accuracy. We instead view it as a problem of grammar restriction or sub-grammar extraction. We start from a large-scale resource grammar and a small number of examples, and find a sub-grammar that still covers all the examples. To do this we formulate the problem as a constraint satisfaction problem, and use an existing constraint solver to find the optimal grammar. We have made experiments with English, Finnish, German, Swedish and Spanish, which show that 10–20 examples are often sufficient to learn an interesting domain grammar. Possible applications include computer-assisted language learning, domain-specific dialogue systems, computer games, Q/A-systems, and others.
DownloadPaper Citation
in Harvard Style
Lange H. and Ljunglöf P. (2020). Learning Domain-specific Grammars from a Small Number of Examples. In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI, ISBN 978-989-758-395-7, pages 422-430. DOI: 10.5220/0009371304220430
in Bibtex Style
@conference{nlpinai20,
author={Herbert Lange and Peter Ljunglöf},
title={Learning Domain-specific Grammars from a Small Number of Examples},
booktitle={Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI,},
year={2020},
pages={422-430},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009371304220430},
isbn={978-989-758-395-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI,
TI - Learning Domain-specific Grammars from a Small Number of Examples
SN - 978-989-758-395-7
AU - Lange H.
AU - Ljunglöf P.
PY - 2020
SP - 422
EP - 430
DO - 10.5220/0009371304220430