suitable specific parser by implementing the rules re-
lated to the rest of the problematic words, including
the other prepositions explained in Section 3 and the
nexus “que”. In doing so we would get a real and
competitive n–version dependency parser.
A basic aspect that may be strongly consid-
ered when developing machine learning–based de-
pendency parsers is the accuracy and suitability of the
train and test corpora, this has been claimed in our
previous related work and has been observed again
during the development of the present one. Not only
does it mean that the samples must be 100% error free
tagged, but that they should be carefully selected to
ensure a high recall both in the train and the test sets.
ACKNOWLEDGEMENTS
This work has been partially funded by Banco
Santander Central Hispano and Universidad Com-
plutense de Madrid under the Creaci
´
on y Consoli-
daci
´
on de Grupos de Investigaci
´
on program, Ref.
921332–953.
REFERENCES
Ballesteros, M., Herrera, J., Francisco, V., and Gerv
´
as,
P. (2010a). A feasibility study on low level tech-
niques for improving parsing accuracy for spanish us-
ing maltparser. In Artificial Intelligence: Theories,
Models and Applications, volume 6040 of Lecture
Notes in Artificial Intelligence, pages 39–48. Springer.
Ballesteros, M., Herrera, J., Francisco, V., and Gerv
´
as, P.
(2010b). Improving Parsing Accuracy for Spanish us-
ing Maltparser. Journal of the Spanish Society for
NLP (SEPLN), 44:83–90.
Ballesteros, M., Herrera, J., Francisco, V., and Gerv
´
as, P.
(2010c). Towards a N-Version Dependency Parser.
13th International Conference on Text, Speech and Di-
alogue 2010., 6231.
Buchholz, S. and Marsi, E. (2006). Conll-x shared task on
multilingual dependency parsing. In CoNLL-X ’06:
Proceedings of the Tenth Conference on Computa-
tional Natural Language Learning, pages 149–164,
Morristown, NJ, USA. Association for Computational
Linguistics.
Eisner, J. (1996). Three new probabilistic models for depen-
dency parsing: An exploration. In Proceedings of the
16th International Conference on Computational Lin-
guistics (COLING–96), pages 340–345, Copenhagen.
Herrera, J. and Gerv
´
as, P. (2008). Towards a Dependency
Parser for Greek Using a Small Training Data Set.
Journal of the Spanish Society for Natural Language
Processing (SEPLN), 41:29–36.
Herrera, J., Gerv
´
as, P., Moriano, P., Moreno, A., and
Romero, L. (2007a). Building Corpora for the De-
velopment of a Dependency Parser for Spanish Using
Maltparser. Journal of the Spanish Society for Natural
Language Processing (SEPLN), 39:181–186.
Herrera, J., Gerv
´
as, P., Moriano, P., Moreno, A., and
Romero, L. (2007b). JBeaver: un Analizador de De-
pendencias para el Espa
˜
nol Basado en Aprendizaje.
In Proceedings of the 12th Conference of the Spanish
Society for Artificial Intelligence (CAEPIA 07), Sala-
manca, Spain, pages 211–220. Asociaci
´
on Espa
˜
nola
para la Inteligencia Artificial.
McDonald, R., Lerman, K., and Pereira, F. (2006). Mul-
tilingual dependency analysis with a two-stage dis-
criminative parser. In Proceedings of the 10th Confer-
ence on Computational Natural Language Learning
(CoNLL–X), pages 216–220.
McDonald, R. and Nivre, J. (2007). Characterizing the er-
rors of data–driven dependency parsing models. In
Proceedings of the 2007 Joint Conference on Empir-
ical Methods in Natural Language Processing and
Computational Natural Language Learning, pages
122–131. Association for Computational Linguistics.
Nivre, J., Hall, J., and Nilsson, J. (2004). Memory–based
dependency parsing. In Proceedings of CoNLL–2004,
pages 49–56. Boston, MA, USA.
Nivre, J., Hall, J., Nilsson, J., Chanev, A., Eryigit, G., Kbler,
S., Marinov, S., and Marsi, E. (2007). Maltparser:
A language-independent system for data-driven de-
pendency parsing. Natural Language Engineering,
13(2):95–135.
Nivre, J., Hall, J., Nilsson, J., Eryi
˘
git, G., and Marinov, S.
(2006). Labeled pseudo–projective dependency pars-
ing with support vector machines. In Proceedings of
the 10th Conference on Computational Natural Lan-
guage Learning (CoNLL–X), pages 221–225.
Taul
´
e, M., Mart
´
ı, M. A., and Recasens, M. (2008). Ancora:
Multilevel annotated corpora for Catalan and Span-
ish. In Proceedings of the Sixth International Lan-
guage Resources and Evaluation (LREC’08), Mar-
rakech, Morocco. European Language Resources As-
sociation (ELRA).
Yamada, H. and Matsumoto, Y. (2003). Statistical depen-
dency analysis with support vector machines. In Pro-
ceedings of International Workshop of Parsing Tech-
nologies (IWPT’03), pages 195–206.
GIVING SHAPE TO AN N-VERSION DEPENDENCY PARSER - Improving Dependency Parsing Accuracy for Spanish
using Maltparser
341