Our system is a first approximation to an automatic
system that traverses dependency trees and returns a
simplified version of the parsed sentence. We conclude
that texts can be simplified correctly using dependency
parsing, in the particular case of Spanish: the
simplified sentences are grammatically correct. On the
other hand, whichever dependency label is chosen, the
algorithms make aggressive simplifications.
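As an illustration of why label-based simplification is aggressive, the following minimal sketch removes every subtree whose root carries a chosen dependency label. It is not our implementation: the `Token` class, the `simplify` function, the example sentence, and the label names (`suj`, `cd`, `cc`, `spec`, `sn`) are assumptions introduced only for this example.

```python
from dataclasses import dataclass


@dataclass
class Token:
    idx: int     # 1-based position in the sentence
    form: str    # surface word
    head: int    # index of the head token, 0 for the root
    deprel: str  # dependency label (illustrative names, not from the paper)


def simplify(tokens, drop_labels):
    """Drop every subtree whose root token has a label in drop_labels."""
    # Map each head index to the indices of its direct dependents.
    children = {}
    for t in tokens:
        children.setdefault(t.head, []).append(t.idx)

    # Start from the tokens that carry a dropped label and collect
    # all of their descendants with an explicit stack.
    removed = set()
    stack = [t.idx for t in tokens if t.deprel in drop_labels]
    while stack:
        i = stack.pop()
        if i in removed:
            continue
        removed.add(i)
        stack.extend(children.get(i, []))

    # Keep the surviving tokens in their original order.
    return " ".join(t.form for t in tokens if t.idx not in removed)


# Hypothetical parse of "El niño come manzanas en el parque".
SENTENCE = [
    Token(1, "El", 2, "spec"),
    Token(2, "niño", 3, "suj"),
    Token(3, "come", 0, "sentence"),
    Token(4, "manzanas", 3, "cd"),
    Token(5, "en", 3, "cc"),
    Token(6, "el", 7, "spec"),
    Token(7, "parque", 5, "sn"),
]

print(simplify(SENTENCE, {"cc"}))   # drops the adjunct "en el parque"
print(simplify(SENTENCE, {"suj"}))  # drops the whole subject
```

Dropping the adjunct label keeps the core of the sentence, whereas dropping the subject label removes information the reader may need, even though the output remains grammatical in Spanish.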
We implemented a simple version of the algorithm that
focuses only on the dependency tree. We are working to
increase the number of simplification operations, and
future work might be oriented towards defining lexical
simplification operations, such as replacing “difficult”
words with a simpler synonym using a version of WordNet
(Fellbaum, 1998) for Spanish, such as EuroWordNet
(Vossen, 1998).
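The lexical operation mentioned above could look roughly like the sketch below. It is only an assumption about one possible design: a word counts as “difficult” when its corpus frequency falls below a threshold, and it is replaced by its most frequent synonym. The function `lexical_simplify`, the synonym table, and the frequency counts are invented toy data; a real system would obtain synonyms from a resource such as EuroWordNet rather than from a hand-written dictionary.

```python
def lexical_simplify(words, synonyms, frequency, threshold):
    """Replace rare words with their most frequent sufficiently common synonym."""
    simplified = []
    for word in words:
        key = word.lower()
        # Words at or above the threshold are considered easy and kept.
        if frequency.get(key, 0) >= threshold:
            simplified.append(word)
            continue
        # Only synonyms that are themselves common enough are candidates.
        candidates = [
            s for s in synonyms.get(key, [])
            if frequency.get(s, 0) >= threshold
        ]
        if candidates:
            simplified.append(max(candidates, key=lambda s: frequency.get(s, 0)))
        else:
            simplified.append(word)  # no simpler synonym known: keep the word
    return simplified


# Toy data, invented for illustration only.
SYNONYMS = {"vivienda": ["casa", "morada"]}
FREQUENCY = {"la": 5000, "es": 4000, "casa": 900, "grande": 800,
             "vivienda": 40, "morada": 20}

print(lexical_simplify(["La", "vivienda", "es", "grande"],
                       SYNONYMS, FREQUENCY, threshold=100))
```

This sketch ignores word sense and agreement; choosing a synonym that fits the context would require sense disambiguation on top of the lookup.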
REFERENCES
Bautista, S., Gervás, P., and Madrid, R. (2009).
Feasibility Analysis for SemiAutomatic Conversion of Text
to Improve Readability. In The Second International
Conference on Information and Communication
Technologies and Accessibility.
Buchholz, S. and Marsi, E. (2006). CoNLL-X shared task on
multilingual dependency parsing. In Proceedings of
the 10th Conference on Computational Natural Lan-
guage Learning (CoNLL–X), pages 149–164.
Canning, Y. (2000). Cohesive simplification of newspaper
text for aphasic readers. In 3rd annual CLUK Doc-
toral Research Colloquium.
Caseli, H. M., Pereira, T. F., Specia, L., Pardo, T. A. S.,
Gasperin, C., and Aluisio, S. M. (2009). Building a
Brazilian Portuguese Parallel Corpus of Original and
Simplified Texts. In Proceedings of CICLing.
Chandrasekar, R., Doran, C., and Srinivas, B. (1996).
Motivations and methods for text simplification. In
Proceedings of the Sixteenth International Conference
on Computational Linguistics (COLING ’96), pages
1041–1044.
Chandrasekar, R. and Srinivas, B. (1997). Automatic induc-
tion of rules for text simplification. Knowledge-Based
Systems, 10.
Devlin, S. and Tait, J. (1998). The use of a
psycholinguistic database in the simplification of text
for aphasic readers. In Linguistic Databases, pages
161–173. CSLI.
Devlin, S. and Unthank, G. (2006). Helping aphasic people
process online information. In Assets ’06: Proceed-
ings of the 8th international ACM SIGACCESS con-
ference on Computers and accessibility, pages 225–
226, New York, NY, USA. ACM.
Fellbaum, C., editor (1998). WordNet: an electronic lexical
database. MIT Press.
Inui, K., Fujita, A., Takahashi, T., Iida, R., and Iwakura,
T. (2003). Text simplification for reading assistance:
a project note. In Proceedings of the second in-
ternational workshop on Paraphrasing, pages 9–16,
Morristown, NJ, USA. Association for Computational
Linguistics.
Klebanov, B. B., Knight, K., and Marcu, D. (2004).
Text simplification for information-seeking applica-
tions. In On the Move to Meaningful Internet Systems,
Lecture Notes in Computer Science, pages 735–747.
Springer Verlag.
Kübler, S., McDonald, R. T., and Nivre, J. (2009).
Dependency Parsing. Synthesis Lectures on Human
Language Technologies. Morgan & Claypool Publishers.
Max, A. (2006). Writing for language-impaired readers. In
CICLing, pages 567–570.
McDonald, R. K. R., Nilsson, J., Riedel, S., and Yuret, D.
(2007). The CoNLL 2007 shared task on dependency
parsing.
Palomar, M., Civit, M., Díaz, A., Moreno, L., Bisbal, E.,
Aranzabe, M., Ageno, A., Martí, M., and Navarro, B.
(2004). 3LB: Construcción de una base de datos de
árboles sintáctico–semánticos para el catalán, euskera
y español. In Proceedings of the XX Conference of
the Spanish Society for Natural Language Processing
(SEPLN), pages 81–88. Sociedad Española para
el Procesamiento del Lenguaje Natural.
Petersen, S. E. and Ostendorf, M. (2007). Text
simplification for language learners: a corpus analysis.
In Proc. of Workshop on Speech and Language
Technology for Education.
Siddharthan, A. (2002). Resolving attachment and clause
boundary ambiguities for simplifying relative clause
constructs. In Proceedings of the Student Research
Workshop, 40th Meeting of the Association for
Computational Linguistics.
Siddharthan, A. (2003). Syntactic Simplification and Text
Cohesion. PhD thesis, Research on Language and
Computation.
Snow, C. E. (2002). Reading for understanding: toward an
R&D program in reading comprehension. Rand, Santa
Monica, CA.
Taulé, M., Martí, M., and Recasens, M. (2008). AnCora:
Multilevel Annotated Corpora for Catalan and Spanish.
In Proceedings of 6th International Conference
on Language Resources and Evaluation.
Vossen, P., editor (1998). EuroWordNet: a multilingual
database with lexical semantic networks. Kluwer
Academic Publishers, Norwell, MA, USA.
Williams, S., Reiter, E., and Osman, L. M. (2003).
Experiments with discourse-level choices and readability.
In Proceedings of the European Natural Language
Generation Workshop (ENLG) and 11th Conference
of the European Chapter of the Association for
Computational Linguistics (EACL03), pages 127–134.
TEXT SIMPLIFICATION USING DEPENDENCY PARSING FOR SPANISH