In the short-term, the idea is to combine our
approach to the usual ontology learning from text
ones. In this way, so as to better take advantage of
Wikipedia’s articles, it would seem interesting to
complete the approach of (Herbelot and Copestake,
2006) which exploits plain text only. We plan to also
exploit in this context redirect links and homonym
pages to maximise the number of relevant articles.
On the other hand we want to improve the analysis
of enumerative structures by going beyond simple
parsing, particularly regarding the primer. Authors
may use complex grammatical constructions or
linguistic variations in their writing, even within the
enumerative structures. We then face problems of
anaphora resolution, ellipses, apposition,
extraposition and rhetorical forms, etc. (fig. 1.).
Also, discourse analysis must be carried out to
process non-parallel enumerative structures.
REFERENCES
Auer, S., Bizer, C., Lehmann, J., Kobilarov, G., Cyganiac,
R., Ives, Z., 2007. DBpedia : a nucleus for a web of
open data. In: Proceedings of the Sixth International
Semantic Web Conference and Second Asian Semantic
Web Conference (ISWC/ASWC2007), Busan, South
Korea, vol. 4825, pp 715-728
Chernov, S., Iofciu, T., Nejdl, W., Zhou, X. , 2006.
Extracting semantic relationships between Wikipedia
categories. In: Proceedings of the First International
Workshop : SemWiki’06 - From Wiki to Semantics.
Co-located with the Third Annual European Semantic
Web Conference ESWC’06 in Budva, Montenegro
Giovannetti, E., Marchi, S., Montemagni, S.: Combining
Statistical Techniques and Lexico-syntactic Patterns
for Semantic Relation Extraction from Text. Fifth
workshop on Semantic Web Applications and
Perspectives, FA0-UN, Roma, Italy (2008)
Groza, T., Handschuh, S., Möller K., Decker, S., 2007.
SALT - Semantically Annotated LaTeX for scientific
publications. In: Proceedings of the 4th European
Semantic Web Conference (ESWC 2007). Innsbruck,
Austria
Giuliano, C., Lavelli, A., Romano, L.: Exploiting Shallow
Linguistic Information for Relation Extraction from
Biomedical Literature. In Proc. EACL (2006)
Hearst M. A.: TextTiling, 1997. Segmenting Text into
Multi-paragraph Subtopic Passages. Computational
Linguistics, volume 23, Number 1
Herbelot, A., Copestake, A., 2006: Acquiring ontological
relationships from Wikipedia using RMRS. In:
Proceedings of the International Semantic Web
Conference 2006. Workshop on Web Content Mining
with Human Language Technologies, Athens, GA
Jacquemin C., Bush C., 2000. Fouille du Web pour la
collecte d’Entités Nommées. In : E. Wehrli (Ed.),
TALN 2000, Lausanne
Kamel, M., Aussenac-Gilles, N., 2009. How can document
structure improve ontology learning? (regular paper).
In: Semantic Authoring, Annotation and Knowledge
Markup Workshop - collocated with K-CAP 2009
(SAAKM 2009), Redondo Beach, California (USA),
Siegfried Handschuh, Michael Sintek (Eds.), CEUR
Workshop Proceedings, p. 1-8
Luc, C., 2001. Une typologie des énumérations basée sur
les structures rhétoriques et architecturales du texte.
TALN2001, Université de Tours, p. 263-272
Mann, W. C., Matthiessen, C. M., Thompson, S. A., 1992.
Rhetorical structure theory and text analysis. In:
Mann, W. C. and Thompson, S. A., editors, Discourse
Description, Diverse Linguistic Analyses of a Fund-
Raising Text, pp. 39-78. John Benjamins publishing
Compagny, Amsterdam/Philadelphia
Medelyan O., Milne D., Legg C., Witten I.H., 2009.
Mining meaning from Wikipedia. International
Journal of Human-Computer studies. Volume 67,
Issue 9, pp.716-754
Nédellec, C., Nazarenko, A.: Ontology and Information
Extraction. in S. Staab & R. Studer (eds.) Handbook
on Ontologies in Information Systems, Springer (2003)
Nguyen, D.P.T., Matsuo, Y., Ishizuka, M., 2007. Relation
extraction from Wikipedia using subtree mining. In:
Proceedings of the AAAI’07 Conference, Vancouver,
Canada, July 2007, pp. 1414-1420
Power, R., Scott, D., Bouayad-Agua, N., 2003. Document
Structure. Computational linguistics, 29:4, pp. 211-
260
Rebeyrolle, J, Péry-Woodley M.-P, 1998. Repérage
d’objets textuels fonctionnels pour le filtrage
d’information : le cas de la définition. In: Rencontre
Internationale sur l’Extraction et le Filtrage et le
Résumé Automatique, Sfax, Tunisie, pp19-30
Shen, D., Yang, Q., Chen, Z., 2007. Noise reduction
through summarization for Web-page classification.
Information Processing and Management, volume 43,
issue 6, pp. 1735-1747
Wang, G., Zhang, H., Wang, H., Yu, Y., 2007. Enhancing
relation extraction by eliciting selectional constraint
features from Wikipedia. In : Proceedings of the
Natural Language Processing and Information
Systems Conference, pp. 329-340
ONTOLOGY BUILDING USING PARALLEL ENUMERATIVE STRUCTURES
281