from both HTML and flat text documents. Furthermore,
the HıLεX system can be used to implement a new
generation of semantic wrappers. Many functions
expected from future "semantic web" technologies are
already becoming a reality today with the HıLεX system.
The approach is currently being consolidated, and its
theoretical foundations are under investigation and
improvement. Future work will focus on consolidating
and extending the HıLεX two-dimensional grammar,
investigating its computational complexity from a
theoretical point of view, extending the approach to
PDF and other document formats, and exploiting natural
language processing techniques to improve information
extraction from documents with purely textual content.
7 ADDITIONAL AUTHORS
• Tina Dell’Armi. Exeura s.r.l., University of
Calabria, 87036 Rende (CS), Italy, dellarmi@exeura.it
• Lorenzo Gallucci. Exeura s.r.l.,
University of Calabria, 87036 Rende (CS), Italy,
gallucci@exeura.it
• Nicola Leone. Department of Mathematics; Exeura
s.r.l., University of Calabria, 87036 Rende (CS),
Italy, leone@mat.unical.it
• Francesco Ricca. Department of Mathematics,
University of Calabria, 87036 Rende (CS), Italy,
ricca@mat.unical.it
• Domenico Saccà. Exeura s.r.l.; DEIS; ICAR-CNR,
University of Calabria, 87036 Rende (CS), Italy,
sacca@unical.it
A LOGIC-BASED APPROACH TO SEMANTIC INFORMATION EXTRACTION