to the repository and import of complex operators
is supported. We are currently investigating a more
declarative UI, which follows the document structure.
7 CONCLUSIONS
To summarize, this paper contribute an IE language
platform that can be a base for involving domain ex-
perts in IE plan modeling. We presented a process
for creating a domain-specific IE system. We showed
how our approach can be implemented using MDE
and a generic IE framework. As a proof of concept
we have developed AdaptIE and show how it can be
successfully applied in a real world scenario. As a fu-
ture work, we want to continue to bring IE to causal
users in two directions. The first direction is further
investigation on IE languages for non IE expert users.
The second direction is work on automatic genera-
tion (driven by information available in, e.g., database
schema or descriptions of OLAP cubes) of IE plan to
simplify performing ad-hock Business analysis over
unstructured data.
REFERENCES
Agrawal, R., Ailamaki, A., Bernstein, P. A., Brewer, E. A.,
Carey, M. J., Chaudhuri, S., Doan, A., Florescu, D.,
Franklin, M. J., Molina, H. G., Gehrke, J., Gruenwald,
L., Haas, L. M., Halevy, A. Y., Hellerstein, J. M., Ioan-
nidis, Y. E., Korth, H. F., Kossmann, D., Madden, S.,
Magoulas, R., Ooi, B. C., O’Reilly, T., Ramakrishnan,
R., Sarawagi, S., Stonebraker, M., Szalay, A. S., and
Weikum, G. (2008). The claremont report on database
research. SIGMOD Rec., 37(3):9–19.
ATLAS (2006). Atlas Transformation Language (ATL)
User Manual v0.7. Nantes.
Barczynski, W. M., Brauer, F., Loeser, A., and Mocan, A.
(2009). Algebraic information extraction of enterprise
data: Methodology and operators. In IK-KR Workshop
at 20th International Joint Conference on Artificial In-
telligence 2009 (to be published).
B
´
ezivin, J. and Heckel, R., editors (2005). Language
Engineering for Model-Driven Software Develop-
ment, 29. February - 5. March 2004, volume 04101
of Dagstuhl Seminar Proceedings. Internationales
Begegnungs- und Forschungszentrum f
¨
ur Informatik
(IBFI), Schloss Dagstuhl, Germany.
Bontcheva, K., Tablan, V., Maynard, D., and Cunningham,
H. (2004). Evolving gate to meet new challenges in
language engineering. Natural Language Engineer-
ing, 10(3-4):349–373.
Bosch, J. and Dittrich, Y. (2004). Domain-Specific Lan-
guages for a Changing World. http://www.ide.hk-
r.se/ bosch/papers/dslincw.ps.
Bouquet, P., Stoermer, H., Niederee, C., and Mana, A.
(2008). Entity Name System: The Backbone of an
Open and Scalable Web of Data. In ICSC 2008,
number CSS-ICSC 2008-4-28-25 in CSS-ICSC, pages
554–561. IEEE Computer Society.
Brauer, F., Barczynski, W., Hackenbroich, G., Schramm,
M., Mocan, A., and Foerster, F. (2009). Rankie:
Document retrieval on ranked entity graphs (demo).
In 35th conference International Conference on Very
Large Data Bases (VLDB) 2009.
DeRose, P., Shen, W., 0002, F. C., Doan, A., and Ramakr-
ishnan, R. (2007). Building structured web commu-
nity portals: A top-down, compositional, and incre-
mental approach. In (Koch et al., 2007), pages 399–
410.
EMF (2008). Eclipse Modeling Framework. Documen-
tation available at http://www.eclipse.org/modeling/
emf/.
Favre, J.-M. (2004a). Foundations of meta-pyramids: Lan-
guages vs. metamodels - episode ii: Story of thotus
the baboon1. In (B
´
ezivin and Heckel, 2005).
Favre, J.-M. (2004b). Foundations of model (driven) (re-
verse) engineering : Models - episode i: Stories of
the fidus papyrus and of the solarus. In (B
´
ezivin and
Heckel, 2005).
Ferruci, D. and Lally, A. (2004). Uima: an architectural ap-
proach to unstructured information processing in the
corporate research environment. Natural Language
Engineering, 10(3-4):327–348.
GMF (2009). http://gmf.eclipse.org.
Hevner, A. R., March, S. T., Park, J., and Ram, S. (2004).
Design science in information systems research. MIS
Quarterly, 28(1).
Koch, C., Gehrke, J., Garofalakis, M. N., Srivastava, D.,
Aberer, K., Deshpande, A., Florescu, D., Chan, C. Y.,
Ganti, V., Kanne, C.-C., Klas, W., and Neuhold, E. J.,
editors (2007). Proceedings of the 33rd International
Conference on Very Large Data Bases, University of
Vienna, Austria, September 23-27, 2007. ACM.
Petrasch, R. and Meimberg, O. (2006). Model Driven Archi-
tecture Eine praxisorientierte Einfhrung in die MDA.
dpunkt.verlag.
Reiss, F., Raghavan, S., Krishnamurthy, R., Zhu, H., and
Vaithyanathan, S. (2008). An algebraic approach to
rule-based information extraction. In ICDE, pages
933–942. IEEE.
Sarawagi, S. (2008). Information extraction. Foundations
and Trends in Databases, 1(3):261–377.
Shen, W., DeRose, P., McCann, R., Doan, A., and Ramakr-
ishnan, R. (2008). Toward best-effort information ex-
traction. In SIGMOD ’08: Proceedings of the 2008
ACM SIGMOD international conference on Manage-
ment of data, pages 1031–1042, New York, NY, USA.
ACM.
Shen, W., Doan, A., Naughton, J. F., and Ramakrishnan, R.
(2007). Declarative information extraction using dat-
alog with embedded extraction predicates. In (Koch
et al., 2007), pages 1033–1044.
ICEIS 2010 - 12th International Conference on Enterprise Information Systems
256