was generated as tuple-based (in-memory) mediator
using plain JDBC. The EAI server shows a much bet-
ter scaling than the ETL tool, while the ETL tool has
a lower initial overhead. The resulting break-even
point may be reasoned by two facts. First, the EAI
server stores the temporary data in a coarse-grained
way (XML document), while the ETL tool uses a
fine-grained (tuple-based) temporary datastore. Sec-
ond, the overhead for XML serialization decreases
with increasing datasizes because it is executed asyn-
chronously while reading and writing data to/from the
external systems. In Figure 4b, there is the compari-
son between unoptimized and optimized (using mate-
rialization points) FDBMS DPD, which shows an in-
teresting scaling. In Figure 4c, the unoptimized EAI
server DPD is compared with optimized versions.
6 SUMMARY
The motivation behind this work and the enclosing
research project GCIP was the lack of model-driven
techniques for integration process generation. There
was also little work on process optimization using
rewriting and comparison techniques. Thus, our main
approach was the adaptation of MDA to the context
of integration processes and the application of opti-
mization techniques on different levels of abstraction.
To summarize the paper, we illustrated our approach
of generating platform-specific integration processes
for FDBMS, ETL tools and EAI servers. Further,
we classified the optimization techniques which could
be applied during model-driven process generation as
well as the optimization-influencing factors. Finally,
we described the framework implementation in short
and discussed example performance evaluations.
REFERENCES
B
¨
ohm, M., Habich, D., Lehner, W., and Wloka, U. (2008a).
Dipbench: An independent benchmark for data inten-
sive integration processes. In IIMAS.
B
¨
ohm, M., Habich, D., Lehner, W., and Wloka, U. (2008b).
Dipbench toolsuite: A framework for benchmarking
integration systems. In ICDE.
B
¨
ohm, M., Habich, D., Lehner, W., and Wloka, U.
(2008c). Model-driven development of complex and
data-intensive integration processes. In MBSDI.
B
¨
ohm, M., Habich, D., Wloka, U., Bittner, J., and Lehner,
W. (2007). Towards self-optimization of message
transformation processes. In ADBIS.
Bruno, N. and Chaudhuri, S. (2002). Exploiting statistics
on query expressions for optimization. In SIGMOD.
Dessloch, S., Maier, A., Mattos, N., and Wolfson, D.
(2003). Information integration - goals and chal-
lenges. Datenbank-Spektrum, 3(6):7–13.
Dorda, C., Heinkel, U., and Mitschang, B. (2007). Im-
proving application integration with model-driven en-
gineering. In ICITM.
Habich, D., Richly, S., and Lehner, W. (2006). Gig-
nomda - exploiting cross-layer optimization for com-
plex database applications. In VLDB’ 06.
Hahn, K., Sapia, C., and Blaschka, M. (2000). Auto-
matically generating olap schemata from conceptual
graphical models. In DOLAP.
Hull, R., Llirbat, F., Kumar, B., Zhou, G., Dong, G., and Su,
J. (2000). Optimization techniques for data-intensive
decision flows. In ICDE.
Kabra, N. and DeWitt, D. J. (1998). Efficient mid-query re-
optimization of sub-optimal query execution plans. In
SIGMOD.
K
¨
onigs, A. (2005). Model transformation with triple graph
grammars. In MODELS’ 05 workshops.
Maz
´
on, J.-N., Trujillo, J., Serrano, M., and Piattini, M.
(2005). Applying mda to the development of data
warehouses. In DOLAP.
Seol, H., Choi, J., Park, G., and Park, Y. (2007). A frame-
work for benchmarking service process using data en-
velopment analysis and decision tree. Expert Syst.
Appl., 32(2).
Simitsis, A. (2005). Mapping conceptual to logical models
for etl processes. In DOLAP.
Simitsis, A., Vassiliadis, P., and Sellis, T. (2005). Optimiz-
ing etl processes in data warehouses. In ICDE.
Srivastava, U., Munagala, K., Widom, J., and Motwani, R.
(2006). Query optimization over web services. In
VLDB.
Stillger, M., Lohman, G. M., Markl, V., and Kandil, M.
(2001). Leo - db2’s learning optimizer. In VLDB.
Trujillo, J. and Luj
´
an-Mora, S. (2003). A uml based ap-
proach for modeling etl processes in data warehouses.
In ER.
Urhan, T. and Franklin, M. J. (2001). Dynamic pipeline
scheduling for improving interactive query perfor-
mance. In VLDB.
Urhan, T., Franklin, M. J., and Amsaleg, L. (1998). Cost-
based query scrambling for initial delays. In SIG-
MOD.
Vassiliadis, P., Simitsis, A., and Skiadopoulos, S. (2002).
Conceptual modeling for etl processes. In DOLAP.
Vrhovnik, M., Schwarz, H., Suhre, O., Mitschang, B.,
Markl, V., Maier, A., and Kraft, T. (2007). An ap-
proach to optimize data processing in business pro-
cesses. In VLDB.
ICEIS 2008 - International Conference on Enterprise Information Systems
136