with new challenges and requirements—which we
have analyzed in this in-depth study of MediaWiki,
the sofware behind Wikipedia, a WIS of great renown
and importance. Our study shows that MediaWiki
has undergone a very intensive schema evolution, as
a result of the cooperative, multi-party, open-source
development and administration that is common in
leading-edge WIS projects. Thus, the WIS environ-
ment, (i) contrasts with the smaller, less-open and
slow-turnover setting of typical in traditional informa-
tion systems, (ii) creates a more urgent needs for bet-
ter automation and documentation tools for support-
ing graceful schema evolution in WIS. In this paper
we analyze and quantify the schema evolution prob-
lem of WIS and introduce concepts and tools that rep-
resent an important first step toward realizing (ii).
At the conceptual level, we have introduced the
Schema Modification Operators (SMOs), that proved
effective both in an operational mode to support
schema evolution (Moon et al., 2008; Curino et al.,
2008b), and in an “a posteriori” mode to support in-
depth analysis. Moreover, we also developed a sim-
ple set of software tools to facilitate the analysis of
schema evolution, and the derivation of the SMOs de-
scribing such an evolution. This tool-suite proved ef-
fective in the analysis of MediaWiki and is available
online at (Curino et al., 2008a). The structured repre-
sentation of the evolution history of MediaWiki that
we derived in this project is also available for down-
loading (Curino et al., 2008a). Such data-set is cur-
rently being extended by analyzing other leading WIS
projects in order to create a rich schema evolution
benchmark. Once completed, this benchmark will
(i) provide the community with a rich set of schema
evolution examples that can be studied to avoid com-
mon up-front design errors and improve schema man-
agement best practices, and (ii) represent a critical
validation tool for techniques and systems designed
to automate the schema evolution process (includ-
ing those that are currently under development in our
lab). Indeed, the desirability of such a benchmark was
stressed in the past by other researchers working in re-
lated areas (Bernstein et al., 2006).
ACKNOWLEDGEMENTS
The authors would like to thank Alin Deutsch for the
numerous in-depth discussions on schema mapping
and query rewriting.
REFERENCES
Almeida, R. B., Mozafari, B., and Cho, J. (2007). On the
evolution of wikipedia. In Int. Conf. on Weblogs and
Social Media.
Bernstein, P. A. (2003). Applying model management to
classical meta data problems. In CIDR.
Bernstein, P. A., Green, T. J., Melnik, S., and Nash, A.
(2006). Implementing mapping composition. In
VLDB.
Curino, C., Moon, H., Tanca, L., and Zan-
iolo, C. (2008a). Pantha rei data set [on-
line]: http://yellowstone.cs.ucla.edu/schema-
evolution/index.php/Main Page.
Curino, C. A., Moon, H. J., and Zaniolo, C. (2008b). Grace-
ful database schema evolution: the prism workbench.
In UCLA Tech. Rep., 2008. Submitted for publication.
Franconi, E., Grandi, F., and Mandreoli, F. (2001). Schema
evolution and versioning: A logical and computa-
tional characterisation. Database Schema Evolution
and Meta-Modeling, pages 85–99.
Galante, R. d. M., dos Santos, C. S., Edelweiss, N., and
Moreira, A. F. (2005). Temporal and versioning model
for schema evolution in object-oriented databases.
Data & Knowledge Engineering, 53(2):99–128.
Marche, S. (1993). Measuring the stability of data
models.European Journal of Information Systems,
2(1):37–47.
Moon, H. J., Curino, C. A., Deutsch, A., Hou, C.-Y.,
and Zaniolo, C. (2008). Managing and querying
transaction-time databases under schema evolution. In
UCLA Tech. Rep., 2008. Submitted for publication.
Moro, M. M., Malaika, S., and Lim, L. (2007). Preserving
XML Queries during Schema Evolution. In WWW,
pages 1341–1342.
Ram, S. and Shankaranarayanan, G. (2003). Research
issues in database schema evolution: the road not
taken. In Boston University School of Management,
Department of Information Systems, Working Paper
No: 2003-15.
Rizzi, S. and Golfarelli, M. (2007). X-time: Schema
versioning and cross-version querying in data ware-
houses. In ICDE, pages 1471–1472.
Roddick, J. (1995). A Survey of Schema Versioning Is-
sues for Database Systems. Information and Software
Technology, 37(7):383–393.
Shneiderman, B. and Thomas, G. (1982). An architecture
for automatic relational database system conversion.
ACM Transactions on Database Systems, 7(2):235–
257.
Sjoberg, D. I. (1993). Quantifying schema evolution. Infor-
mation and Software Technology, 35(1):35–44.
Velegrakis, Y., Miller, R. J., and Popa, L. (2003). Mapping
adaptation under evolving schemas. In VLDB.
Wikimedia Foundation (2007). The mediawiki workbook
2007 dammit.lt/uc/workbook2007.pdf. [Online].
Yu, C. and Popa, L. (2005). Semantic adaptation of schema
mappings when schemas evolve. In VLDB.
ICEIS 2008 - International Conference on Enterprise Information Systems
332