loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Carlo A. Curino 1 ; Hyun J. Moon 2 ; Letizia Tanca 1 and Carlo Zaniolo 2

Affiliations: 1 DEI, Politecnico di Milano, Italy ; 2 CSD, UCLA, United States

Keyword(s): Schema evolution, Wikipedia, Case Study, Benchmark.

Related Ontology Subjects/Areas/Topics: Databases and Information Systems Integration ; Enterprise Information Systems ; Software Measurement ; Web Databases

Abstract: Evolving the database that is at the core of an Information System represents a difficult maintenance problem that has only been studied in the framework of traditional information systems. However, the problem is likely to be even more severe in web information systems, where open-source software is often developed through the contributions and collaboration of many groups and individuals. Therefore, in this paper, we present an in-depth analysis of the evolution history of the Wikipedia database and its schema; Wikipedia is the best-known example of a large family of web information systems built using the open-source software MediaWiki. Our study is based on: (i) a set of Schema Modification Operators that provide a simple conceptual representation for complex schema changes, and (ii) simple software tools to automate the analysis. This framework allowed us to dissect and analyze the 4.5 years of Wikipedia history, which was short in time, but intense in terms of growth and evolut ion. Beyond confirming the initial hunch about the severity of the problem, our analysis suggests the need for developing better methods and tools to support graceful schema evolution. Therefore, we briefly discuss documentation and automation support systems for database evolution, and suggest that the Wikipedia case study can provide the kernel of a benchmark for testing and improving such systems. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.145.9.200

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
A. Curino, C.; J. Moon, H.; Tanca, L. and Zaniolo, C. (2008). SCHEMA EVOLUTION IN WIKIPEDIA - Toward a Web Information System Benchmark. In Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 4: ICEIS; ISBN 978-989-8111-36-4; ISSN 2184-4992, SciTePress, pages 323-332. DOI: 10.5220/0001713003230332

@conference{iceis08,
author={Carlo {A. Curino}. and Hyun {J. Moon}. and Letizia Tanca. and Carlo Zaniolo.},
title={SCHEMA EVOLUTION IN WIKIPEDIA - Toward a Web Information System Benchmark},
booktitle={Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 4: ICEIS},
year={2008},
pages={323-332},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001713003230332},
isbn={978-989-8111-36-4},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 4: ICEIS
TI - SCHEMA EVOLUTION IN WIKIPEDIA - Toward a Web Information System Benchmark
SN - 978-989-8111-36-4
IS - 2184-4992
AU - A. Curino, C.
AU - J. Moon, H.
AU - Tanca, L.
AU - Zaniolo, C.
PY - 2008
SP - 323
EP - 332
DO - 10.5220/0001713003230332
PB - SciTePress