and edges of the OEM graph. In (Marian et al.,
2001), a diff algorithm is employed for detecting
changes between two versions of an XML
document and storing them as edit scripts or deltas.
A similar approach is in (Chien, Tsotras and
Zaniolo, 2001), where a referenced-based
identification of objects is used across versions. In
(Gergatsoulis and Stavrakas, 2003) MXML, an
XML extension that uses context information to
express time and model multifaceted documents, is
proposed. Other works deal with change modelling
(Rizzolo et al., 2009) and detection (Papavassiliou
et al., 2009), (Galani, Papastefanatos and
Stavrakas, 2016) on semantic data and RDF.
In (Rizzolo and Vaisman, 2008), an XML
document is modelled as a directed graph and
transaction time is attached at the edges. In (Gao
and Snodgrass, 2003), a temporal query language
for adding valid time support in XQuery is
presented. In (Wang and Zaniolo, 2003) a
temporally grouped data model is employed for
uniformly representing and querying versions. In
(Moon et al., 2008), this technique is extended for
publishing the history of a relational database in
XML and a set of schema modification operators
(SMOs) is used for representing mappings between
successive schema versions. (Amagasa, Yoshikawa
and Uemura, 2000) deal with archiving curated
databases for scientific data, using timestamps and
merging all versions into one hierarchy. (Buneman,
Chapman and Cheney, 2006) deal with provenance
in curated databases. User actions are recorded in
sequence and stored as provenance links.
Our model introduces a change-based view for
evolving data. Changes are not derived by data
versions, but are modelled as first class citizens
along with data. Changes are not described via
diffs or transformations with edit scripts between
versions, but are complex objects operating on
data, exhibiting structural, semantic, and temporal
properties. Thus, querying evolution involves
searching on both data and change structure, using
temporal- and change-based conditions. Change-
centric modelling can provide additional
information on what, why, and how data evolved.
5 CONCLUSIONS
In this paper, we formally defined evo-path: a
language for querying evolving data and changes
in a uniform way. Evo-path operates on evo-graph,
a model that captures data versions and structured
changes. We also defined evo-path translation into
plain XPath expressions, which are evaluated on
evoXML, an XML representation of evo-graph.
Our next steps involve experimenting evo-path.
ACKNOWLEDGEMENTS
This research has been funded by the project
"Moving from Big Data Management to Data
Science" (MIS 5002437/3)-Action "Reinforcement
of the Research and Innovation Infrastructure"
(funded by Greece and the European Regional
Development Fund).
REFERENCES
Amagasa, T., Yoshikawa, M., Uemura, S. (2000). A
Data Model for Temporal XML Documents. In
DEXA.
Buneman, P., Khanna, S., Tajima, K., Tan, W.C. (2004).
Archiving Scientific Data. In ACM Transactions on
Database Systems, Vol. 20, pp 1-39.
Buneman, P., Chapman, A. P., Cheney, J. (2006).
Provenance Management in Curated Databases. In
SIGMOD'06.
Chawathe, S., Abiteboul, S., Widom, J. (1999).
Managing Historical Semistructured Data. In
Journal of Theory and Practice of Object Systems,
Vol. 24(4), pp.1-20.
Chien, S-Y., Tsotras, V. J., Zaniolo, C. (2001). Efficient
Management of Multiversion Documents by Object
Referencing. In VLDB.
Gao, D., Snodgrass, R. T. (2003). Temporal Slicing in
the Evaluation of XML Queries. In VLDB.
Gergatsoulis, M., Stavrakas, Y. (2003). Representing
Changes in XML Documents using Dimensions. In
1st International XML Database Symposium.
Marian, A., Abiteboul, S., Cobena, G., Mignet, L.
(2001). Change-Centric Management of Versions in
an XML Warehouse. In VLDB.
Moon, H.J., Curino, C., Deutsch, A., Hou, C.Y.,
Zaniolo, C. (2008). Managing and querying
transaction-time databases under schema evolution.
In VLDB.
National research council - Committee on Frontiers at
the Interface of Computing and Biology (2005).
Catalyzing Inquiry at the Interface of Computing and
Biology. Edited by J. C. Wooley, H. S. Lin.,
National Academies Press.
Papavassiliou, V., Flouris, G., Fundulaki, I., Kotzinos,
D., Christophides, V. (2009). On Detecting High-
Level Changes in RDF/S KBs. In ISWC.
Rizzolo, F., Vaisman, A. A. (2008). Temporal XML:
modeling, indexing, and query processing. In VLDB
J., 17(5): 1179-1212.