(an XQuery program to be executed over the local
database) and to tgtQuery (an XQuery program trans-
forming the obtained answer into the target schema).
The query q
1
is propagated to (all or some) partners of
P
1
– among them also to P
1
itself. Each propagation
is recorded in table Propagations, where: propagID
identifies the propagation; qryPosId identifies the po-
sition in table Queris; srcPeer is the URL of the
source partner, where the query has been propagated;
srcAnswer is the answer obtained from the srcPeer.
34XHULHV
TU\3RV,G
TU\,GT
WJW3HHU3
WJW3URSDJ,G
WJW3RV,G
P\4XHU\T
WJW4XHU\T
WJW$QVZHU,
33URSDJDWLRQV
SURSDJ,G
TU\3RV,G
VUF3HHU3
VUF$QVZHU,
33URSDJDWLRQV
SURSDJ,G
TU\3RV,G
VUF3HHU3
VUF$QVZHU,
33URSDJDWLRQV
SURSDJ,G
TU\3RV,G
VUF3HHU3
VUF$QVZHU,
$QV
34XHULHV
TU\3RV,G
TU\,GT
WJW3HHU3
WJW3URSDJ,G
WJW3RV,G
P\4XHU\T
WJW4XHU\T
WJW$QVZHU,
34XHULHV
TU\3RV,G
TU\,GT
WJW3HHU3
WJW3URSDJ,G
WJW3RV,G
P\4XHU\T
WJW4XHU\T
WJW$QVZHU,
33URSDJDWLRQV
33URSDJDWLRQV
$QV
33URSDJDWLRQV
33URSDJDWLRQV
$QV
3
3
3
Figure 5: Query and answers propagation in SixP2P.
All srcAnswers are merged (using full or par-
tial mode) resulting to the Ans
1
. Next, tgtQuery
is evaluated over Ans
1
to obtain tgtAnswer, which
is ultimately sent to tgtPeer and stored in tgtPeer’s
Propagations table in the tuple identified by the pair
(tgtPropagId,tgtPosId). The evaluation removes du-
plicates and considers key constraints.
4 CONCLUSIONS
The paper presents a novel method for schema map-
ping and query reformulation in XML data integra-
tion systems in P2P environment. We discussed some
issues concerning query propagation strategies and
merging modes, when missing data is to be discov-
ered in the P2P integration processes. We showed,
how to use functional dependencies to select the way
of query propagation and data merging, to increase
the information content of the answer. The approach
is fully implemented in SixP2P system. We present
its general architecture, and sketched the way how
queries and answers are sent across the P2P envi-
ronment. In SixP2P, schemas, schema constraints,
schema mappings, and queries are specified in a
uniform and precise way. We develop algorithms
for automatic generation of XQuery programs which
perform operations of query reformulation and data
merging.
ACKNOWLEDGEMENTS
The work was supportedin part by the Polish Ministry
of Science and Higher Education under Grant N516
015 31/1553.
REFERENCES
Arenas, M. (2006). Normalization theory for XML. SIG-
MOD Record, 35(4):57–64.
Arenas, M. and Libkin, L. (2005). XML Data Exchange:
Consistency and Query Answering. In PODS Confer-
ence, pages 13–24.
Bernstein, P. A., Giunchiglia, F., Kementsietsidis, A., My-
lopoulos, J., Serafini, L., and Zaihrayeu, I. (2002).
Data management for peer-to-peer computing : A vi-
sion. In WebDB, pages 89–94.
Brzykcy, G., Bartoszek, J., and Pankowski, T. (2007). Se-
mantic Data Integration in P2P Environment using
Schema Mappings and Agent Technology, AMSTA
2007. In Lecture Notes in Computer Science 4496,
pages 385–394. Springer.
Fagin, R., Kolaitis, P. G., Popa, L., and Tan, W. C. (2004).
Composing schema mappings: Second-order depen-
dencies to the rescue. In PODS, pages 83–94.
Fuxman, A., Kolaitis, P. G., Miller, R. J., and Tan, W. C.
(2006). Peer data exchange. ACM Trans. Database
Syst., 31(4):1454–1498.
Koloniari, G. and Pitoura, E. (2005). Peer-to-peer manage-
ment of XML data: issues and research challenges.
SIGMOD Record, 34(2):6–17.
Madhavan, J. and Halevy, A. Y. (2003). Composing map-
pings among data sources. In VLDB, pages 572–583.
Melnik, S., Bernstein, P. A., Halevy, A. Y., and Rahm, E.
(2005). Supporting executable mappings in model
management. In SIGMOD Conference, pages 167–
178.
Pankowski, T. (2006). Management of executable schema
mappings for XML data exchange. In Database Tech-
nologies for Handling XML Information on the Web,
EDBT 2006 Workshops, Lecture Notes in Computer
Science 4254, pages 264–277.
Pankowski, T. (2008). Pattern based XML data integration
in P2P environment. submitted.
Pankowski, T., Cybulka, J., and Meissner, A. (2007). Rea-
soning About XML Schema Mappings in the Presence
of Key Constraints and Value Dependencies. In Web
Reasoning and Rule Systems, Lecture Notes in Com-
puter Science 4524, pages 374–376.
Tatarinov, I. and Halevy, A. Y. (2004). Efficient query refor-
mulation in peer-data management systems. In SIG-
MOD Conference, pages 539–550.
XML DATA INTEGRATION IN PEER-TO-PEER DATA MANAGEMENT SYSTEMS
299