9 CONCLUSIONS AND FUTURE
WORKS
In this paper, we propose a solution to solve the
problem of semantic unbalance of clusters at the
intra and at the inter-cluster levels in dynamic data
integration environments. Our proposal ensures that
the clusters will always have semantically similar
peers according to the established threshold for both
intra-cluster and inter-cluster connections. The
experimental results demonstrated that our solution
is able to detect a semantic unbalance problem and
to perform the corresponding balance actions in
order to keep the semantic balance of the systems
clusters. We are currently improving the solution to
reduce the overload, over the cluster, when verifying
the semantic unbalance.
REFERENCES
Ayyasamy, S., and Sivanandam, S., 2010. A Cluster Based
Replication Architecture for Load Balancing in Peer-
to-Peer Content Distribution. International Journal of
Computer Networks & Communications (IJCNC),
vol.2, pp. 158-172.
Conforti, G., Ghelli, G., Manghi, P., and Sartiani, C.,
2004. A Self-organizing XML P2P Database System.
Proceedings of the 2004 international conference on
Current Trends in Database Technology, pp. 456-465.
Curino, C., Moon, H. J., D., Alin, and Zaniolo, C., 2013.
Automating the database schema evolution process.
Published in The VLDB Journal – The International
Journal on Very Large Data Bases, vol. 22, pp. 73-98.
Genevès, P., Layaïda, N., and Quint, V., 2011. Impact of
XML Schema Evolution. Published in Journal ACM
Transactions on Internet Technology (TOIT), vol. 11,
article 4.
Halevy, A., Rajarama, A., and Ordille, J., 2006. Data
Integration: The Teenage Years. Proceedings of the
32nd International Conference on Very large data
bases, pp 9-16. Seoul, Korea.
Halevy, A., Sarma, A. D., and Dong, X., 2008.
Bootstrapping pay-as-you-go data integration systems.
Proceeding of the 2008 ACM SIGMOD International
Conference of Data, pp. 861-874. Vancouver, Canada.
Joung, Y., and Chuang, F., 2009. OntoZilla: An ontology-
based, semi-structured, and evolutionary peer-to-peer
network for information systems and services. Journal
of Future Generation Computer Systems, vol. 25, n° 1,
pp. 53-63.
Kantere, V., Tsoumakos, D., and Sellis, T. ,2008. A
framework for semantic grouping in P2P databases.
Published in Journal Information Systems, vol. 33, pp.
611-636.
Montanelli, S., Bianchini, D., Aiello, C., Baldoni, R.,
Bolchini, C., Bonomi, S., Castano, S., Catarci, T.,
Antonellis, V., Ferrara, A., Melchiori, M., Quintarelli,
E., Scannapieco, M., Schreiber, A., and Tanca,
L.,2011. The ESTEEM platform: enabling P2P
semantic collaboration through emerging collective
knowledge. Published in Journal of Intelligent
Information Systems, vol. 36, n° 2.
Pires, C. E., Santiago, R., Kedad, Z., Bouzehoub, M. and
Salgado, A. C., 2012. Ontology-based Clustering in a
Peer Data Management System. Published in
International Journal of Distributed Systems and
Technologies (IJDST), vol. 3, Issue 2, pp. 1-21.
Raftopoulou, P., and Petrakis, E. G. M., 2008. A Measure
for Cluster Cohesion in Semantic Overlay Semantic.
Proceedings of the 2008 ACM workshop on Large-
Scale distributed systems for information retrieval.
Napa Valley, USA.
Rijsbergen, C. J., 1979. Information Retrieval, 2
nd
Edition,
MA: Butterworths.
Roth, A., and Skritek, S., 2013. Peer Data Management. In
Data Exchange, Information and Streams, vol. 5, pp.
185-215.
Silva, E. R., Salgado, A. C., 2013. Load Balance for
Semantic Cluster-based Data Integration Systems.
Proceeding of the 17th International Database
Engineering & Applications Symposium (IDEAS’13).
Barcelona, Spain.
Sockut, G. H., and Iyer, B. R, 2011. Online
Reorganization of Databases. Published in Journal
ACM Computing Surveys (CSUR), vol. 41, article 14.
Terwilliger, J. F., Bernstein, P. A., and Unnitha, A., 2010.
“Worry-Free Database Upgrades: Automated Model-
Driven Evolution of Schemas and Complex
Mappings”. Proceedings of the ACM SIGMOD
International Conference on Management of Data, pp.
1191-1194. Indianapolis, USA.
Tian, Y., Song, B., and Huh, E. N.. “Dynamic content-
based cloud data integration system with privacy and
cost concern”. Proceedings of the 8th Annual
Collaboration, Electronic messaging, Anti-Abuse and
Spam Conference, pp. 193-199. Redmond, USA.
2011.
Wall, B., and Angryk, R., 2011. Minimal Data Sets vs.
Synchronized Data Copies in a Schema and Data
Versioning System. Proceedings of the 4th workshop
on Workshop for Ph.D. students in information &
knowledge Management, pp. 67-74. Glasgow, United
Kingdom.
W3C. “OWL – Web Ontology Language”, 2013.
Available in http://www.w3.org/TR/owl-features.
Accessed on October 1
st
..
Zamboulis, L., Martin, N., and Poulovassillis, A., 2010.
Query performance evaluation of an architecture for
fine-grained integration of heterogeneous grid data
sources. Published in Journal Future Generation
Computer Science Systems, vol. 26, pp. 1073-1091.
ICEIS2014-16thInternationalConferenceonEnterpriseInformationSystems
98