using the Ant Colony Optimization (ACO) heuristic.
We evaluated the quality and complexities
of our location method. Since ACO relies on ants, or
agents, that explore the solution space and cyclically
improve the results, we determined the number of
agents that yields the most efficient exploration.
Each round consists of two phases, traversal and
prizing, and we devised a prizing mechanism that
accelerates convergence. For the instances of GE
addressed here, we proposed a model
that predicts the total number of rounds as a linear
function of each of the parameters under study.
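To make the round structure concrete, the following is a minimal sketch of an ACO loop with a traversal phase and a prizing phase. It is not the allocation procedure evaluated in this paper: the function name aco_assign, the overflow-based cost, and the parameters n_ants, n_rounds, evaporation and prize are all assumptions chosen for illustration, and the sketch runs on a single machine rather than as a distributed algorithm.

```python
import random


def aco_assign(sizes, capacities, n_ants=8, n_rounds=50,
               evaporation=0.1, prize=1.0, seed=0):
    """Assign each item to a store node; return (assignment, cost).

    Illustrative only: the cost (total storage overflow) and all
    parameter names are assumptions, not the paper's model.
    """
    rng = random.Random(seed)
    n_items, n_nodes = len(sizes), len(capacities)
    # pheromone[i][j]: learned desirability of placing item i on node j
    pheromone = [[1.0] * n_nodes for _ in range(n_items)]
    best, best_cost = None, float("inf")

    def cost(assignment):
        # Hypothetical cost: sum of per-node load above capacity.
        load = [0.0] * n_nodes
        for item, node in enumerate(assignment):
            load[node] += sizes[item]
        return sum(max(0.0, l - c) for l, c in zip(load, capacities))

    for _ in range(n_rounds):
        # Phase 1, traversal: each ant samples a complete assignment,
        # choosing nodes with probability proportional to pheromone.
        round_best, round_cost = None, float("inf")
        for _ in range(n_ants):
            assignment = [
                rng.choices(range(n_nodes), weights=pheromone[i])[0]
                for i in range(n_items)
            ]
            c = cost(assignment)
            if c < round_cost:
                round_best, round_cost = assignment, c

        # Phase 2, prizing: evaporate all trails, then reward the
        # choices made by the best ant of this round.
        for i in range(n_items):
            for j in range(n_nodes):
                pheromone[i][j] *= 1.0 - evaporation
            pheromone[i][round_best[i]] += prize / (1.0 + round_cost)

        if round_cost < best_cost:
            best, best_cost = round_best, round_cost

    return best, best_cost
```

In this sketch, a larger prize or a stronger evaporation rate concentrates the search sooner, which shortens the number of rounds at the risk of settling on a poor assignment; the number of ants trades exploration per round against work per round.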
Some important issues remain as directions for
further work. For the sake of simplicity, we assumed
that each document in the CSW is labelled with a
single index. What should we do with multi-labelled
or highly nested contents? How should we deal with
the growth of the CSW? Preliminary work suggests that
multi-labelled documents can fit well in our look-up
table by means of linking mechanisms that
subordinate all the indices of a document to a
couple of main concepts, which define the actual
location of the corresponding file. As for the CSW
dynamics, we consider that spare storage capacity must
be reserved to accommodate medium-term growth.
In the long term, the whole
CSW partitioning, i.e. its granularity, may need to be
redefined and a new allocation procedure
invoked. It is also possible that, whenever a small
subset of related concepts shows rapid growth in
the size of its documents, the entire collection
might migrate to a new store node.
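One possible reading of the linking mechanism for multi-labelled documents is sketched below. The class LookupTable, its method names, and the concept and node names in the usage example are hypothetical and do not come from our implementation; the sketch only assumes that every index of a document stays searchable while a small set of main concepts alone determines where the file is stored.

```python
class LookupTable:
    """Hypothetical look-up table with index-to-main-concept links."""

    def __init__(self):
        self.node_of_concept = {}  # concept -> store node
        self.main_concepts = {}    # doc_id -> tuple of main concepts
        self.docs_by_index = {}    # any index -> set of doc_ids

    def place_concept(self, concept, node):
        # Record which store node holds documents of this concept.
        self.node_of_concept[concept] = node

    def add_document(self, doc_id, indices, main):
        # The main concepts must be among the document's indices and
        # must already have been allocated to a store node.
        for concept in main:
            if concept not in indices:
                raise ValueError(f"{concept!r} is not an index of {doc_id!r}")
            if concept not in self.node_of_concept:
                raise KeyError(f"{concept!r} has no store node")
        self.main_concepts[doc_id] = tuple(main)
        # Every index, main or secondary, links back to the document.
        for index in indices:
            self.docs_by_index.setdefault(index, set()).add(doc_id)

    def locate(self, doc_id):
        # The file location is defined by the main concepts only.
        return sorted({self.node_of_concept[c]
                       for c in self.main_concepts[doc_id]})

    def find(self, index):
        # Search by any index; resolve each hit through its main concepts.
        return {doc: self.locate(doc)
                for doc in sorted(self.docs_by_index.get(index, ()))}


# Usage with made-up concepts and node names:
table = LookupTable()
table.place_concept("cardiology", "node-A")
table.place_concept("imaging", "node-B")
table.add_document("d1", ["cardiology", "imaging", "pediatrics"],
                   main=["cardiology"])
hits = table.find("pediatrics")
```

Here a query on the secondary index "pediatrics" still reaches document d1, but the answer points to the node of its main concept, so the allocation procedure only ever has to place main concepts.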
Distributed storage is driving many R&D efforts.
From the users’ point of view, it may turn into the
basic mechanism able to unleash the potential
benefits of knowledge management. Health sciences,
agriculture, and geomatics are only a few examples of
the many domains that may dramatically improve
their operations with the adoption of this new trend.
DISTRIBUTED ALLOCATION OF A CORPORATE SEMANTIC WEB