An Efficient Decentralized Multidimensional Data Index: A Proposal

Francesco Gargiulo, Antonio Picariello, Vincenzo Moscato

2018

Abstract

The main objective of this work is the proposal of a decentralized data structure storing a large amount of data under the assumption that it is not possible or convenient to use a single workstation to host all data. The index is distributed over a computer network and the performance of the search, insert, delete operations are close to the traditional indices that use a single workstation. It is based on k-d trees and it is distributed across a network of "peers", where each one hosts a part of the tree and uses message passing for communication between peers. In particular, we propose a novel version of the k-nearest neighbour algorithm that starts the query in a randomly chosen peer and terminates the query as soon as possible. Preliminary experiments have demonstrated that in about 65% of cases it starts a query in a random peer that does not involve the peer containing the root of the tree and in the 98% of cases it terminates the query in a peer that does not contain the root of the tree.

Download


Paper Citation


in Harvard Style

Gargiulo F., Picariello A. and Moscato V. (2018). An Efficient Decentralized Multidimensional Data Index: A Proposal.In Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA, ISBN 978-989-758-318-6, pages 231-238. DOI: 10.5220/0006851202310238


in Bibtex Style

@conference{data18,
author={Francesco Gargiulo and Antonio Picariello and Vincenzo Moscato},
title={An Efficient Decentralized Multidimensional Data Index: A Proposal},
booktitle={Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA,},
year={2018},
pages={231-238},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006851202310238},
isbn={978-989-758-318-6},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA,
TI - An Efficient Decentralized Multidimensional Data Index: A Proposal
SN - 978-989-758-318-6
AU - Gargiulo F.
AU - Picariello A.
AU - Moscato V.
PY - 2018
SP - 231
EP - 238
DO - 10.5220/0006851202310238