Machine Learning-based Query Augmentation for SPARQL Endpoints
Mariano Rico, Rizkallah Touma, Anna Queralt, María S. Pérez
2018
Abstract
Linked Data repositories have become a popular source of publicly available data. Users accessing this data through SPARQL endpoints usually launch several restrictive yet similar consecutive queries, either to find the information they need through trial and error or to query related resources. Rather than executing each query separately, query augmentation modifies incoming queries to retrieve additional data that is potentially relevant to subsequent requests. In this paper, we propose a novel approach to query augmentation for SPARQL endpoints based on machine learning. Our approach separates the structure of the query from its contents and measures two types of similarity, which are then used to predict the structure and contents of the augmented query. We test the approach on real-world query logs of the Spanish and English DBpedia and show that it predicts the augmented queries with high accuracy. We also show that, by caching the results of the predicted augmented queries, we can retrieve data relevant to several subsequent queries at once, achieving a higher cache hit rate than previous approaches.
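The abstract does not say how the structure of a query is separated from its contents, nor which similarity measures are used. As a rough illustration only, the sketch below assumes that "contents" means the concrete IRIs, prefixed names, and literals in a query, that "structure" is what remains once they are replaced by placeholders, and that content similarity can be approximated with a Jaccard index. The names `TERM`, `split_query`, and `jaccard` are hypothetical and do not come from the paper.

```python
import re

# Assumption: "contents" are IRIs in angle brackets, quoted literals, and
# prefixed names such as dbr:Madrid. This regex is a simplification and is
# not a full SPARQL tokenizer.
TERM = re.compile(r'<[^<>\s]*>|"[^"]*"|\b[A-Za-z][\w-]*:[\w-]+')


def split_query(query):
    """Return (structure, contents) for a SPARQL query string.

    structure: the query with every concrete term replaced by a numbered
               placeholder (_1_, _2_, ...).
    contents:  the list of concrete terms, in order of appearance.
    """
    contents = []

    def replace(match):
        contents.append(match.group(0))
        return f"_{len(contents)}_"

    structure = TERM.sub(replace, query)
    return structure, contents


def jaccard(a, b):
    """Jaccard index of two collections, treated as sets."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


# Two consecutive queries that differ only in the resource being asked about.
q1 = "SELECT ?p WHERE { dbr:Madrid dbo:population ?p }"
q2 = "SELECT ?p WHERE { dbr:Barcelona dbo:population ?p }"

s1, c1 = split_query(q1)
s2, c2 = split_query(q2)

print(s1 == s2)         # both queries share one structure
print(jaccard(c1, c2))  # overlap of their contents
```

Under these assumptions, the two example queries share a single structure and differ only in one resource, which is the kind of regularity a predictor of augmented queries could exploit.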
Paper Citation
in Harvard Style
Rico M., Touma R., Queralt A. and Pérez M. (2018). Machine Learning-based Query Augmentation for SPARQL Endpoints. In Proceedings of the 14th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-758-324-7, pages 57-67. DOI: 10.5220/0006925300570067
in Bibtex Style
@conference{webist18,
author={Mariano Rico and Rizkallah Touma and Anna Queralt and María S. Pérez},
title={Machine Learning-based Query Augmentation for SPARQL Endpoints},
booktitle={Proceedings of the 14th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2018},
pages={57-67},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006925300570067},
isbn={978-989-758-324-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 14th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - Machine Learning-based Query Augmentation for SPARQL Endpoints
SN - 978-989-758-324-7
AU - Rico M.
AU - Touma R.
AU - Queralt A.
AU - Pérez M.
PY - 2018
SP - 57
EP - 67
DO - 10.5220/0006925300570067