Comparing Data Store Performance for Full-Text Search: To SQL or to NoSQL?

George Fotopoulos, Paris Koloveas, Paraskevi Raftopoulou, Christos Tryfonopoulos

2023

Abstract

The amount of textual data produced nowadays is constantly increasing as the number and variety of both new and reproduced textual information created by humans and (lately) also by bots is unprecedented. Storing, handling and querying such high volumes of textual data have become more challenging than ever and both research and industry have been using various alternatives, ranging from typical Relational Database Management Systems to specialised text engines and NoSQL databases, in an effort to cope with the volume. However, all these decisions are, largely, based on experience or personal preference for one system over another, since there is no performance comparison study that compares the available solutions regarding full-text search and retrieval. In this work, we fill this gap in the literature by systematically comparing four popular databases in full-text search scenarios and reporting their performance across different datasets, full-text search operators and parameters. To the best of our knowledge, our study is the first to go beyond the comparison of characteristics, like expressiveness of the query language or popularity, and actually compare popular relational, NoSQL, and textual data stores in terms of retrieval efficiency for full-text search. Moreover, our findings quantify the differences in full-text search performance between the examined solutions and reveal both anticipated and less anticipated results.

Download


Paper Citation


in Harvard Style

Fotopoulos G., Koloveas P., Raftopoulou P. and Tryfonopoulos C. (2023). Comparing Data Store Performance for Full-Text Search: To SQL or to NoSQL?. In Proceedings of the 12th International Conference on Data Science, Technology and Applications - Volume 1: DATA; ISBN 978-989-758-664-4, SciTePress, pages 406-413. DOI: 10.5220/0012089200003541


in Bibtex Style

@conference{data23,
author={George Fotopoulos and Paris Koloveas and Paraskevi Raftopoulou and Christos Tryfonopoulos},
title={Comparing Data Store Performance for Full-Text Search: To SQL or to NoSQL?},
booktitle={Proceedings of the 12th International Conference on Data Science, Technology and Applications - Volume 1: DATA},
year={2023},
pages={406-413},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012089200003541},
isbn={978-989-758-664-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 12th International Conference on Data Science, Technology and Applications - Volume 1: DATA
TI - Comparing Data Store Performance for Full-Text Search: To SQL or to NoSQL?
SN - 978-989-758-664-4
AU - Fotopoulos G.
AU - Koloveas P.
AU - Raftopoulou P.
AU - Tryfonopoulos C.
PY - 2023
SP - 406
EP - 413
DO - 10.5220/0012089200003541
PB - SciTePress