FAIR Principles and Big Data: A Software Reference Architecture for Open Science
João P. C. Castro, João P. C. Castro, Lucas M. F. Romero, Anderson C. Carniel, Cristina D. Aguiar
2022
Abstract
Open Science pursues the assurance of free availability and usability of every digital outcome originated from scientific research, such as scientific publications, data, and methodologies. It motivated the emergence of the FAIR Principles, which introduce a set of requirements that contemporary data sharing repositories must adopt to provide findability, accessibility, interoperability, and reusability. However, implementing a FAIR-compliant repository has become a core problem due to two main factors. First, there is a significant complexity related to fulfilling the requirements since they demand the management of research data and metadata. Second, the repository must be designed to support the inherent big data complexity of volume, variety, and velocity. In this paper, we propose a novel FAIR-compliant software reference architecture to store, process, and query massive volumes of scientific data and metadata. We also introduce a generic metadata warehouse model to handle the repository metadata and support analytical query processing, providing different perspectives of data insights. We show the applicability of the architecture through a case study in the context of a real-world dataset of COVID-19 Brazilian patients, detailing different types of queries and highlighting their importance to big data analytics.
DownloadPaper Citation
in Harvard Style
Castro J., Romero L., Carniel A. and Aguiar C. (2022). FAIR Principles and Big Data: A Software Reference Architecture for Open Science. In Proceedings of the 24th International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 978-989-758-569-2, pages 27-38. DOI: 10.5220/0011045500003179
in Bibtex Style
@conference{iceis22,
author={João P. C. Castro and Lucas M. F. Romero and Anderson C. Carniel and Cristina D. Aguiar},
title={FAIR Principles and Big Data: A Software Reference Architecture for Open Science},
booktitle={Proceedings of the 24th International Conference on Enterprise Information Systems - Volume 2: ICEIS,},
year={2022},
pages={27-38},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011045500003179},
isbn={978-989-758-569-2},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 24th International Conference on Enterprise Information Systems - Volume 2: ICEIS,
TI - FAIR Principles and Big Data: A Software Reference Architecture for Open Science
SN - 978-989-758-569-2
AU - Castro J.
AU - Romero L.
AU - Carniel A.
AU - Aguiar C.
PY - 2022
SP - 27
EP - 38
DO - 10.5220/0011045500003179