Trust the Data You Use: Scalability Assurance Forms (SAF) for a Holistic Quality Assessment of Data Assets in Data Ecosystems

Maximilian Stäbler, Tobias Müller, Frank Köster, Chris Langdon

2024

Abstract

Companies generate terabytes of raw, unstructured data daily, which requires processing and organization to become valuable data assets. In the era of data-driven decision-making, evaluating these data assets’ quality is crucial for various data services, users, and ecosystems. This paper introduces ”Scalability Assurance Forms” (SAF), a novel framework to assess the quality of data assets, including raw data and semantic descriptions, with essential contextual information for cross-domain AI systems. The methodology includes a comprehensive literature review on quality models for linked data and knowledge graphs, and previous research findings on data quality. The SAF framework standardizes data asset quality assessments through 31 dimensions and 10 overarching groups derived from the literature. These dimensions enable a holistic assessment of data set quality by grouping them according to individual user requirements. The modular approach of the SAF framework ensures the maintenance of data asset quality across interconnected data sources, supporting reliable data-driven services and robust AI application development.The SAF framework addresses the need for trust in systems where participants may not know or historically trust each other, promoting the quality and reliability of data assets in diverse ecosystems.

Download


Paper Citation


in Harvard Style

Stäbler M., Müller T., Köster F. and Langdon C. (2024). Trust the Data You Use: Scalability Assurance Forms (SAF) for a Holistic Quality Assessment of Data Assets in Data Ecosystems. In Proceedings of the 20th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST; ISBN 978-989-758-718-4, SciTePress, pages 199-208. DOI: 10.5220/0012915900003825


in Bibtex Style

@conference{webist24,
author={Maximilian Stäbler and Tobias Müller and Frank Köster and Chris Langdon},
title={Trust the Data You Use: Scalability Assurance Forms (SAF) for a Holistic Quality Assessment of Data Assets in Data Ecosystems},
booktitle={Proceedings of the 20th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2024},
pages={199-208},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012915900003825},
isbn={978-989-758-718-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 20th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - Trust the Data You Use: Scalability Assurance Forms (SAF) for a Holistic Quality Assessment of Data Assets in Data Ecosystems
SN - 978-989-758-718-4
AU - Stäbler M.
AU - Müller T.
AU - Köster F.
AU - Langdon C.
PY - 2024
SP - 199
EP - 208
DO - 10.5220/0012915900003825
PB - SciTePress