Using Well-Known Techniques to Visualize Characteristics of Data Quality

Roy Ruddle

2023

Abstract

Previous work has identified more than 100 distinct characteristics of data quality, most of which are aspects of completeness, accuracy and consistency. Other work has developed new techniques for visualizing data quality, but there is a lack of research into how users visualize data quality issues with existing, well-known techniques. We investigated how 166 participants identified and illustrated data quality issues that occurred in a 54-file, longitudinal collection of open data. The issues that participants identified spanned 27 different characteristics, nine of which do not appear in existing data quality taxonomies. Participants adopted nine visualization and tabular methods to illustrate the issues, using the methods in five ways (quantify; alert; examples; serendipitous discovery; explain). The variety of serendipitous discoveries was noteworthy, as was how rarely participants used visualization to illustrate completeness and consistency, compared with accuracy. We conclude by presenting a 106-item data quality taxonomy that combines seven previous works with our findings.

Download


Paper Citation


in Harvard Style

Ruddle R. (2023). Using Well-Known Techniques to Visualize Characteristics of Data Quality. In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 3: IVAPP; ISBN 978-989-758-634-7, SciTePress, pages 89-100. DOI: 10.5220/0011664300003417


in Bibtex Style

@conference{ivapp23,
author={Roy Ruddle},
title={Using Well-Known Techniques to Visualize Characteristics of Data Quality},
booktitle={Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 3: IVAPP},
year={2023},
pages={89-100},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011664300003417},
isbn={978-989-758-634-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 3: IVAPP
TI - Using Well-Known Techniques to Visualize Characteristics of Data Quality
SN - 978-989-758-634-7
AU - Ruddle R.
PY - 2023
SP - 89
EP - 100
DO - 10.5220/0011664300003417
PB - SciTePress