Using Well-Known Techniques to Visualize Characteristics of Data Quality
Roy Ruddle
2023
Abstract
Previous work has identified more than 100 distinct characteristics of data quality, most of which are aspects of completeness, accuracy and consistency. Other work has developed new techniques for visualizing data quality, but there is a lack of research into how users visualize data quality issues with existing, well-known techniques. We investigated how 166 participants identified and illustrated data quality issues that occurred in a 54-file, longitudinal collection of open data. The issues that participants identified spanned 27 different characteristics, nine of which do not appear in existing data quality taxonomies. Participants adopted nine visualization and tabular methods to illustrate the issues, using the methods in five ways (quantify; alert; examples; serendipitous discovery; explain). The variety of serendipitous discoveries was noteworthy, as was how rarely participants used visualization to illustrate completeness and consistency, compared with accuracy. We conclude by presenting a 106-item data quality taxonomy that combines seven previous works with our findings.
DownloadPaper Citation
in Harvard Style
Ruddle R. (2023). Using Well-Known Techniques to Visualize Characteristics of Data Quality. In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 3: IVAPP; ISBN 978-989-758-634-7, SciTePress, pages 89-100. DOI: 10.5220/0011664300003417
in Bibtex Style
@conference{ivapp23,
author={Roy Ruddle},
title={Using Well-Known Techniques to Visualize Characteristics of Data Quality},
booktitle={Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 3: IVAPP},
year={2023},
pages={89-100},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011664300003417},
isbn={978-989-758-634-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 3: IVAPP
TI - Using Well-Known Techniques to Visualize Characteristics of Data Quality
SN - 978-989-758-634-7
AU - Ruddle R.
PY - 2023
SP - 89
EP - 100
DO - 10.5220/0011664300003417
PB - SciTePress