Effect of Errors on the Evaluation of Machine Learning Systems
Vanessa Bracamonte, Seira Hidano, Shinsaku Kiyomoto
2022
Abstract
Information such as accuracy and outcome explanations can be useful for evaluating machine learning systems, but it can also lead to over-trust. An evaluator may then not suspect that a machine learning system could have errors, and may overlook problems in the explanations of those systems. Research has shown that errors not only decrease trust but can also promote curiosity about the performance of the system. Presenting errors to evaluators may therefore be a way to induce suspicion during the evaluation of a machine learning system. In this paper, we evaluate this possibility by conducting three experiments in which we asked participants to evaluate text classification systems. We presented two types of errors: incorrect predictions and errors in the explanation. The results show that patterns of errors in the explanation negatively influenced willingness to recommend a system, and that fewer participants chose the system with higher accuracy when there was an error pattern than when the errors were random. Moreover, more participants cited evidence from the explanations as a reason for their evaluation of the systems, suggesting that they were able to detect error patterns.
Paper Citation
in Harvard Style
Bracamonte V., Hidano S. and Kiyomoto S. (2022). Effect of Errors on the Evaluation of Machine Learning Systems. In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 2: HUCAPP; ISBN 978-989-758-555-5, SciTePress, pages 48-57. DOI: 10.5220/0010839300003124
in Bibtex Style
@conference{hucapp22,
author={Vanessa Bracamonte and Seira Hidano and Shinsaku Kiyomoto},
title={Effect of Errors on the Evaluation of Machine Learning Systems},
booktitle={Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 2: HUCAPP},
year={2022},
pages={48-57},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010839300003124},
isbn={978-989-758-555-5},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 2: HUCAPP
TI - Effect of Errors on the Evaluation of Machine Learning Systems
SN - 978-989-758-555-5
AU - Bracamonte V.
AU - Hidano S.
AU - Kiyomoto S.
PY - 2022
SP - 48
EP - 57
DO - 10.5220/0010839300003124
PB - SciTePress