Analysis of Incremental Learning and Windowing to Handle Combined Dataset Shifts on Binary Classification for Product Failure Prediction
Marco Spieß, Peter Reimann, Peter Reimann, Christian Weber, Bernhard Mitschang
2022
Abstract
Dataset Shifts (DSS) are known to cause poor predictive performance in supervised machine learning tasks. We present a challenging binary classification task for a real-world use case of product failure prediction. The target is to predict whether a product, e. g., a truck may fail during the warranty period. However, building a satisfactory classifier is difficult, because the characteristics of underlying training data entail two kinds of DSS. First, the distribution of product configurations may change over time, leading to a covariate shift. Second, products gradually fail at different points in time, so that the labels in training data may change, which may a concept shift. Further, both DSS show a trade-off relationship, i. e., addressing one of them may imply negative impacts on the other one. We discuss the results of an experimental study to investigate how different approaches to addressing DSS perform when they are faced with both a covariate and a concept shift. Thereby, we prove that existing approaches, e. g., incremental learning and windowing, especially suffer from the trade-off between both DSS. Nevertheless, we come up with a solution for a data-driven classifier, that yields better results than a baseline solution that does not address DSS.
DownloadPaper Citation
in Harvard Style
Spieß M., Reimann P., Weber C. and Mitschang B. (2022). Analysis of Incremental Learning and Windowing to Handle Combined Dataset Shifts on Binary Classification for Product Failure Prediction. In Proceedings of the 24th International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 978-989-758-569-2, pages 394-405. DOI: 10.5220/0011093300003179
in Bibtex Style
@conference{iceis22,
author={Marco Spieß and Peter Reimann and Christian Weber and Bernhard Mitschang},
title={Analysis of Incremental Learning and Windowing to Handle Combined Dataset Shifts on Binary Classification for Product Failure Prediction},
booktitle={Proceedings of the 24th International Conference on Enterprise Information Systems - Volume 2: ICEIS,},
year={2022},
pages={394-405},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011093300003179},
isbn={978-989-758-569-2},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 24th International Conference on Enterprise Information Systems - Volume 2: ICEIS,
TI - Analysis of Incremental Learning and Windowing to Handle Combined Dataset Shifts on Binary Classification for Product Failure Prediction
SN - 978-989-758-569-2
AU - Spieß M.
AU - Reimann P.
AU - Weber C.
AU - Mitschang B.
PY - 2022
SP - 394
EP - 405
DO - 10.5220/0011093300003179