The Explainability-Privacy-Utility Trade-Off for Machine Learning-Based Tabular Data Analysis

Wisam Abbasi, Paolo Mori, Andrea Saracino

2023

Abstract

In this paper, we present a novel privacy-preserving data analysis model, based on machine learning, applied to tabular datasets, which defines a general trade-off optimization criterion among the measures of data privacy, model explainability, and data utility, aiming at finding the optimal compromise among them. Our approach regulates the privacy parameter of the privacy-preserving mechanism used for the applied analysis algorithms and explainability techniques. Then, our method explores all possible configurations for the provided privacy parameter and manages to find the optimal configuration with the maximum achievable privacy gain and explainability similarity while minimizing harm to data utility. To validate our methodology, we conducted experiments using multiple classifiers for a binary classification problem on the Adult dataset, a well-known tabular dataset with sensitive attributes. We used (ε,δ)-differential privacy as a privacy mechanism and multiple model explanation methods. The results demonstrate the effectiveness of our approach in selecting an optimal configuration, that achieves the dual objective of safeguarding data privacy and providing model explanations of comparable quality to those generated from real data. Furthermore, the proposed method was able to preserve the quality of analyzed data, leading to accurate predictions.

Download


Paper Citation


in Harvard Style

Abbasi W., Mori P. and Saracino A. (2023). The Explainability-Privacy-Utility Trade-Off for Machine Learning-Based Tabular Data Analysis. In Proceedings of the 20th International Conference on Security and Cryptography - Volume 1: SECRYPT; ISBN 978-989-758-666-8, SciTePress, pages 511-519. DOI: 10.5220/0012137800003555


in Bibtex Style

@conference{secrypt23,
author={Wisam Abbasi and Paolo Mori and Andrea Saracino},
title={The Explainability-Privacy-Utility Trade-Off for Machine Learning-Based Tabular Data Analysis},
booktitle={Proceedings of the 20th International Conference on Security and Cryptography - Volume 1: SECRYPT},
year={2023},
pages={511-519},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012137800003555},
isbn={978-989-758-666-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 20th International Conference on Security and Cryptography - Volume 1: SECRYPT
TI - The Explainability-Privacy-Utility Trade-Off for Machine Learning-Based Tabular Data Analysis
SN - 978-989-758-666-8
AU - Abbasi W.
AU - Mori P.
AU - Saracino A.
PY - 2023
SP - 511
EP - 519
DO - 10.5220/0012137800003555
PB - SciTePress