Synthetic Data Generation and Federated Learning as Innovative Solutions for Data Privacy in Finance
Elif Özcan, Elif Özcan, Ruşen Akkuş Halepmollası, Ruşen Akkuş Halepmollası, Yusuf Yaslan
2025
Abstract
Financial services generate vast, complex and diverse datasets, yet data privacy issues pose significant challenges for secure usage and collaborative analysis. Synthetic data generation can offer an innovative solution while preserving privacy without exposing sensitive information. Also, federated learning enables collaborative model training across clients while maintaining data privacy. In this study, we used Default Credit Card dataset and employed diffusion based synthetic data generation to evaluate its impact on centralized and federated learning approaches. To this end, we offer comprehensive benchmarking of synthetic, real, and hybrid datasets by employing four machine learning classifiers both centrally and federated. Our findings demonstrate that synthetic data effectively improves results, especially when combined with real data. We also conduct client specific experiments in federated learning when addressing highly imbalanced or incomplete class distributions. Moreover, we evaluate FedF1 aggregation method, which aims to improve global model performance by optimizing F1-score. To the best of our knowledge, this is the first study to integrate synthetic data generation and federated learning on a financial dataset to provide valuable insights for secure and collaborative learning.
DownloadPaper Citation
in Harvard Style
Özcan E., Halepmollası R. and Yaslan Y. (2025). Synthetic Data Generation and Federated Learning as Innovative Solutions for Data Privacy in Finance. In Proceedings of the 7th International Conference on Finance, Economics, Management and IT Business - Volume 1: FEMIB; ISBN 978-989-758-748-1, SciTePress, pages 78-89. DOI: 10.5220/0013440900003956
in Bibtex Style
@conference{femib25,
author={Elif Özcan and Ruşen Halepmollası and Yusuf Yaslan},
title={Synthetic Data Generation and Federated Learning as Innovative Solutions for Data Privacy in Finance},
booktitle={Proceedings of the 7th International Conference on Finance, Economics, Management and IT Business - Volume 1: FEMIB},
year={2025},
pages={78-89},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013440900003956},
isbn={978-989-758-748-1},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 7th International Conference on Finance, Economics, Management and IT Business - Volume 1: FEMIB
TI - Synthetic Data Generation and Federated Learning as Innovative Solutions for Data Privacy in Finance
SN - 978-989-758-748-1
AU - Özcan E.
AU - Halepmollası R.
AU - Yaslan Y.
PY - 2025
SP - 78
EP - 89
DO - 10.5220/0013440900003956
PB - SciTePress