D-LeDe: A Data Leakage Detection Method for Automotive Perception Systems
Md Abu Ahammed Babu, Md Abu Ahammed Babu, Sushant Kumar Pandey, Darko Durisic, Ashok Chaitanya Koppisetty, Miroslaw Staron
2025
Abstract
Data leakage is a very common problem that is often overlooked during splitting data into train and test sets before training any ML/DL model. The model performance gets artificially inflated with the presence of data leakage during the evaluation phase which often leads the model to erroneous prediction on real-time deployment. However, detecting the presence of such leakage is challenging, particularly in the object detection context of perception systems where the model needs to be supplied with image data for training. In this study, we conduct a computational experiment to develop a method for detecting data leakage. We then conducted an initial evaluation of the method as a first step on a public dataset, “Kitti”, which is a popular and widely accepted benchmark dataset in the automotive domain. The evaluation results show that our proposed D-LeDe method are able to successfully detect potential data leakage caused by image similarity. A further validation was also provided to justify the evaluation outcome by conducting pair-wise image similarity analysis using perceptual hash (pHash) distance.
DownloadPaper Citation
in Harvard Style
Babu M., Pandey S., Durisic D., Koppisetty A. and Staron M. (2025). D-LeDe: A Data Leakage Detection Method for Automotive Perception Systems. In Proceedings of the 11th International Conference on Vehicle Technology and Intelligent Transport Systems - Volume 1: VEHITS; ISBN 978-989-758-745-0, SciTePress, pages 210-221. DOI: 10.5220/0013476700003941
in Bibtex Style
@conference{vehits25,
author={Md Babu and Sushant Pandey and Darko Durisic and Ashok Koppisetty and Miroslaw Staron},
title={D-LeDe: A Data Leakage Detection Method for Automotive Perception Systems},
booktitle={Proceedings of the 11th International Conference on Vehicle Technology and Intelligent Transport Systems - Volume 1: VEHITS},
year={2025},
pages={210-221},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013476700003941},
isbn={978-989-758-745-0},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 11th International Conference on Vehicle Technology and Intelligent Transport Systems - Volume 1: VEHITS
TI - D-LeDe: A Data Leakage Detection Method for Automotive Perception Systems
SN - 978-989-758-745-0
AU - Babu M.
AU - Pandey S.
AU - Durisic D.
AU - Koppisetty A.
AU - Staron M.
PY - 2025
SP - 210
EP - 221
DO - 10.5220/0013476700003941
PB - SciTePress