A Distance-Based Feature Selection Approach for Software Anomaly Detection

Suravi Akhter, Afia Sajeeda, Ahmedul Kabir

2023

Abstract

An anomaly of software refers to a bug or defect or anything that causes the software to deviate from its normal behavior. Anomalies should be identified properly to make more stable and error-free software systems. There are various machine learning-based approaches for anomaly detection. For proper anomaly detection, feature selection is a necessary step that helps to remove noisy and irrelevant features and thus reduces the dimensionality of the given feature vector. Most of the existing feature selection methods rank the given features using different selection criteria, such as mutual information (MI) and distance. Furthermore, these, especially MI-based methods fail to capture feature interaction during the ranking/selection process in case of larger feature dimensions which degrades the discrimination ability of the selected feature set. Moreover, it becomes problematic to make a decision about the appropriate number of features from the ranked feature set to get acceptable performance. To solve these problems, in this paper we propose anomaly detection for software data (ADSD), which is a feature subset selection method and is able to capture interactive and relevant feature subsets. Experimental results on 15 benchmark software defect datasets and two bug severity classification datasets demonstrate the performance of ADSD in comparison to four state-of-the-art methods.

Download


Paper Citation


in Harvard Style

Akhter S., Sajeeda A. and Kabir A. (2023). A Distance-Based Feature Selection Approach for Software Anomaly Detection. In Proceedings of the 18th International Conference on Evaluation of Novel Approaches to Software Engineering - Volume 1: ENASE, ISBN 978-989-758-647-7, SciTePress, pages 149-157. DOI: 10.5220/0011859500003464


in Bibtex Style

@conference{enase23,
author={Suravi Akhter and Afia Sajeeda and Ahmedul Kabir},
title={A Distance-Based Feature Selection Approach for Software Anomaly Detection},
booktitle={Proceedings of the 18th International Conference on Evaluation of Novel Approaches to Software Engineering - Volume 1: ENASE,},
year={2023},
pages={149-157},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011859500003464},
isbn={978-989-758-647-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 18th International Conference on Evaluation of Novel Approaches to Software Engineering - Volume 1: ENASE,
TI - A Distance-Based Feature Selection Approach for Software Anomaly Detection
SN - 978-989-758-647-7
AU - Akhter S.
AU - Sajeeda A.
AU - Kabir A.
PY - 2023
SP - 149
EP - 157
DO - 10.5220/0011859500003464
PB - SciTePress