Union k-Fold Feature Selection on Microarray Data
Artur Ferreira, Artur Ferreira, Mário Figueiredo, Mário Figueiredo
2023
Abstract
Cancer detection from microarray data is an important problem to be handled by machine learning techniques. This type of data poses many challenges to machine learning techniques, namely because it usually has large number of features (genes) and small number of instances (patients). Moreover, it is important to characterize which genes are the most important for a given classification task, providing explainability on the classification. In this paper, we propose a feature selection approach for microarray data, which is an extension of the recently proposed k-fold feature selection algorithm. We propose performing the union of the feature subspaces found independently by two feature selection filters, which have been proven to be adequate for this type of data, individually. The experimental results show that the union of the subsets of features found by each filter, in some cases, produces better results than the use of each individual filter, yielding human manageable subsets of features.
DownloadPaper Citation
in Harvard Style
Ferreira A. and Figueiredo M. (2023). Union k-Fold Feature Selection on Microarray Data. In Proceedings of the 12th International Conference on Data Science, Technology and Applications - Volume 1: DATA; ISBN 978-989-758-664-4, SciTePress, pages 540-547. DOI: 10.5220/0012135800003541
in Bibtex Style
@conference{data23,
author={Artur Ferreira and Mário Figueiredo},
title={Union k-Fold Feature Selection on Microarray Data},
booktitle={Proceedings of the 12th International Conference on Data Science, Technology and Applications - Volume 1: DATA},
year={2023},
pages={540-547},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012135800003541},
isbn={978-989-758-664-4},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 12th International Conference on Data Science, Technology and Applications - Volume 1: DATA
TI - Union k-Fold Feature Selection on Microarray Data
SN - 978-989-758-664-4
AU - Ferreira A.
AU - Figueiredo M.
PY - 2023
SP - 540
EP - 547
DO - 10.5220/0012135800003541
PB - SciTePress