Applying PySCMGroup to Breast Cancer Biomarkers Discovery

Mazid Abiodoun Osseni, Prudencio Tossou, Prudencio Tossou, Jacques Corbeil, François Laviolette, François Laviolette

2021

Abstract

Background. The identification of biomarkers associated with triple-negative breast cancer (TNBC) is still an active area of research due to the complexity of finding robust biomarkers associated with the disease. Previous methods have attempted to tackle the problem from a mono-perspective view by analyzing each omics individually in the search of biomarkers. The majority of these methods mainly focus on gene expression analysis since their impact on the phenotype is easier to measure and possibly more direct. However, it is common understanding that genes belong to pathways and tend to work together within various metabolic, regulatory, and signalling pathways. Hence, in this work, we tackled the TNBC biomarker discovery problem as a multi-omic pathway-based problem by efficiently combining the biological knowledge from multiple pathways using a novel machine learning algorithm. The proposed algorithm, called GroupSCM, is an extension of the Set Covering Machine (SCM) that incorporate the pathway features as priors. Results. Although the GroupSCM performed similarly to the SCM, metric-wise, it helps identify new biomarkers not previously found by the SCM. By leveraging the pathway priors, the GroupSCM was able to uncover two miRNAs: hsa-mir-18a and hsa-mir-190b, already known to be associated with various cancers including breast cancer and yet to be linked to the Triple-Negative Breast Cancer phenotype. Conclusion. The addition of priors to the SCM leads to interpretable, complete and sparser models which are easier to analyze in vivo settings. It also provides insight into the omics interaction by highlighting the miRNAs and epigenome contribution to the prediction task. Code Availability: The code is available at: https://github.com/dizam92/BRCA experiments and paper

Download


Paper Citation


in Harvard Style

Osseni M., Tossou P., Corbeil J. and Laviolette F. (2021). Applying PySCMGroup to Breast Cancer Biomarkers Discovery. In Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - Volume 3: BIOINFORMATICS; ISBN 978-989-758-490-9, SciTePress, pages 72-82. DOI: 10.5220/0010375500002865


in Bibtex Style

@conference{bioinformatics21,
author={Mazid Abiodoun Osseni and Prudencio Tossou and Jacques Corbeil and François Laviolette},
title={Applying PySCMGroup to Breast Cancer Biomarkers Discovery},
booktitle={Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - Volume 3: BIOINFORMATICS},
year={2021},
pages={72-82},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010375500002865},
isbn={978-989-758-490-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - Volume 3: BIOINFORMATICS
TI - Applying PySCMGroup to Breast Cancer Biomarkers Discovery
SN - 978-989-758-490-9
AU - Osseni M.
AU - Tossou P.
AU - Corbeil J.
AU - Laviolette F.
PY - 2021
SP - 72
EP - 82
DO - 10.5220/0010375500002865
PB - SciTePress