Method Choice in Gene Set Analysis Has Important Consequences for Analysis Outcome

Farhad Maleki, Katie L. Ovens, Elham Rezaei, Alan M. Rosenberg, Anthony J. Kusalik

2019

Abstract

Gene set enrichment analysis is a well-established approach for gaining biological insight from expression data. With many gene set analysis methods available, the question arises of whether these methods produce consistent results. In this paper, we address this question with a systematic analysis of ten commonly used gene set analysis methods applied to microarray data. The statistical analysis suggests that the results of these methods differ significantly. Comparing the 20 most statistically significant gene sets reported by each method showed little to no agreement, regardless of the dataset used. This observation suggests that the outcome of a study can depend heavily on the choice of gene set analysis method. Comparing the 100 most statistically significant gene sets led to the same conclusion. Furthermore, biological evaluation using a juvenile idiopathic arthritis dataset agreed with the results of the statistical analysis. For some methods, the 20 most statistically significant gene sets were relevant to the biology of juvenile arthritis, supporting the utility of those methods, while most methods produced results that were irrelevant or only marginally relevant to the known biology of the disease.
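
As an illustration of what comparing the top-ranked gene sets of several methods can look like, the following minimal Python sketch computes the pairwise overlap between the k most significant gene sets reported by each method. It is not the authors' procedure: the Jaccard index as the agreement measure, the function names (top_k, jaccard, pairwise_agreement), the method names, and the p-values are all assumptions made up for this example.

```python
from itertools import combinations


def top_k(p_values, k=20):
    """Return the names of the k gene sets with the smallest p-values."""
    ranked = sorted(p_values, key=p_values.get)
    return set(ranked[:k])


def jaccard(a, b):
    """Jaccard index of two sets: |a & b| / |a | b| (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def pairwise_agreement(results, k=20):
    """Map each pair of method names to the Jaccard index of their top-k sets."""
    tops = {method: top_k(pvals, k) for method, pvals in results.items()}
    return {
        (m1, m2): jaccard(tops[m1], tops[m2])
        for m1, m2 in combinations(sorted(tops), 2)
    }


# Made-up p-values for three hypothetical methods (not data from the paper).
toy_results = {
    "method_A": {"set1": 0.001, "set2": 0.002, "set3": 0.5, "set4": 0.9},
    "method_B": {"set1": 0.003, "set2": 0.7, "set3": 0.004, "set4": 0.8},
    "method_C": {"set1": 0.6, "set2": 0.9, "set3": 0.01, "set4": 0.02},
}

# k=2 only because the toy input has four gene sets; the abstract uses 20 and 100.
agreement = pairwise_agreement(toy_results, k=2)
for pair, score in agreement.items():
    print(pair, round(score, 3))
```

A value near 0 for a pair of methods means their top-k lists share almost no gene sets, which is the kind of disagreement the abstract describes; a value of 1 means the two lists are identical.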

Paper Citation


in Harvard Style

Maleki F., Ovens K., Rezaei E., Rosenberg A. and Kusalik A. (2019). Method Choice in Gene Set Analysis Has Important Consequences for Analysis Outcome. In Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS; ISBN 978-989-758-353-7, SciTePress, pages 43-54. DOI: 10.5220/0007375000430054


in Bibtex Style

@conference{bioinformatics19,
author={Farhad Maleki and Katie L. Ovens and Elham Rezaei and Alan M. Rosenberg and Anthony J. Kusalik},
title={Method Choice in Gene Set Analysis Has Important Consequences for Analysis Outcome},
booktitle={Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS},
year={2019},
pages={43-54},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007375000430054},
isbn={978-989-758-353-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS
TI - Method Choice in Gene Set Analysis Has Important Consequences for Analysis Outcome
SN - 978-989-758-353-7
AU - Maleki F.
AU - Ovens K.
AU - Rezaei E.
AU - Rosenberg A.
AU - Kusalik A.
PY - 2019
SP - 43
EP - 54
DO - 10.5220/0007375000430054
PB - SciTePress