COMBINING GENE EXPRESSION AND CLINICAL DATA TO INCREASE PERFORMANCE OF PROGNOSTIC BREAST CANCER MODELS
Jana Šilhavá, Pavel Smrž
2012
Abstract
Microarray class prediction is an important application of gene expression data in biomedical research. Combining gene expression data with other relevant data may add valuable information and yield more accurate prognostic predictions. In this paper, we combine gene expression data with clinical data. We use logistic regression models that can be built with various regularization techniques. The generalized linear model framework makes it possible to combine these models across data with different structures. Our two proposed approaches are evaluated on publicly available breast cancer data sets. The results show that our approaches improve prediction performance and are not computationally intensive.
Paper Citation
in Harvard Style
Šilhavá J. and Smrž P. (2012). COMBINING GENE EXPRESSION AND CLINICAL DATA TO INCREASE PERFORMANCE OF PROGNOSTIC BREAST CANCER MODELS. In Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 1: SSML, (ICAART 2012) ISBN 978-989-8425-95-9, pages 589-594. DOI: 10.5220/0003881505890594
in Bibtex Style
@conference{ssml12,
author={Jana Šilhavá and Pavel Smrž},
title={COMBINING GENE EXPRESSION AND CLINICAL DATA TO INCREASE PERFORMANCE OF PROGNOSTIC BREAST CANCER MODELS},
booktitle={Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 1: SSML, (ICAART 2012)},
year={2012},
pages={589-594},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003881505890594},
isbn={978-989-8425-95-9},
}
in EndNote Style
TY  - CONF 
JO  - Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 1: SSML, (ICAART 2012)
TI  - COMBINING GENE EXPRESSION AND CLINICAL DATA TO INCREASE PERFORMANCE OF PROGNOSTIC BREAST CANCER MODELS
SN  - 978-989-8425-95-9
AU  - Šilhavá J. 
AU  - Smrž P. 
PY  - 2012
SP  - 589
EP  - 594
DO  - 10.5220/0003881505890594