NIH prognostics were taken from the clinical dataset
as given by (Chang
et al., 2005).
Table 2: Comparative results between hybrid markers and
pure clinical indices (NIH, St Gallen).
TP FP FN TN Sens Spec Acc
Hybrid 13 12 28 81 0.32 0.87
94/134
(70.15%)
NIH 41 91 0 2 1 0.02
41/134
(32.09%)
St
Gallen
38 85 3 8 0.93 0.09
46/134
(34.33%)
Both indices have a very high sensitivity, but an
intolerable low specificity which would lead to give
unnecessary adjuvant systematic treatment to many
patients. Thus the obtained hybrid markers
outperforms also the pure clinically indices.
4 CONCLUSIONS
In this paper a new approach to perform cancer
prognosis is proposed based on a hybrid marker
selection. We evaluated our approach on a public
available breast cancer prognosis dataset. Patients
included in this dataset are classified into two groups
according to whether a distant subclinical metastasis
was occurred or not. This dataset represents two
challenges: high-dimensionality (microarray data)
and mixed-type data (clinical data). To cope
appropriately with this, a marker selection was
performed based on a fuzzy feature selection
approach which handles both challenges. It has been
shown that the obtained hybrid markers, composed
of clinical markers and genes, can improve the
prediction accuracy and outperform both genetic
based approaches (i.e. the well-known Amsterdam
70-genes signature) and pure clinical indices (St
Gallen and NIH). Moreover, the proposed approach
reduces significantly the number of markers needed
to perform a cancer prognosis task.
Future work will be devoted to test this algorithm on
other public available datasets and integrate other
sources of information than clinical and microarray
data.
REFERENCES
Aguado J. C., and Aguilar-Martin J., 1999. A mixed
qualitative-quantitative self-learning classification
technique applied to diagnosis, QR'99 The Thirteenth
International Workshop on Qualitative Reasoning.
Chris Price, 124-128.
Chang H. Y., Nuyten D. S. A., Sneddon J. B., Hastie T.,
Tibshirani R., et al., 2005. Robustness, scalability, and
integration of a wound-response gene expression
signature in predicting breast cancer survival. PNAS
2005, 102 (10): 3738-3743
Deepa, S., Claudine I., 2005. Utilizing Prognostic and
Predictive Factors in Breast Cancer. Current
Treatment Options in Oncology 2005, 6:147-159.
Current Science Inc
Gevaert O., De Smet F., Timmerman D., Moreau Y., De
Moor B., 2006. Predicting the prognosis of breast
cancer by integrating clinical and microarray data with
Bayesian network, Bioinformatics 22 (14), 184-190.
Golub T. , Slonim D., Tamayo P., Huard C., Gaasenbeek
M., et al., 1999. Molecular Classification of Cancer
Class Discovery and Class Prediction by Gene
Expression Monitoring, Science 286 (5439), 531-537.
Guyon I., Elisseeff A., 2003. An introduction to variable
and feature selection, J. Mach. Learn. Res, 3, 1157-
1182.
Hedjazi L., Aguilar-Martin J., Le Lann M. V., and
Kempowsky T., 2010a, Membership-Margin based
Feature Selection for Mixed-Type and High-
Dimensional Data, Fuzzy Sets and Systems Journal.,
submitted for publication.
Hedjazi, L., Kempowsky T., Le Lann M. V., Aguilar-
Martin J., 2010b. Prognosis of breast cancer based on
a fuzzy classification method. 3rd International Joint
Conference on Biomedical Engineering Systems and
Technologies (BIOSTEC 2010); 1st International
Conference on Bioinformatics (BIOINFORMATICS
2010). Valence (Spain), 20-23 January 2010, pp.123-
130.
Kohavi R., and John G. H., 1997. Wrapper for feature
subset selection, Artificial Intelligence 97, 273-324.
Ramaswamy S., Tamayo P., Rifkin R., Mukherjee S.,
Yeang C., et al., 2001. MultiClass Cancer Diagnosis
Using Tumor Gene Expression Signatures, Proc. Nat’l
Acad. Sc. USA 98 ( 26) , 15149-15154.
Straver M. E., Glas A. M., Hannemann J., Wesseling J.,
van de Vijver M. J., et al.,2009. The 70-gene signature
as a response predictor for neoadjuvant chemotherapy
in breast cancer, Breast Cancer Res. Treat.,
doi:10.1007/s10549-009-0333-
Sun Y., 2007. Iterative RELIEF for Feature Weighting:
Algorithms, Theories, and Applications, IEEE TPAMI,
2 (6) , 1035-1051.
Sun Y., Goodison S., Li J., Liu L., Farmerie W., 2007.
Improved breast cancer prognosis through the
combination of clinical and genetic markers.
Bioinformatics, Gene expression. Oxford University
Press 23 (1), 30-37.
Van’t Veer L.J., Dai H., van de Vijver M. J., et al., 2002.
Gene expression profiling predicts clinical outcome of
breast cancer, Nature, 415, pp. 530-536.
Van de Vijver M. J., He Y. D., Van’t Veer L. J., Dai H.,
Hart A., et al., 2002. A Gene expression signature as a
predictor of survival in breast cancer, N Engl J Med,
347 (25), pp. 1999-2009.
BIOINFORMATICS 2011 - International Conference on Bioinformatics Models, Methods and Algorithms
164