Machines and Textural Analysis. In fact, this paper
demonstrates the existence of enough textural
information to discriminate proteins from noise and
background, as well as to show the potential of
SVMs in proteomic classification problems.
A new dataset with six features, starting from the
296 original ones, is created without loss of
accuracy, and the most representative textural group
is the Co-ocurrence matrix Group (second-order
statistics). In our experiments, the GLCM has
appeared as the best approximation for a good
classification of proteins in two-dimensional gel
electrophoresis. According to SVM, the 1%
histogram percentile, difference entropy, correlation,
inverse difference moment, difference variance and
sum entropy, are the most representative features for
solving this problem. A proper statistical test has
determined that there is a significant improvement in
using this reduced feature set with respect to the full
feature set.
ACKNOWLEDGEMENTS
This work is supported by the General Directorate of
Culture, Education and University Management of
the Xunta de Galicia (Ref. 10SIN105004PR). Pablo
Mesejo and Youssef S.G. Nashed are funded by the
European Comission (MIBISOC Marie Curie Initial
Training Network, FP7 PEOPLE-ITN-2008, GA n.
238819).
REFERENCES
Bartlett, M. S. (1937). "Properties of Sufficiency and
Statistical Tests." Proceedings of the Royal Society of
London. Series A, Mathematical and Physical
Sciences 160(901): 268-282.
Bonilha, L., E. Kobayashi, et al. (2003). "Texture Analysis
of Hippocampal Sclerosis." Epilepsia 44(12): 1546-
1550.
Buciu, I., C. Kotropoulos, et al. (2006). "Demonstrating
the stability of support vector machines for
classification." Signal Processing 86(9): 2364-2380.
Burges, C. J. C. (1998). "A tutorial on support vector
machines for pattern recognition." Data Mining and
Knowledge Discovery 2(2): 121-167.
Chang, C. C. and C. J. Lin (2011). "LIBSVM: A Library
for support vector machines." ACM Transactions on
Intelligent Systems and Technology 2(3).
Chapelle, O., P. Haffner, et al. (1999). "Support vector
machines for histogram-based image classification."
IEEE Transactions on Neural Networks 10(5): 1055-
1064.
Ferri, C., J. Hernádez-Orallo, et al. (2009). "An
experimental comparison of performance measures for
classification." Pattern Recognition Letters 30(1): 27-
38.
García, S., A. Fernández, et al. (2009). "A study of
statistical techniques and performance measures for
genetics-based machine learning: Accuracy and
interpretability." Soft Computing 13(10): 959-977.
Goldberg, D. (1989). Genetic Algorithms in Search,
Optimization, and Machine Learning, Addison-
Wesley Professional.
Hall, M., E. Frank, et al. (2009). "The WEKA data mining
software: an update." SIGKDD Explor. Newsl. 11(1):
10-18.
Haralick, R. M., K. Shanmugam, et al. (1973). "Textural
features for image classification." IEEE Transactions
on Systems, Man and Cybernetics smc 3(6): 610-621.
Harrison, L., P. Dastidar, et al. (2008). "Texture analysis
on MRI images of non-Hodgkin lymphoma."
Computers in Biology and Medicine 38(4): 519-524.
Holland, J. H. (1975). Adaptation in natural and artificial
systems: an introductory analysis with applications to
biology, control, and artificial intelligence, University
of Michigan Press.
Huang, C. L. and C. J. Wang (2006). "A GA-based feature
selection and parameters optimizationfor support
vector machines." Expert Systems with Applications
31(2): 231-240.
Huang, J. and C. X. Ling (2005). "Using AUC and
accuracy in evaluating learning algorithms." IEEE
Transactions on Knowledge and Data Engineering
17(3): 299-310.
Hunt, S. M. N., M. R. Thomas, et al. (2005). "Optimal
Replication and the Importance of Experimental
Design for Gel-Based Quantitative Proteomics."
Journal of Proteome Research 4(3): 809-819.
Jain, A. (1997). "Feature selection: evaluation, application,
and small sample performance." IEEE Transactions on
Pattern Analysis and Machine Intelligence
19(2): 153-
158.
Kim, K. I., K. Jung, et al. (2002). "Support vector
machines for texture classification." IEEE
Transactions on Pattern Analysis and Machine
Intelligence 24(11): 1542-1550.
Kudo, M. and J. Sklansky (1998). "A comparative
evaluation of medium- and large-scale feature
selectors for pattern classifiers." Kybernetika 34(4):
429-434.
Lemkin, P. F. ”The LECB 2D page gel image data set”,
from http://www.ccrnp.ncifcrf.gov/users/lemkin.
Létal, J., D. Jirák, et al. (2003). "MRI 'texture' analysis of
MR images of apples during ripening and storage."
LWT - Food Science and Technology 36(7): 719-727.
Li, S., J. T. Kwok, et al. (2003). "Texture classification
using the support vector machines." Pattern
Recognition 36(12): 2883-2893.
Manimala, K., K. Selvi, et al. (2011). "Hybrid soft
computing techniques for feature selection and
parameter optimization in power quality data mining."
Applied Soft Computing Journal 11(8): 5485-5497.
2D-PAGETextureClassificationusingSupportVectorMachinesandGeneticAlgorithms-AnHybridApproachfor
TextureImageAnalysis
11