SUBJECT RECOGNITION USING A NEW APPROACH FOR FEATURE SELECTION

Àgata Lapedriza, David Masip, Jordi Vitrià

2008

Abstract

In this paper we propose a feature selection method that uses the mutual information (MI) measure on a Principal Component Analysis (PCA) based decomposition. PCA finds a linear projection of the data in a non-supervised way, which preserves the larger variance components of the data under the reconstruction error criterion. Previous works suggest that using the MI among the PCA projected data and the class labels applied to feature selection can add the missing discriminability criterion to the optimal reconstruction feature set. Our proposal goes one step further, defining a global framework to add independent selection criteria in order to filter misleading PCA components while the optimal variables for classification are preserved. We apply this approach to a face recognition problem using the AR Face data set. Notice that, in this problem, PCA projection vectors strongly related to illumination changes and occlusions are usually preserved given their high variance. Our additional selection tasks are able to discard this type of features while the relevant features to perform the subject recognition classification are kept. The experiments performed show an improved feature selection process using our combined criterion.

References

  1. Bekkerman, R., El-Yaniv, R., Tishby, N., and Winter, Y. (2003). Distributional word clusters vs. words for text categorization. J. Mach. Learn. Res., 3:1183-1208.
  2. Bellman, R. (1961). Adaptive Control Process: A Guided Tour. Princeton University Press, New Jersey.
  3. Duda, R., P.Hart, and Stork, D. (2001). Pattern Classification. Jon Wiley and Sons, Inc, New York, 2nd edition.
  4. Fisher, R. (1936). The use of multiple measurements in taxonomic problems. Ann. Eugenics, 7:179-188.
  5. Fukunaga, K. and Mantock, J. (1983). Nonparametric discriminant analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 5(6):671-678.
  6. Guyon, I. and Elisseeff, A. (2003). An introduction to variable and feature selection. J. Mach. Learn. Res., 3:1157-1182.
  7. Kirby, M. and Sirovich, L. (1990). Application of the Karhunen-Loeve procedure for the characterization of human faces. IEEE Transactions on Pattern Analysis and Machine Intelligence, 12(1):103-108.
  8. Martinez, A. and Benavente, R. (1998). The AR Face database. Technical Report 24, Computer Vision Center.
  9. Torkkola, K. (2003). Feature extraction by non parametric mutual information maximization. J. Mach. Learn. Res., 3:1415-1438.
Download


Paper Citation


in Harvard Style

Lapedriza À., Masip D. and Vitrià J. (2008). SUBJECT RECOGNITION USING A NEW APPROACH FOR FEATURE SELECTION . In Proceedings of the Third International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2008) ISBN 978-989-8111-21-0, pages 61-66. DOI: 10.5220/0001079100610066


in Bibtex Style

@conference{visapp08,
author={Àgata Lapedriza and David Masip and Jordi Vitrià},
title={SUBJECT RECOGNITION USING A NEW APPROACH FOR FEATURE SELECTION},
booktitle={Proceedings of the Third International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2008)},
year={2008},
pages={61-66},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001079100610066},
isbn={978-989-8111-21-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2008)
TI - SUBJECT RECOGNITION USING A NEW APPROACH FOR FEATURE SELECTION
SN - 978-989-8111-21-0
AU - Lapedriza À.
AU - Masip D.
AU - Vitrià J.
PY - 2008
SP - 61
EP - 66
DO - 10.5220/0001079100610066