Speaker State Recognition: Feature Selection Method based on Self-adjusting Multi-criteria Evolutionary Algorithms

Roman Sergienko, Elena Loseva

Abstract

In supervised learning scenarios, a number of methods exist for feature selection in automatic speaker state analysis, and many of them achieve reasonable results. Feature selection in unsupervised learning scenarios is a harder problem because there are no class labels to guide the search for relevant information. Supervised feature selection methods are “wrapper” techniques that require a learning algorithm to evaluate candidate feature subsets; unsupervised feature selection methods are “filters” that are independent of any learning algorithm. The two kinds of method are usually applied separately. In this paper, we propose a method that works in supervised and unsupervised forms simultaneously. It is based on a multi-criteria evolutionary procedure that consists of two stages: a self-adjusting multi-criteria genetic algorithm and self-adjusting multi-criteria genetic programming. The proposed approach was compared with other feature selection methods on four audio corpora for speaker emotion recognition and speaker gender identification. The results show that the developed technique increases accuracy by up to 46.5% for emotion recognition and by up to 20.5% for gender identification.
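The idea of scoring one feature subset with a wrapper criterion and a filter criterion at the same time can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not name the criteria, so leave-one-out 1-nearest-neighbour accuracy (wrapper) and mean feature variance (filter) are assumed stand-ins, the evolutionary search is replaced by enumerating all binary masks, and every function name and the toy data are hypothetical.

```python
from itertools import product
from statistics import pvariance


def wrapper_score(mask, X, y):
    """Supervised stand-in criterion: leave-one-out 1-nearest-neighbour accuracy."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0
    hits = 0
    for i, row in enumerate(X):
        # Nearest other sample by squared Euclidean distance on the selected columns.
        nearest = min(
            (k for k in range(len(X)) if k != i),
            key=lambda k: sum((row[j] - X[k][j]) ** 2 for j in cols),
        )
        hits += y[nearest] == y[i]
    return hits / len(X)


def filter_score(mask, X):
    """Unsupervised stand-in criterion: mean variance of the selected columns (labels unused)."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0
    return sum(pvariance([row[j] for row in X]) for j in cols) / len(cols)


def dominates(a, b):
    """True if score tuple a Pareto-dominates b (all criteria are maximised)."""
    return all(p >= q for p, q in zip(a, b)) and any(p > q for p, q in zip(a, b))


def pareto_front(masks, X, y):
    """Return the masks whose (wrapper, filter) score pairs are not dominated."""
    scored = [(m, (wrapper_score(m, X, y), filter_score(m, X))) for m in masks]
    return [m for m, s in scored if not any(dominates(t, s) for _, t in scored)]


# Hypothetical toy data: feature 0 separates the classes, feature 1 is constant,
# feature 2 has high variance but is unrelated to the labels.
X = [[0.0, 1.0, 5.0], [0.1, 1.0, -5.0], [1.0, 1.0, 5.0], [1.1, 1.0, -5.0]]
y = [0, 0, 1, 1]
masks = [m for m in product([0, 1], repeat=3) if any(m)]
front = pareto_front(masks, X, y)
print(front)
```

On this toy data the two criteria disagree, so the non-dominated set keeps more than one mask. A multi-criteria evolutionary algorithm would search for such a set instead of enumerating every mask, which becomes infeasible for realistic feature counts.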



Paper Citation


in Harvard Style

Sergienko R. and Loseva E. (2016). Speaker State Recognition: Feature Selection Method based on Self-adjusting Multi-criteria Evolutionary Algorithms. In Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO, ISBN 978-989-758-198-4, pages 123-129. DOI: 10.5220/0005946801230129


in Bibtex Style

@conference{icinco16,
author={Roman Sergienko and Elena Loseva},
title={Speaker State Recognition: Feature Selection Method based on Self-adjusting Multi-criteria Evolutionary Algorithms},
booktitle={Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO},
year={2016},
pages={123-129},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005946801230129},
isbn={978-989-758-198-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO
TI - Speaker State Recognition: Feature Selection Method based on Self-adjusting Multi-criteria Evolutionary Algorithms
SN - 978-989-758-198-4
AU - Sergienko R.
AU - Loseva E.
PY - 2016
SP - 123
EP - 129
DO - 10.5220/0005946801230129