Advanced Learning Techniques for Chemometric Modelling

Carlos Cernuda, Edwin Lughofer, Erich Peter Klement

2013

Abstract

The European chemical industry is the world leader in its field. 8 out of the 15 largest chemical companies are EU based. Furthermore, 29 % of the worldwide chemical sales originate from the EU. These industries face future challenges such as rising costs and scarcity of raw materials, an increase in the price of energy, and an intensified competition from Asian countries. Process Analytical Chemistry represents one of the most significant developments in chemical and process engineering over the past decade. Chemical information is of increasing importance in today's chemical industry. It is required for efficient process development, scale-up and production. It is used to assure product quality and compliance with regulations that govern chemical production processes. If reliable analytical information on the chemical process under investigation is available, adjustments and actions can be undertaken immediately in order to assure maximum yield and product quality while minimizing energy consumption and waste production. As a consequence, chemical information has a direct impact on the productivity and thus competitiveness, and on the environmental issues of the respective industries. Chemometrics is the application of mathematical or statistical methods to chemical data. The International Chemometrics Society (ICS) offers the following definition: “Chemometrics is the science of relating measurements made on a chemical system or process to the state of the system via application of mathematical or statistical methods”. Chemometric research spans a wide area of different methods which can be applied in chemistry. There are techniques for collecting good data (optimization of experimental parameters, design of experiments, calibration, signal processing) and for getting information from these data (statistics, pattern recognition, modeling, structure-property-relationship estimations). In this extense list of tasks, we are focused on calibration. Calibration consists on stablishing relationships, i.e. chemometric models, between some instrumental response and chemical concentrations. The usual instrumental responses come from the use of spectrometers, because they allow us to get a lot of on-line cheap data in a non-destructive way. There are two types of calibration, univariate or multivariate calibration, depending on the use of only a single predictor variable or several ones. The current instalations provide us with thousands of variables and thousands of samples, thus more and more new sophisticated techniques, which are capable to handle and take advantage of this tsunami of data, are required. Our goal is to provide the analytical chemistry community with modern and sophisticated tools in order to overcome the incoming future challenges.

References

  1. Abraham, W., Robins, A., 2005. Memory retention-the synaptic stability vs plasticity dilemma, Trends in Neurosciences 28 (2) 73-78.
  2. Backer, S. D., Scheunders, P., 2001. Texture segmentation by frequency-sensitive elliptical competitive learning, Image and Vision Computing 19 (9-10) 639-648.
  3. Cernuda, C., Lughofer, E., Maerzinger, M., Kasberger, J., 2011. Nir-based quantification of process parameters in polyetheracrylat (pea) production using flexible non-linear fuzzy systems, Chemometrics and Intelligent Laboratory Systems 109 (1) 22-33.
  4. Cernuda, C., Lughofer, E., Suppan, L., Roeder, T., Schmuck, R., Hintenaus, P., Maerzinger, W., Kasberger, J., 2012. Evolving chemometric models for predicting dynamic process parameters in viscose production, Analytica Chimica Acta, 725, 22-38.
  5. Cernuda, C., Lughofer, E., Maerzinger, M., Summerer, W., 2012. Waveband selection in NIR spectra using enhanced genetic operators, Chemometrics for Analytical Chemistry 2012, Budapest, CIO-4.
  6. Cernuda, C., Lughofer, E., Hintenaus, P., Maerzinger, W., Reischer, T., Pawliczek, M., Kasberger, J., 2013. Hybrid adaptive calibration methods and ensemble strategy for prediction of cloud point in melamine resin production, Chemometrics and Intelligent Laboratory Systems, volume 126, pp. 60 - 75.
  7. Cernuda, C., Lughofer, E., Mayr, G., Roeder, T., Hintenaus, P., Maerzinger, W., 2013. Decremental active learning for optimized self-adaptive calibration in viscose production, 13th Scandinavian Symposium in Chemometrics, Stockholm.
  8. Cernuda, C., Lughofer, E., Hintenaus, P., Maerzinger, W., 2013. Enhanced Genetic Operators Design for Waveband Selection in Multivariate Calibration by NIR Spectroscopy, Journal of Chemometrics, (submitted).
  9. Cleveland, W., Devlin, S., 1988. Locally weighted regression: an approach to regression analysis by local fitting, Journal of the American Statistical Association 84 (403) 596-610.
  10. Draper, N., Smith, H., 1998. Applied regression analysis, Wiley Interscience, Hoboken, NJ.
  11. Gray, R., 1984. Vector quantization, IEEE ASSP Magazine 1 (2) (1984) 4-29.
  12. Haenlein, M., Kaplan, A., 2004. A beginner's guide to partial least squares (PLS) analysis, Understanding Statistics 3 (4) (2004) 283-297.
  13. Hamker, F., 2001. RBF learning in a non-stationary environment: the stability-plasticity dilemma, in: R. Howlett, L. Jain (Eds.), Radial Basis Function Networks 1: Recent Developments in Theory and Applications, Physica Verlag, Heidel- berg, New York, pp. 219-251.
  14. Hastie, T., Tibshirani, R., Friedman, J., 2007. Pathwise coordinate optimization, The Annals of Applied Statistics 1 (2) 302-332.
  15. Hastie, T., Tibshirani, R., Friedman, J., 2010. Regularized paths for generalized lin- ear models via coordinate descent, Journal of Statistical Software 33 (1).
  16. Haykin, S., 1999. Neural Networks: A Comprehensive Foundation, Prentice Hall.
  17. Jolliffe, I., 2002. Principal Component Analysis, Springer Verlag, Berlin Heidelberg New York.
  18. Klement, E., Mesiar, R., Pap, E., 2000. Triangular Norms, Kluwer Academic Publishers, Dordrecht, Norwell, New York, London.
  19. Ljung, L., 1999. System Identification: Theory for the User, Prentice Hall PTR, Prentice Hall Inc, Upper Saddle River, NJ.
  20. Lughofer, E., 2008. FLEXFIS: a robust incremental learning approach for evolving TS fuzzy models, IEEE Transactions on Fuzzy Systems 16 (6) 1393-1410.
  21. Lughofer, E., Kindermann, S., 2008. Improving the robustness of data-driven fuzzy systems with regularization, in: Proceedings of the IEEE World Congress on Computational Intelligence (WCCI) 2008, Hongkong, pp. 703-709.
  22. Lughofer, E., 2011. On-line incremental feature weighting in evolving fuzzy classifiers, Fuzzy Sets and Systems 163 (1) 1-23.
  23. Lughofer, E., Hüllermeier, E., 2011. On-line redundancy elimination in evolving fuzzy regression models using a fuzzy inclusion measure, in: Proceedings of the EUSFLAT 2011 Conference, Elsevier, Aix-Les-Bains, France, pp. 380-387.
  24. Lughofer, E., Bouchot, J.-L., Shaker, A. 2011, On-line elimination of local redundancies in evolving fuzzy systems, Evolving Systems, vol. 2 (3), pp. 165-187.
  25. Lughofer, E., Angelov, P., 2011. Handling drifts and shifts in on-line data streams with evolving fuzzy systems, Applied Soft Computing, vol. 11, pp. 2057-2068.
  26. Miller, J., Miller, J., 2009. Statistics and Chemometrics for Analytical Chemistry, Prentice Hall, Essex, England.
  27. Qin, S., Li, W., Yue, H., 2000. Recursive PCA for adaptive process monitoring, Journal of Process Control 10 (5) 471-486.
  28. Reeves, J., Delwiche, S., 2003. Partial least squares regression for analysis of spectroscopic data, Journal of Near Infrared Spectroscopy 11 (6) 415-431.
  29. Ros, L., Sabater, A., Thomas, F., 2002. An ellipsoidal calculus based on propagation and fusion, IEEE Transactions on Systems, Man and Cybernetics-Part B: Cybernetics 32 (4) 430-442.
  30. Shao, X., Bian, X., Cai, W., 2010. An improved boosting partial leastsquares method for near-infrared spectroscopic quantitative analysis, Analytica Chimica Acta 666 (1-2) 32-37.
  31. Takagi, T., Sugeno, M., 1985. Fuzzy identification of systems and its applications to modeling and control, IEEE Transactions on Systems, Man and Cybernetics 15 (1) 116-132.
  32. Vaira, S., Mantovani, V.E., Robles, J., Sanchis, J.C., Goicoechea, H., 1999. Use of chemometrics: principal component analysis (PCA) and principal component regression (PCR) for the authentication of orange juice, Analytical Letters 32 (15) 3131-3141.
  33. Widmer, G., Kubat, M., 1996. Learning in the presence of concept drift and hidden contexts, Machine Learning 23 (1) 69-101.
Download


Paper Citation


in Harvard Style

Cernuda C., Lughofer E. and Klement E. (2013). Advanced Learning Techniques for Chemometric Modelling . In Doctoral Consortium - Doctoral Consortium, (IJCCI 2013) ISBN Not Available, pages 19-28


in Bibtex Style

@conference{doctoral consortium13,
author={Carlos Cernuda and Edwin Lughofer and Erich Peter Klement},
title={Advanced Learning Techniques for Chemometric Modelling},
booktitle={Doctoral Consortium - Doctoral Consortium, (IJCCI 2013)},
year={2013},
pages={19-28},
publisher={SciTePress},
organization={INSTICC},
doi={},
isbn={Not Available},
}


in EndNote Style

TY - CONF
JO - Doctoral Consortium - Doctoral Consortium, (IJCCI 2013)
TI - Advanced Learning Techniques for Chemometric Modelling
SN - Not Available
AU - Cernuda C.
AU - Lughofer E.
AU - Klement E.
PY - 2013
SP - 19
EP - 28
DO -