Predicting the Efficiency with Knowledge Discovery of a Budgeted Company: A Cuban University - Validation through Three Semesters

Libia I. García, Isel Grau, Ricardo Grau

2012

Abstract

The efficiency analysis of a company cannot be reduced to a great number of statistical tables, despite the reliability of them. It has been shown to be a better idea to seek the “essence” using Knowledge Discovery (KD) techniques. In this paper, a simple methodology to apply KD in the efficiency analysis of a budgeted company is presented. These analyses complement those from classical OLAP and Interactive Graphics. Specifically, it is shown how to use in three steps: univariate analysis, non-supervised and supervised multi-variate machine learning techniques in order to support the decision making. All these procedures are illus-trated using the SIGENU database (in Spanish: Sistema de Gestión de la Nueva Universidad) of UCLV stu-dents and the efficiency measure was the student graduation on time or not. The presented methodology was elaborated in 2009 and it has been preliminary validated during three semesters of the years 2010 to 2012.

References

  1. Bacher, J., et al., 2004. SPSS Two Step Cluster. A First Evaluation. In RC33 6th International Conference on Social Science Methodology, Amsterdam.
  2. Cramer, 2011. http://changingminds.org/explanations /research/analysis/cramers_v.htm, consulted Oct. 2011.
  3. Decision Trees, 2011. Learning to Classify with Branching Test, http://www.aaai.org/AITopics/pmwiki /pmwiki.php/AITopics/, consulted Oct. 2011 Fawcett, T., 2004. ROC Graphs: Notes and Practical Considerations for Data Mining Researchers. Kluwer Academic Publishers. Netherland.
  4. Forbrig, P., Günther, H., 2010. Perspectives in Business Informatics Research. In Referred proceedings of BIR 2010. Lecture Notes on Business Information Processing, 64, Rostock, Germany.
  5. Grau, R. et al., 2004. Metodología de la Investigación. Editorial EL POIRA S.A. Ibagué, Colombia. 2nd Edition.
  6. Meulman, J. J. et al., 2002. New features of Categorical Principal Components Analysis for Complicated Date Sets, Including Data Mining. In Gaul, W., Ritter, G. (eds.) Proceedings of the 24 annual conference of the Geselschaft für Classifikation V. 24. Part 3: Data Mining, Information Processing and Automation, Springer, pp. 207-218.
  7. Nuñez, I. et al., 2006. Two-Step Cluster Procedure after Principal Component Analysis Identifies Sperm Subpopulations in Canine Ejaculates and Its Relation to Cryoresistance. In J. Androl., 27 (4) p. 596 Prentice, R. L, Farewell, V. T., 1986. Relative Risk and Odd Ratio Regression. In Annual review of public health. 7, 35-58.
Download


Paper Citation


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012)
TI - Predicting the Efficiency with Knowledge Discovery of a Budgeted Company: A Cuban University - Validation through Three Semesters
SN - 978-989-8565-29-7
AU - García L.
AU - Grau I.
AU - Grau R.
PY - 2012
SP - 315
EP - 318
DO - 10.5220/0004106603150318


in Harvard Style

García L., Grau I. and Grau R. (2012). Predicting the Efficiency with Knowledge Discovery of a Budgeted Company: A Cuban University - Validation through Three Semesters . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012) ISBN 978-989-8565-29-7, pages 315-318. DOI: 10.5220/0004106603150318


in Bibtex Style

@conference{kdir12,
author={Libia I. García and Isel Grau and Ricardo Grau},
title={Predicting the Efficiency with Knowledge Discovery of a Budgeted Company: A Cuban University - Validation through Three Semesters},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012)},
year={2012},
pages={315-318},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004106603150318},
isbn={978-989-8565-29-7},
}