MINING SCIENTIFIC RESULTS THROUGH THE COMBINED USE OF CLUSTERING AND LINEAR PROGRAMMING TECHNIQUES

Andrea Tagarelli, Irina Trubitsyna, Sergio Greco

2004

Abstract

The paper proposes a technique based on a combined approach of data mining algorithms and linear programming methods for classifying organizational units, such as research centers. We exploit clustering algorithms for grouping information concerning the scientific activity of research centers. We also show that the replacement of an expensive efficiency measurement, based on the solution of linear programs, with a simple formula allows clusters of very good quality to be computed efficiently. Some initial experimental results, obtained from an analysis of research centers in the agro-food sector, show the effectiveness of our approach, both from an efficiency and a quality-of-results point of view.

References

  1. Baeza-Yates, R. and Ribeiro-Neto, B. (1999). Modern Information Retrieval. ACM Press Books, Addison Wesley.
  2. Charnes, A., Cooper, W. W., Lewin, A. Y., and Seiford, L. M. (1994). Data Envelopment Analysis: Theory, Methodology and Applications. Kluwer Academic Publishers.
  3. Charnes, A., Cooper, W. W., and Rhodes, E. (1978). Measuring the ef ciency of decision making units. European Journal of Operational Research 2, 429-444.
  4. Chung, S. H., Yang, Y. S., and Wu, T. -H. (2000). Evaluating the Ef ciency of University via DEA approach. In Proc. 5th Annual Int. Conf. on Industrial Engineering Theory, Applications and Practice.
  5. Emrouznejad, A. (2001). An Extensive Bibliography of Data Envelopment Analysis. Tech. Rep., Business School, Univ. of Warwick,
  6. Farrell, M. J. (1957). The measurements of productive ef - ciency. J.R. Statis Soc., Series A 120, 253-281.
  7. Farrell, M. J., and Fieldhouse, M. (1962). Estimating ef - cient production functions under increasing returns to scale. J.R. Statis Soc., Series A 125, 252-267.
  8. Galante, E., Sala, C., and Lanini, I. (1998). Valutazione della ricerca agricola, Franco Angeli (ed.), Milano.
  9. Halkidi, M., Batistakis, Y., and Vazirgiannis, M. (2002). Cluster Validity Methods: Sigmod Record 31(2), 40- 45.
  10. Jain, A. K., and Dubes, R. C. (1988). Algorithms for Clustering Data. Prentice-Hall advanced reference series.
  11. Okubo, Y. (1997). Bibliometric indicators and analysis of research systems: Methods and examples. OECD, WP#1.
  12. Schubert, A., Glaenzel, W., and Braun, T. (1988). Against Absolute Methods: Relative Scientometrics Indicators and Relational Charts as Evaluation Tools. Handbook of Quantitative Studies of Science and Technology, Van Ran A. F. J. (ed.), North-Holland, Amsterdam.
  13. Sohn, S. Y., and Choi, H. (2001). Ensemble Based on Data Envelopment Analysis. In Proc. Aspects of Data Mining, Decision Support and Meta-Learning, 129-137.
  14. Stern, Z. S., Mehrez, A., and Barboy, A. (1994). Academic departments ef ciency via DEA. Computers and Operations Research 21(5), 543-556.
  15. Thanassoulis, E., Dyson, R. G., and Foster, M. J. (1987). Relative Ef ciency Assessments using Data Envelopment Analysis: an Application to Data on Rates Departments. J. Opl. Res. Soc. 38, 397-412.
  16. Viveros, M. S., Nearhos, J. P., and Rothman, M. J. (1996). Applying Data Mining Techniques to a Health Insurance Information System. In 22th VLDB Conf., 286- 294.
  17. Zhu, J. (2002). Quantitative Models for Performance Evaluation and Benchmarking: Data Envelopment Analysis with Spreadsheets and DEA Excel Solver. Kluwer Academic Publishers, Boston.
Download


Paper Citation


in Harvard Style

Tagarelli A., Trubitsyna I. and Greco S. (2004). MINING SCIENTIFIC RESULTS THROUGH THE COMBINED USE OF CLUSTERING AND LINEAR PROGRAMMING TECHNIQUES . In Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 972-8865-00-7, pages 84-91. DOI: 10.5220/0002624000840091


in Bibtex Style

@conference{iceis04,
author={Andrea Tagarelli and Irina Trubitsyna and Sergio Greco},
title={MINING SCIENTIFIC RESULTS THROUGH THE COMBINED USE OF CLUSTERING AND LINEAR PROGRAMMING TECHNIQUES},
booktitle={Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS,},
year={2004},
pages={84-91},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002624000840091},
isbn={972-8865-00-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS,
TI - MINING SCIENTIFIC RESULTS THROUGH THE COMBINED USE OF CLUSTERING AND LINEAR PROGRAMMING TECHNIQUES
SN - 972-8865-00-7
AU - Tagarelli A.
AU - Trubitsyna I.
AU - Greco S.
PY - 2004
SP - 84
EP - 91
DO - 10.5220/0002624000840091