Multi-dimensional Pattern Mining - A Case Study in Healthcare

Andreia Silva, Cláudia Antunes

Abstract

Huge amounts of data are continuously being generated in the healthcare system. A correct and careful analysis of these data may bring huge benefits to all people and processes involved in the healthcare management. However, the characteristics of healthcare data do not make this job easy. These data are usually too complex, massive, with high dimensionality, and are irregularly distributed over time. In the last decade, data mining has begun to address this area, providing the technology and approaches to transform these complex data into useful information for decision support. Multi-relational data mining, in particular, has gained attention since it aims for the discovery of frequent relations that involve multiple dimensions. In this work we present a case study on the healthcare domain. Using the Hepatitis dataset, we show how that data can be modeled and explored in a multi-dimensional model, and we present and discuss the results of applying a multi-dimensional data mining algorithm to that model.

References

  1. Crestana-Jensen, V. and Soparkar, N. (2000). Frequent itemset counting across multiple tables. In Proc. of the 4th Pacific-Asia Conf. on Knowl. Discovery and Data Mining, pages 49-61, London. Springer.
  2. Dehaspe, L. and Raedt, L. D. (1997). Mining association rules in multiple relations. In ILP 97: Proc. of the 7th Intern. Workshop on Inductive Logic Programming, pages 125-132, London, UK. Springer.
  3. Dz?eroski, S. (2003). Multi-relational data mining: an introduction. SIGKDD Explor. Newsl., 5(1):1-16.
  4. Frawley, W. J., Piatetsky-Shapiro, G., and Matheus, C. J. (1992). Knowledge discovery in databases: an overview. AI Mag., 13(3):57-70.
  5. Kaur, H. and Wasan, S. (2006). Empirical study on applications of data mining techniques in healthcare. Journal of Computer Science, 2(2):194-200.
  6. Koh, H. and Tan, G. (2005). Data mining applications in healthcare. Journal of Healthcare Information Management, 19(2):64-71.
  7. Ng, E. K. K., Fu, A. W.-C., and Wang, K. (2002). Mining association rules from stars. In ICDM 02: Proc. of the 2002 IEEE Intern. Conf. on Data Mining, pages 322-329, Japan. IEEE.
  8. Pizzi, L., Ribeiro, M., and Vieira, M. (2005). Analysis of hepatitis dataset using multirelational association rules. In ECML/PKDD 2005 Discovery Challenge, Porto, Portugal.
  9. Silva, A. and Antunes, C. (2010). Pattern mining on stars with fp-growth. In MDAI 10: Proc. of the 7th Intern. Conf. on Modeling Decisions for Artificial Intelligence, pages 175-186, Perpignan, France. Springer.
  10. Silva, A. and Antunes, C. (2012). Finding patterns in large star schemas at the right aggregation level. In Proc. of the 9th Intern. Conf. on Modeling Decisions for Artificial Intelligence, pages 329-340, Spain. Springer.
  11. Srikant, R. (1996). Fast algorithms for mining association rules and sequential patterns. PhD thesis, University of Wisconsin, Madison. Supervisor-Jeffrey Naughton.
  12. Watanabe, T., Susuki, E., Yokoi, H., and Takabayashi, K. (2003). Application of prototypelines to chronic hepatitis data. In ECML/PKDD 2003 Discovery Challenge, Cavtat, Croatia.
  13. Xu, L.-J. and Xie, K.-L. (2006). A novel algorithm for frequent itemset mining in data warehouses. Journal of Zhejiang University - Science A, 7(2):216-224.
  14. Zhou, L. and Yau, S. (2007). Efficient association rule mining among both frequent and infrequent items. Computers and Mathematics with Applications, 54(6):737-749.
Download


Paper Citation


in Harvard Style

Silva A. and Antunes C. (2014). Multi-dimensional Pattern Mining - A Case Study in Healthcare . In Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-758-027-7, pages 273-280. DOI: 10.5220/0004898802730280


in Bibtex Style

@conference{iceis14,
author={Andreia Silva and Cláudia Antunes},
title={Multi-dimensional Pattern Mining - A Case Study in Healthcare},
booktitle={Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2014},
pages={273-280},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004898802730280},
isbn={978-989-758-027-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - Multi-dimensional Pattern Mining - A Case Study in Healthcare
SN - 978-989-758-027-7
AU - Silva A.
AU - Antunes C.
PY - 2014
SP - 273
EP - 280
DO - 10.5220/0004898802730280