Divide-n-Discover - Discretization based Data Exploration Framework for Healthcare Analytics

Si-Chi Chin, Kiyana Zolfaghar, Senjuti Basu Roy, Ankur Teredesai, Paul Amoroso

Abstract

Insightful and principled visualization techniques may successfully help complex clinical data exploration tasks and aid in the process of knowledge discovery. In this paper, we propose a framework Divide-n-Discover to visualize and explore clinical data effectively, and demonstrate its effectiveness in predicting readmission risk for Congestive Heart Failure patients. Our proposed method provides clinicians a mechanism to dynamically explore the data and to understand how a single factor may influence the risk of readmission for a given patient. For example, our study indicates that patients between age 47 and 48 have 2.63 time higher chance of getting readmitted to the hospital within 30 days, compared to other patients; likewise, patients with length of stay above 13 days are 2.27 times more likely to be readmitted within 30 days. The finding suggests that hospitals might be under pressure to discharge patients within two week while some patients may benefit from a longer stay. These observations may become valid hypotheses leading to further clinical investigation or discoveries. To the best of our knowledge, this is the first ever work that proposes principled discretization and visualization techniques in the hospital readmission risk prediction problem.

References

  1. Chin, S.-C., Street, W. N., and Teredesai, A. (2012). Discovering meaningful cut-points to predict high HbA1c variation. In Proc. 7th INFORMS Workshop on Data Mining and Health Informatics.
  2. Kansagara D, E. H. (2011). Risk prediction models for hospital readmission: A systematic review. JAMA, 306(15):1688-1698.
  3. Kerber, R. (1992). ChiMerge: discretization of numeric attributes. In Proceedings of the tenth national conference on Artificial intelligence, AAAI'92, pages 123- 128. AAAI Press.
  4. Krumholz, H. M., Normand, S. L. T., Keenan, P. S., Lin, Z. Q., Drye, E. E., Bhat, K. R., Wang, Y. F., Ross, J. S., Schuur, J. D., and Stauffer, B. D. (2008). Hospital 30-day heart failure readmission measure methodology. Report prepared for the Centers for Medicare & Medicaid Services.
  5. Liu, H., Hussain, F., Tan, C. L., and Dash, M. (2002). Discretization: An enabling technique. Data Mining and Knowledge Discovery, 6(4):393-423.
  6. Szumilas, M. (2010). Explaining odds ratios. Journal of the Canadian Academy of Child and Adolescent Psychiatry, 19(3):227-229.
  7. Zolfaghar, K., Agarwal, J., Sistla, D., Chin, S.-C., Roy, S. B., and Verbiest, N. (2013a). Risk-o-meter: an intelligent clinical risk calculator. In KDD, pages 1518- 1521.
  8. Zolfaghar, K., Meadem, N., Sistla, D., Chin, S.-C., Roy, S. B., Verbiest, N., and Teredesai, A. (2013b). Exploring preprocessing techniques for prediction of risk of readmission for congestive heart failure patients. In Data Mining and Healthcare Workshop.
  9. Zolfaghar, K., Meadem, N., Teredesai, A., Roy, S. B., Chin, S.-C., and Muckian, B. (2013c). Big data solutions for predicting risk-of-readmission for congestive heart failure patients. In IEEE Bigdata.
Download


Paper Citation


in Harvard Style

Chin S., Zolfaghar K., Basu Roy S., Teredesai A. and Amoroso P. (2014). Divide-n-Discover - Discretization based Data Exploration Framework for Healthcare Analytics . In Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2014) ISBN 978-989-758-010-9, pages 329-333. DOI: 10.5220/0004802403290333


in Bibtex Style

@conference{healthinf14,
author={Si-Chi Chin and Kiyana Zolfaghar and Senjuti Basu Roy and Ankur Teredesai and Paul Amoroso},
title={Divide-n-Discover - Discretization based Data Exploration Framework for Healthcare Analytics},
booktitle={Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2014)},
year={2014},
pages={329-333},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004802403290333},
isbn={978-989-758-010-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2014)
TI - Divide-n-Discover - Discretization based Data Exploration Framework for Healthcare Analytics
SN - 978-989-758-010-9
AU - Chin S.
AU - Zolfaghar K.
AU - Basu Roy S.
AU - Teredesai A.
AU - Amoroso P.
PY - 2014
SP - 329
EP - 333
DO - 10.5220/0004802403290333