A Study about Discovery of Critical Food Consumption Patterns Linked with Lifestyle Diseases using Data Mining Methods
Farshideh Einsele, Leila Sadeghi, Rolf Ingold, Helena Jenzer
2015
Abstract
Background: To date, the analysis of the implications of dietary patterns on lifestyle diseases is based on data coming either from clinical studies or food surveys, both comprised of a limited number of participants. This article demonstrates that linking big data from a grocery store sales database with demographical and health data by using data mining tools such as classification and association rules is a powerful way to determine if a specific population subgroup is at particular risk for developing a lifestyle disease based on its food consumption patterns. Objective: The objective of the study was to link big data from grocery store sales with demographic and health data to discover critical food consumption patterns linked with lifestyle diseases known to be strongly tied with food consumption. Design: Food consumption databases from a publicly available grocery store database dating from 1997–1998 were gathered along with corresponding demographics and health data from the U. S. west coast, pre-processed, cleaned and finally integrated to a unique database. Results: This study applied data mining techniques such as classification and association mining analysis. Firstly, the studied population was classified according to the demographical information “ age groups” and “race” and data for lifestyle diseases were correspondingly attributed. Secondly, association mining analysis was used to incorporate rules about food consumption and lifestyle diseases. A set of promising preliminary rules and their corresponding interpretation was generated and reported in the present paper. Conclusions: Association mining rules were successfully used to describe and predict rules linking food consumption patterns with lifestyle diseases. In the selected grocery store database, information about interesting aspects of the grocery store customers were found such as marital status, educational background, profession and number of children at home. An in-depth research on these attributes is needed to further expand the present demographical database. Since the search on the internet for demographical attributes back to the year of 2000 corresponding to the studied population subgroup was extremely laborious, the selected demographical attributes to prove the feasibility of the study were limited to age groups and race.
References
- WHO, World Health Organization Geneva 2003, Diet, Nutrition and the Prevention of Chronic Diseases, Report of a Joint WHO/FAO Expert Consultation.
- I. P Hearty and M. J Gibney, 2008, Analysis of meal patterns with the use of supervised data mining techniques artificial neural networks and decision trees, 88:1632-42. American Society for Nutrition.
- M. Sulaiman Khan, Maybin Muyeba, Frans Coenen, 2008, On Exraction of Nutritonal Patterns (NPS) using Fuzzy Association Rule Mining, Healthinf 2008.
- R. Agrawal and R. Srikant, 1996, Quest Synthetic Data Generator, IBM Almaden Research Center.
- L. Manikonda, R. Mall, V. Pudi, 2011, "Mining Nutrition Survey Data", SSCI 2011, CIDM 2011, Paris, France.
- J.D. Kinsey, P. Wolfson, N. Katsaras, B. Senauer, 2001, Data mining A segmentation analysis of US grocery shoppers, Working paper (Univer -sity of Minnesota. Retail Food Industry Center), 01-01.
- S. Kumar, V. Bishnoi, 2011, Indian Consumer Food Shopping Behaviour and their Choice & Preference for Packaged Food and Food Retailers, an Exploratory Study, Proceedings for 2011 International Research Conference And Colloquium, Contemporary Research Issues and Challenges in Emerging Economies.
- J. M. Harris and N. Blisard, 2002, Food -Con-sumption Patterns among Elderly Age Groups.
- N. Habib, S. Inam, S. Batool, S. Naheed and S. Siddiqui, 2013, Nutritional Pattern and its Impact on the Health: A Case Study of Tehsil Kot Addu, Punjab, Pakistan, International Journal of Humanities and Social Science, Vol. 3 No. 10,Special Issue, May 2013.
- Harris Polls, 2010, http://www.harrisinteractive.com /NewsRoom/ HarrisPolls/tabid/447/ctl/ ReadCustom%20Default/mid/1508/ArticleId/614/Defa ult.aspx.
- RecSysWiki, 2012, http://recsyswiki.com/wiki/ Grocery_ shopping_datasets.
- US Census, 2000, http://www.census.gov/ces/ dataproducts/ demographicdata.html.
- http://sandiegohealth.org/disease/diabetes/ diabetes2001.pdf.
- http://www.cdph.ca.gov/pubsforms/Pubs/ OHIRmentalhealthCareCA2001.pdf.
- http://adai.washington.edu/pubs/ infobriefs/ ADAI-IB2004- 06.pdf.
- https://fortress.wa.gov/doh/wscr/WSCR/PDF/02REPORT/ CancerByCounty02.pdf.
- http://www.doh.wa.gov/portals/1/ Documents/ Pubs/345- 271- ChronicDisease ProfileSpokane.pdf.
- http://public.health.oregon.gov/DiseasesConditions/Chronic Disease/HeartDiseaseStroke/ Documents/2006Heart DiseaseRpt.pdf
- C. JL Murray & S. S Lim, Control of hypertension with medication: a comparative analysis of national surveys in 20 countries, Bulletin of the World Health Organization, 2014; 92:10-19C
- Statistics 2004 Incidence and Mortality
- www.lapublichealth.org, County of Los Angeles, Department of health, Obesity on the Rise, July 2003
- L. A. Chaput, The Burden of of Cardiovascular Disease in California, July 2007
- L.A.Health, Physical Activity Among Adults in Los Angeles County, November 2000, www.lapublichealth.org
- H. Lee, Obesity Among Racial and Ethnic Differences, Copyright © 2006 by Public Policy Institute of California
- EFSA, 2014, http://www.efsa.europa.eu/en/datex/ datexfoodclass.htm.
Paper Citation
in Harvard Style
Einsele F., Sadeghi L., Ingold R. and Jenzer H. (2015). A Study about Discovery of Critical Food Consumption Patterns Linked with Lifestyle Diseases using Data Mining Methods . In Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2015) ISBN 978-989-758-068-0, pages 239-245. DOI: 10.5220/0005170402390245
in Bibtex Style
@conference{healthinf15,
author={Farshideh Einsele and Leila Sadeghi and Rolf Ingold and Helena Jenzer},
title={A Study about Discovery of Critical Food Consumption Patterns Linked with Lifestyle Diseases using Data Mining Methods},
booktitle={Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2015)},
year={2015},
pages={239-245},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005170402390245},
isbn={978-989-758-068-0},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2015)
TI - A Study about Discovery of Critical Food Consumption Patterns Linked with Lifestyle Diseases using Data Mining Methods
SN - 978-989-758-068-0
AU - Einsele F.
AU - Sadeghi L.
AU - Ingold R.
AU - Jenzer H.
PY - 2015
SP - 239
EP - 245
DO - 10.5220/0005170402390245