Green Data Science - Using Big Data in an “Environmentally Friendly” Manner

Wil Van Der Aalst

Abstract

The widespread use of “Big Data” is heavily impacting organizations and individuals for which these data are collected. Sophisticated data science techniques aim to extract as much value from data as possible. Powerful mixtures of Big Data and analytics are rapidly changing the way we do business, socialize, conduct research, and govern society. Big Data is considered as the “new oil” and data science aims to transform this into new forms of “energy”: insights, diagnostics, predictions, and automated decisions. However, the process of transforming “new oil” (data) into “new energy” (analytics) may negatively impact citizens, patients, customers, and employees. Systematic discrimination based on data, invasions of privacy, non-transparent life-changing decisions, and inaccurate conclusions illustrate that data science techniques may lead to new forms of “pollution”. We use the term “Green Data Science” for technological solutions that enable individuals, organizations and society to reap the benefits from the widespread availability of data while ensuring fairness, confidentiality, accuracy, and transparency. To illustrate the scientific challenges related to “Green Data Science”, we focus on process mining as a concrete example. Recent breakthroughs in process mining resulted in powerful techniques to discover the real processes, to detect deviations from normative process models, and to analyze bottlenecks and waste. Therefore, this paper poses the question: How to benefit from process mining while avoiding “pollutions” related to unfairness, undesired disclosures, inaccuracies, and non-transparency?

Download


Paper Citation


in Harvard Style

Van Der Aalst W. (2016). Green Data Science - Using Big Data in an “Environmentally Friendly” Manner.In Proceedings of the 18th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-758-187-8, pages 9-21. DOI: 10.5220/0006806900010001


in Bibtex Style

@conference{iceis16,
author={Wil Van Der Aalst},
title={Green Data Science - Using Big Data in an “Environmentally Friendly” Manner},
booktitle={Proceedings of the 18th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2016},
pages={9-21},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006806900010001},
isbn={978-989-758-187-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 18th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - Green Data Science - Using Big Data in an “Environmentally Friendly” Manner
SN - 978-989-758-187-8
AU - Van Der Aalst W.
PY - 2016
SP - 9
EP - 21
DO - 10.5220/0006806900010001