Realization of a Machine Learning Domain Specific Modeling Language: A Baseball Analytics Case Study

Kaan Koseler, Kelsea McGraw, Matthew Stephan

Abstract

Accompanying the Big Data (BD) paradigm is a resurgence in machine learning (ML). Using ML techniques to work with BD is a complex task, requiring specialized knowledge of the problem space, domain specific concepts, and appropriate ML approaches. However, specialists who possess that knowledge and programming ability are difficult to find and expensive to train. Model-Driven Engineering (MDE) allows developers to implement quality software through modeling using high-level domain specific concepts. In this research, we attempt to fill the gap between MDE and the industrial need for development of ML software by demonstrating the plausibility of applying MDE to BD. Specifically, we apply MDE to the thriving industry of professional baseball analytics. Our case study involves developing an MDE solution for the binary classification problem of predicting whether a baseball pitch will be a fastball. We employ and refine an existing, but untested, ML Domain-Specific Modeling Language (DSML); devise model instances representing prediction features; create a code generation scheme; and evaluate our solution. We show our MDE solution is comparable to the one developed through traditional programming, distribute all our artifacts for public use and extension, and discuss the impact of our work and lessons we learned.

Paper Citation


in Harvard Style

Koseler K., McGraw K. and Stephan M. (2019). Realization of a Machine Learning Domain Specific Modeling Language: A Baseball Analytics Case Study. In Proceedings of the 7th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD, ISBN 978-989-758-358-2, pages 15-26. DOI: 10.5220/0007245800150026


in Bibtex Style

@conference{modelsward19,
author={Kaan Koseler and Kelsea McGraw and Matthew Stephan},
title={Realization of a Machine Learning Domain Specific Modeling Language: A Baseball Analytics Case Study},
booktitle={Proceedings of the 7th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD},
year={2019},
pages={15-26},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007245800150026},
isbn={978-989-758-358-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 7th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD
TI - Realization of a Machine Learning Domain Specific Modeling Language: A Baseball Analytics Case Study
SN - 978-989-758-358-2
AU - Koseler K.
AU - McGraw K.
AU - Stephan M.
PY - 2019
SP - 15
EP - 26
DO - 10.5220/0007245800150026