loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Marco Lattuada 1 ; Eugenio Gianniti 1 ; Marjan Hosseini 1 ; Danilo Ardagna 1 ; Alexandre Maros 2 ; Fabricio Murai 2 ; Ana Paula Couto da Silva 2 and Jussara M. Almeida 2

Affiliations: 1 Politecnico di Milano and Italy ; 2 Universidade Federal de Minas Gerais and Brazil

Keyword(s): Spark, Big Data, Machine Learning.

Abstract: Big data applications are among the most suitable applications to be executed on cluster resources because of their high requirements of computational power and data storage. Correctly sizing the resources devoted to their execution does not guarantee they will be executed as expected. Nevertheless, their execution can be affected by perturbations which can change the expected execution time. Identifying when these types of issue occurred by comparing their actual execution time with the expected one is mandatory to identify potentially critical situations and to take the appropriate steps to prevent them. To fulfill this objective, accurate estimates are necessary. In this paper, machine learning techniques coupled with a posteriori knowledge are exploited to build performance estimation models. Experimental results show how the models built with the proposed approach are able to outperform a reference state-of-the-art method (i.e., Ernest method), reducing in some scenarios the err or from the 221.09-167.07% to 13.15-30.58%. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.17.166.157

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Lattuada, M.; Gianniti, E.; Hosseini, M.; Ardagna, D.; Maros, A.; Murai, F.; Couto da Silva, A. and Almeida, J. (2019). Gray-Box Models for Performance Assessment of Spark Applications. In Proceedings of the 9th International Conference on Cloud Computing and Services Science - IWFCC; ISBN 978-989-758-365-0; ISSN 2184-5042, SciTePress, pages 609-618. DOI: 10.5220/0007877806090618

@conference{iwfcc19,
author={Marco Lattuada. and Eugenio Gianniti. and Marjan Hosseini. and Danilo Ardagna. and Alexandre Maros. and Fabricio Murai. and Ana Paula {Couto da Silva}. and Jussara M. Almeida.},
title={Gray-Box Models for Performance Assessment of Spark Applications},
booktitle={Proceedings of the 9th International Conference on Cloud Computing and Services Science - IWFCC},
year={2019},
pages={609-618},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007877806090618},
isbn={978-989-758-365-0},
issn={2184-5042},
}

TY - CONF

JO - Proceedings of the 9th International Conference on Cloud Computing and Services Science - IWFCC
TI - Gray-Box Models for Performance Assessment of Spark Applications
SN - 978-989-758-365-0
IS - 2184-5042
AU - Lattuada, M.
AU - Gianniti, E.
AU - Hosseini, M.
AU - Ardagna, D.
AU - Maros, A.
AU - Murai, F.
AU - Couto da Silva, A.
AU - Almeida, J.
PY - 2019
SP - 609
EP - 618
DO - 10.5220/0007877806090618
PB - SciTePress