ing application will help users to improve the perfor-
mance and the time they pay for using the cloud in-
frastructure. Also, by optimizing the use of an infras-
tructure we will be offering more resources for other
users.
The reason why we present this methodology as
a new contribution is that it helps cloud computing
users take advantage of the exclusive feature of the
cloud, like elasticity, freedom for sizing the virtual
machines, the shorter time that would take for a sys-
tem administrator review his cluster sizing and per-
form changes on it.
This methodology could be used in other dis-
tributed environments, but it is in the cloud that a user
can really play with several configurations of virtual
machines.
We want to present this methodology for applica-
tions running in the cloud as general as we can, so
that any user can take advantage of it. The future
work indicated in the previous section will confirm
(or not) if this methodology is general enough to have
the strength to help defining a mathematical model for
MapReduce applications, and hopefully some other
cloud applications as well.
ACKNOWLEDGEMENTS
This work was made with the support of the Programa
Estudantes-Convˆenio de P´os-Graduac¸˜ao PEC-PG, of
CAPES/CNPq - Brazil.
This position paper was partially funded by Ed-
ital FAPERGS-CNPq n. 008/2009 Research
Project: GREEN-GRID: Sustainable High Perfor-
mance Computing, Edital MCT/CNPq No 70/2009
PGAESTMCT/CNPq
Also, we would like to thank Professor Arnaud
Legrand from the CNRS, Centre National de la
Recherche Scientifique, and the University of Greno-
ble in France for his help and guidance in simplifying
the methodology presented in this work.
And a final acknowledgement to the special per-
son who inspired this vision of Impressionism in
Cloud Computing a few years ago by explaining the
art movement.
REFERENCES
Abad, C. L., Roberts, N., Lu, Y., and Campbell, R. H.
(2012). A storage-centric analysis of mapreduce
workloads: File popularity, temporal locality and ar-
rival patterns. In Workload Characterization (IISWC),
2012 IEEE International Symposium on, pages 100–
109. IEEE.
Anjos, J., Kolberg, W., Geyer, C. R., and Arantes, L. B.
(2012). Addressing data-intensive computing prob-
lems with the use of mapreduce on heterogeneous en-
vironments as desktop grid on slow links. In Com-
puter Systems (WSCAD-SSC), 2012 13th Symposium
on, pages 148–155. IEEE.
Boutaba, R., Cheng, L., and Zhang, Q. (2012). On cloud
computational models and the heterogeneity chal-
lenge. Journal of Internet Services and Applications,
3(1):77–86.
Carissimi, A. (2008). Virtualizac¸˜ao: da teoria a soluc¸˜oes.
Minicursos do Simp´osio Brasileiro de Redes de
Computadores–SBRC, 2008:173–207.
Herodotou, H., Dong, F., and Babu, S. (2011). No one
(cluster) size fits all: automatic cluster sizing for data-
intensive analytics. In Proceedings of the 2nd ACM
Symposium on Cloud Computing, page 18. ACM.
Ibrahim, S., Jin, H., Lu, L., Qi, L., Wu, S., and Shi, X.
(2009). Evaluating mapreduce on virtual machines:
The hadoop case. In Cloud Computing, pages 519–
528. Springer.
Jain, R. (1991). The art of computer systems performance
analysis, volume 182. John Wiley & Sons Chichester.
Mell, P. and Grance, T. (2011). The nist definition of cloud
computing (draft). NIST special publication, 800:145.
Mietzner, R. and Leymann, F. (2008). Towards provisioning
the cloud: On the usage of multi-granularity flows and
services to realize a unified provisioning infrastructure
for saas applications. In Services-Part I, 2008. IEEE
Congress on, pages 3–10. IEEE.
Sangroya, A., Serrano, D., and Bouchenak, S. (2012).
Benchmarking dependability of mapreduce systems.
In Reliable Distributed Systems (SRDS), 2012 IEEE
31st Symposium on, pages 21–30. IEEE.
Wang, P., Huang, W., and Varela, C. A. (2010). Impact of
virtual machine granularity on cloud computing work-
loads performance. In Grid Computing (GRID), 2010
11th IEEE/ACM International Conference on, pages
393–400. IEEE.
Yelick, K., Coghlan, S., Draney, B., Canon, R. S., et al.
(2011). The magellan report on cloud computing for
science. US Department of Energy Office of Sci-
ence, Office of Advanced Scientific Computing Re-
search (ASCR) December.
Zaharia, M., Konwinski, A., Joseph, A. D., Katz, R., and
Stoica, I. (2008). Improving mapreduce performance
in heterogeneous environments. In Proceedings of the
8th USENIX conference on Operating systems design
and implementation, pages 29–42.
ICEIS2013-15thInternationalConferenceonEnterpriseInformationSystems
338