Joan Navarro, Ainhoa Azqueta-Alzuaz, Pablo Murta Baião Albino, José Enrique Armendáriz-Iñigo


Cloud computing, implemented by tool suites such as Amazon S3, Dynamo, or Hadoop, has been designed to overcome classical constraints of distributed systems (i.e., poor scale-out, low elasticity, and static behaviour) and to provide high scalability when dealing with large amounts of data. This paper proposes using Hadoop functionalities to efficiently (1) process financial data and (2) detect and correct errors in data repositories; in particular, the work focuses on the SABI database. Certain operations, when performed under the distributed computation paradigm, may achieve better calculation performance.


Paper Citation

in Harvard Style

Navarro J., Azqueta-Alzuaz A., Murta Baião Albino P. and Armendáriz-Iñigo J. E. (2011). CLOUD COMPUTING KEEPS FINANCIAL METRIC COMPUTATION SIMPLE. In Proceedings of the 6th International Conference on Software and Database Technologies - Volume 1: ICSOFT, ISBN 978-989-8425-76-8, pages 143-148. DOI: 10.5220/0003506901430148
