Authors:
Marco António de Sousa Reis
and
Aletéia Patrícia Favacho de Araújo
Affiliation:
Department of Computer Science, University of Brasília, UnB, Brasília and Brazil
Keyword(s):
Big Data, Cloud Computing, NoSQL, Hadoop, Data Engineering.
Related
Ontology
Subjects/Areas/Topics:
Cloud Computing
;
Cloud Computing Enabling Technology
;
Xaas
Abstract:
There are multiple definitions and technologies making the path to a big data solution a challenging task. The use of cloud computing together with a proven big data software architecture helps reducing project costs, development time and abstracts the complexity of the underlying implementation technologies. The combination of cloud computing and big data platforms results in a new service model, called Big Data as a Service (BDaaS), that automates the process of provisioning the infrastructure. This paper presents an architecture for big data systems in private clouds, using a real system to evaluate the functionalities. The architecture supports batch/real-time processing, messaging systems and data services based on web APIs. The architectural description defines the technology roadmap, composed exclusively of big data tools. The results showed that the proposed architecture supports the facilities of cloud computing and performs well in the analysis of large datasets.