Authors:
Oumaima El Haddadi
1
;
Mahmoud El Hamlaoui
1
;
Dkaki Taoufiq
2
and
Mahmoud Nassar
1
Affiliations:
1
IMS Team, ADMIR Laboratory, Rabat IT Center, Mohammed V University in Rabat, Rabat, Morocco
;
2
IRIS Team, IRIT Laboratory, University Toulouse Jean - Jaurès, Toulouse, France
Keyword(s):
Data Lake, Big Data, Metadata, Data Management.
Abstract:
Due to the digital transformation and huge amount of publicly available data, decision support systems are becoming highly useful in helping to defining, managing and improving business strategies and objectives. Indeed, data is a key asset and a key competitive differentiator for all organizations. This newly available data has changed traditional data processing and created new challenges related to the velocity, volume and variety of data. To address these challenges related to the storage of heterogeneous data and to provide the ability of rapid data processing, we explore the data lake paradigm. In this paper, we present the state-of-the-art of Data Lake systems and highlight their major advantages and drawbacks. We also will propose a solution to improve Data Lake System.