Authors:
Luis Bautista
and
Alain April
Affiliation:
ETS and University of Quebec, Canada
Keyword(s):
Cloud Computing, High Availability Cluster, Hadoop Distributed File System, ZooKeeper.
Related
Ontology
Subjects/Areas/Topics:
Cloud Computing
;
Cloud Computing Enabling Technology
;
Cloud Ilities (Scalability, Availability, Reliability)
;
Disaster Recovery
Abstract:
Hadoop is a set of utilities and frameworks for the development and storage of distributed applications in cloud computing, the core component of which is the Hadoop Distributed File System (HDFS). NameNode is a key element of its architecture, and also its “single point of failure”. To address this issue, we propose a replication mechanism that will protect the NameNode data in case of failure. The proposed solution involves two distinct components: the creation of a BackupNode cluster that will use a leader election function to replace the NameNode, and a mechanism to replicate and synchronize the file system namespace that is used as a recovery point.