Note: This post is part of the CCA Administrator Exam (CCA131) objectives series. HDFS High Availability Overview: A single NameNode is a single point of failure in a Hadoop cluster. You can experience HDFS downtime from an unexpected NameNode crash or from planned NameNode maintenance. Having a NameNode high availability setup avoids these single points […]
CCA 131
CCA131 – Configuring HDFS snapshot Policy
Note: This post is part of the CCA Administrator Exam (CCA131) objectives series. What is HDFS Snapshot Policy? You can create Snapshot Policies using Cloudera Manager to take automated snapshots of snapshottable paths on HDFS. The snapshot policies run at the time specified by the user (hourly, daily, weekly, etc.). Before we can create […]
CCA 131 – Create/restore a snapshot of an HDFS directory (Using Cloudera Manager)
Note: This post is part of the CCA Administrator Exam (CCA131) objectives series. HDFS Snapshot: Directories in HDFS can be snapshotted, which means creating one or more point-in-time images, or snapshots, of the directory. Snapshots include subdirectories, and can even include the entire filesystem (be careful with this for obvious reasons). Snapshots can be used […]
CCA131 – Create an HDFS user’s home directory
Note: This post is part of the CCA Administrator Exam (CCA131) objectives series. In the exam, you may be asked to create a home directory on HDFS for an existing local user. You may further be asked to set specific ownership or permissions on the home directory. The process basically involves: Create a local […]
CCA 131 – Configure HDFS ACLs
Note: This post is part of the CCA Administrator Exam (CCA131) objectives series. The basis for Hadoop Access Control Lists is POSIX ACLs, available on the Linux filesystem. These ACLs allow you to link a set of permissions to a file or directory that is not limited to just one user and a group who […]
CCA 131 – Rebalance the cluster
Note: This post is part of the CCA Administrator Exam (CCA131) objectives series. In a long-running cluster, data might be unequally distributed across DataNodes. This could be due to node failures or the addition of nodes to the cluster. To make sure that the data is equally distributed across DataNodes, it […]
CCA 131 – Commission/decommission a node
Note: This post is part of the CCA Administrator Exam (CCA131) objectives series. Cloudera Manager makes it very simple to add and remove hosts in a cluster. All host management operations in Cloudera Manager are done from the Hosts screen. In this post, we will go through the steps of commissioning and decommissioning a host […]
CCA 131 – Add a service using Cloudera Manager
Note: This post is part of the CCA Administrator Exam (CCA131) objectives series. To start with, we have installed just a few services, such as HDFS, YARN, and ZooKeeper, in our CDH setup. Cloudera Manager enables you to add any other service later on. As part of the exam, you may be asked to add […]
CCA 131 – Add a new node to an existing cluster
Note: This post is part of the CCA Administrator Exam (CCA131) objectives series. Cloudera Manager makes it very simple to add and remove hosts in a cluster. All host management operations in Cloudera Manager are done from the Hosts screen. In this section, we will go through the steps of adding a new host to […]
CCA 131 – Install CDH using Cloudera Manager
Note: This post is part of the CCA Administrator Exam (CCA131) objectives series. In the last post, we saw how to install the Cloudera Manager server and Cloudera Manager agents. In this post, we will begin installing the CDH cluster using Cloudera Manager. You can point your browser to http://[cloudera manager host]:7180 and […]