Hadoop Administration Training

BY
Mindmajix Technologies

Mode

Online

Quick Facts

particular details
Medium of instructions English
Mode of learning Self study, Virtual Classroom
Mode of Delivery Video and Text Based
Frequency of Classes Weekdays, Weekends

Course and certificate fees

certificate availability

Yes

certificate providing authority

Mindmajix Technologies

The syllabus

Chapter 1: Getting Started with Apache Hadoop

  • History of Apache Hadoop and its trends
  • Components of Apache Hadoop
  • Understanding the Apache Hadoop daemons
  • Namenode
  • Secondary namenode
  • Jobtracker
  • Tasktracker
  • ResourceManager
  • NodeManager
  • Job submission in YARN
  • Introducing Cloudera
  • Introducing CDH
  • Responsibilities of a Hadoop administrator
  • Summary

Chapter 2: HDFS and MapReduce

  • Essentials of HDFS
  • Configuring HDFS
  • The read/write operational flow in HDFS
  • Writing files in HDFS
  • Reading files in HDFS
  • Understanding the namenode UI
  • Understanding the secondary namenode UI
  • Exploring HDFS commands
  • Commonly used HDFS commands
  • Commands to administer HDFS 
  • Getting acquainted with MapReduce
  • Understanding the map phase
  • Understanding the reduced phase
  • Learning all about the MapReduce job flow
  •  Configuring MapReduce
  • Understanding the job tracker UI
  • Getting MapReduce job information
  • Summary

Chapter 3: Cloudera's Distribution Including Apache Hadoop

  • Getting started with CDH
  • Understanding the CDH components
  • Apache Hadoop
  • Apache Flume NG
  • Apache Sqoop
  • Apache Pig
  • Apache Hive
  • Apache ZooKeeper
  • Apache HBase
  • Apache Mahout
  • Apache Avro
  • Apache Oozie
  • Cloudera Impala
  • Cloudera Hue
  •  Pig UI
  •  File Browser
  • Sqoop Jobs
  •  Job Browser
  • Collection Manager
  • Hue Shell
  • HBase Browser
  • Installing CDH
  • Stopping Hadoop services
  • Understanding a YARN cluster
  • Installing the CDH components
  • Installing Apache Flume
  • Installing Apache Sqoop
  • Installing Apache Sqoop
  • Installing Apache Pig
  • Installing Apache Hive
  • Installing Apache Oozie

Chapter 4: Exploring HDFS Federation and Its High Availability

  • Implementing HDFS Federation
  • Configuring HDFS Federation
  • Configuring ViewFS for a federated HDFS
  • Implementing HDFS High Availability
  • The Quorum-based storage
  • Configuring HDFS high availability by the Quorum-based storage
  • Shared storage using NFS
  • Configuring HDFS high availability by shared storage using NFS
  • Configuring automatic failover for HDFS high availability
  • Jobtracker high availability
  • Configuring job tracker high availability
  • Configuring automatic failover for job tracker high availability

Chapter 5: Using Cloudera Manager

  • Introducing Cloudera Manager
  • Understanding the Cloudera Manager architecture
  • Installing Cloudera Manager
  • Navigating the Cloudera Manager Web console
  • Navigating the Home screen
  • Navigating the Clusters menu
  • Exploring the Hosts menu
  • Understanding the Diagnostics menu
  • Understanding the Audits screen
  • Understanding the Charts menu
  • Understanding the Backup menu
  • Understanding the Administration menu
  • Configuring High Availability using Cloudera Manager Summary

Chapter 6: Implementing Security Using Kerberos

  • Understanding authentication and authorization
  • Introducing Kerberos
  • Understanding the Kerberos Architecture
  • Accessing a secure file server
  • Understanding important Kerberos terms
  • Installing Kerberos
  • Configuring the KDC Server
  • Testing the KDC installation
  • Configuring the Kerberos clients
  • Configuring Kerberos for Apache Hadoop
  • Configuring Kerberos principal for Cloudera Manager Server
  • Configuring the Cloudera Manager Server for Kerberos
  • Authorization in Apache Hadoop
  • Configuring access control lists in Hadoop
  • SummaryAuthenticating a user

Chapter 7: Managing an Apache Hadoop Cluster

  • Configuring Hadoop services using Cloudera Manager
  • Adding a service to the cluster
  • Removing a service from the cluster
  • Role management in Cloudera Manager
  • Adding a role instance to a host
  • Adding a DataNode role to a host
  • Adding a TaskTracker role to a host
  • Managing hosts using Cloudera Manager
  • Adding a new host
  • Removing an existing host
  • Managing multiple clusters with Cloudera Manager
  • Rebalancing a Hadoop cluster from Cloudera Manager
  • Adding the Balancer service to the cluster
  •  Rebalancing the cluster
  • Summary

Chapter 8: Cluster Monitoring Using Events and Alerts

  • Monitoring Hadoop services from Cloudera Manager
  • Understanding events and alerts
  • Configuring events and alerts
  • Configuring the alert delivery by an e-mail

Practice Test & Interview Questions

Trending Courses

Popular Courses

Popular Platforms

Learn more about the Courses

Download the Careers360 App on your Android phone

Regular exam updates, QnA, Predictors, College Applications & E-books now on your Mobile

Careers360 App
150M+ Students
30,000+ Colleges
500+ Exams
1500+ E-books