- History of Apache Hadoop and its trends
- Components of Apache Hadoop
- Understanding the Apache Hadoop daemons
- Namenode
- Secondary namenode
- Jobtracker
- Tasktracker
- ResourceManager
- NodeManager
- Job submission in YARN
- Introducing Cloudera
- Introducing CDH
- Responsibilities of a Hadoop administrator
- Summary
Quick Facts
particular | details | ||||
---|---|---|---|---|---|
Medium of instructions
English
|
Mode of learning
Self study, Virtual Classroom
|
Mode of Delivery
Video and Text Based
|
Frequency of Classes
Weekdays, Weekends
|
Course and certificate fees
certificate availability
Yes
certificate providing authority
Mindmajix Technologies
The syllabus
Chapter 1: Getting Started with Apache Hadoop
Chapter 2: HDFS and MapReduce
- Essentials of HDFS
- Configuring HDFS
- The read/write operational flow in HDFS
- Writing files in HDFS
- Reading files in HDFS
- Understanding the namenode UI
- Understanding the secondary namenode UI
- Exploring HDFS commands
- Commonly used HDFS commands
- Commands to administer HDFS
- Getting acquainted with MapReduce
- Understanding the map phase
- Understanding the reduced phase
- Learning all about the MapReduce job flow
- Configuring MapReduce
- Understanding the job tracker UI
- Getting MapReduce job information
- Summary
Chapter 3: Cloudera's Distribution Including Apache Hadoop
- Getting started with CDH
- Understanding the CDH components
- Apache Hadoop
- Apache Flume NG
- Apache Sqoop
- Apache Pig
- Apache Hive
- Apache ZooKeeper
- Apache HBase
- Apache Mahout
- Apache Avro
- Apache Oozie
- Cloudera Impala
- Cloudera Hue
- Pig UI
- File Browser
- Sqoop Jobs
- Job Browser
- Collection Manager
- Hue Shell
- HBase Browser
- Installing CDH
- Stopping Hadoop services
- Understanding a YARN cluster
- Installing the CDH components
- Installing Apache Flume
- Installing Apache Sqoop
- Installing Apache Sqoop
- Installing Apache Pig
- Installing Apache Hive
- Installing Apache Oozie
Chapter 4: Exploring HDFS Federation and Its High Availability
- Implementing HDFS Federation
- Configuring HDFS Federation
- Configuring ViewFS for a federated HDFS
- Implementing HDFS High Availability
- The Quorum-based storage
- Configuring HDFS high availability by the Quorum-based storage
- Shared storage using NFS
- Configuring HDFS high availability by shared storage using NFS
- Configuring automatic failover for HDFS high availability
- Jobtracker high availability
- Configuring job tracker high availability
- Configuring automatic failover for job tracker high availability
Chapter 5: Using Cloudera Manager
- Introducing Cloudera Manager
- Understanding the Cloudera Manager architecture
- Installing Cloudera Manager
- Navigating the Cloudera Manager Web console
- Navigating the Home screen
- Navigating the Clusters menu
- Exploring the Hosts menu
- Understanding the Diagnostics menu
- Understanding the Audits screen
- Understanding the Charts menu
- Understanding the Backup menu
- Understanding the Administration menu
- Configuring High Availability using Cloudera Manager Summary
Chapter 6: Implementing Security Using Kerberos
- Understanding authentication and authorization
- Introducing Kerberos
- Understanding the Kerberos Architecture
- Accessing a secure file server
- Understanding important Kerberos terms
- Installing Kerberos
- Configuring the KDC Server
- Testing the KDC installation
- Configuring the Kerberos clients
- Configuring Kerberos for Apache Hadoop
- Configuring Kerberos principal for Cloudera Manager Server
- Configuring the Cloudera Manager Server for Kerberos
- Authorization in Apache Hadoop
- Configuring access control lists in Hadoop
- SummaryAuthenticating a user
Chapter 7: Managing an Apache Hadoop Cluster
- Configuring Hadoop services using Cloudera Manager
- Adding a service to the cluster
- Removing a service from the cluster
- Role management in Cloudera Manager
- Adding a role instance to a host
- Adding a DataNode role to a host
- Adding a TaskTracker role to a host
- Managing hosts using Cloudera Manager
- Adding a new host
- Removing an existing host
- Managing multiple clusters with Cloudera Manager
- Rebalancing a Hadoop cluster from Cloudera Manager
- Adding the Balancer service to the cluster
- Rebalancing the cluster
- Summary
Chapter 8: Cluster Monitoring Using Events and Alerts
- Monitoring Hadoop services from Cloudera Manager
- Understanding events and alerts
- Configuring events and alerts
- Configuring the alert delivery by an e-mail
Practice Test & Interview Questions
Articles
Popular Articles
prev
next
Latest Articles
Top 50 Hadoop Interview Questions for Freshers and Experienced Professionals
Updated On 17 Apr, 2024
prev
next