Hive to Advance Hive Real Time Usage Hadoop Querying Tool

BY
Udemy

Mode

Online

Fees

₹ 529 3199

Quick Facts

particular details
Medium of instructions English
Mode of learning Self study
Mode of Delivery Video and Text Based

Course and certificate fees

Fees information
₹ 529  ₹3,199
certificate availability

Yes

certificate providing authority

Udemy

The syllabus

Introduction (Theory)

  • Introduction to Hive
  • Announcement
  • Motivation of Hive
  • Sql vs Hive
  • Trailer - Workng of Hive
  • Architecture of Hive

Hive Basic Commands

  • HADOOP AND HIVE INSTALLATION
  • Create databases
  • Table creation and loading data into it |Part 1
  • Table creation and loading data into it |Part 2
  • Internal vs External table- Explained
  • Create Tables
  • Insert statement
  • Multi insert statement (Advance)
  • Alter Table Schema
  • Sorting -- sort by, order by, distribute by, cluster by

Functions in Hive

  • Date and Mathematical functions
  • String functions
  • Split(), Substr(), instr() functions
  • Conditional statements
  • Explode and Lateral view ( Advance functions)
  • Rlike function (Advance)
  • Rank(), Dense_rank(), Row_number() (Advance)
  • Practice Mathematical Functions

Partitioning in Hive

  • What is Partitioning?
  • Static partitioning
  • Dynamic partitioning
  • Alter Partitioned Table and MSCK Repair command (Advance)

Bucketing in Hive

  • What is Bucketing?
  • Create Bucketed Table
  • Tablesampling (Advance)
  • No_drop, Offline command (Advance)
  • Partitioning

Joins in Hive

  • Inner Joins on 2 Tables
  • Outer Joins on 2 Tables
  • Join 3 Tables in Hive
  • Memory Management & Optimization of Joins
  • Map Joins (Advanced)

Views in Hive

  • What are views?
  • Creating Views in different ways
  • Advantages of Views

Indexing (Advance)

  • Creation of Indexes (Compact and Bitmap)
  • Multiple Indexes on same table
  • When and When not to use Indexing

UDFs (User Defined Functions) Advance

  • What is UDF
  • UDF Implementation - Practical

Table Properties (Advance)

  • Skipping Header and Footer records while loading in table
  • Immutable Table property
  • Purge property + Difference between Drop and Truncate
  • Null Format Property
  • ACID/Transactional features of Hive (Advance)
  • ORC Table properties

Configurations & Settings in Hive (Advance)

  • Part 1
  • Part 2
  • Merge files in Hive
  • Parallelism Property

Variables in Hive (Advance)

  • Executing Hive queries from Bash shell
  • Run Unix & Hadoop commands from Hive Shell
  • Variables in Hive (hiveconf & hivevar)
  • Difference between hiveconf & hivevar
  • Using variables in bash shell
  • Substituting value of a Variable
  • QUIZ

Different Types of Files in Hadoop

  • Text, Sequence, Avro Files
  • RC, ORC, Parquet Files
  • Performance Test results of Various Files
  • Which File Format to choose

Custom Input Formatter in Hive (Advance)

  • Using Custom Input Format in Hive

Miscellaneous (Advance)

  • Modes of Hive
  • Compression techniques in Hive
  • Hiverc file (Creation and Uses)
  • Archiving files in Hive

Tez engine in Hive

  • What is Tez engine and its comparison with MR

Load XML data in Hive

  • How to Load XML data in Hive tables-- Part1
  • How to Load XML data in Hive tables-- Part2

Implementing SCD's in Hive (Advance)

  • What is Slowly changing Dimension (SCD)
  • Implement SCD 1 in Hive--part1
  • Implement SCD 1 in Hive--part2

Use Cases asked in Interviews

  • Word count in Hive
  • Multiple Tables on a single file

Instructors

J Garg
Data Engineering
Udemy

Trending Courses

Popular Courses

Popular Platforms

Learn more about the Courses

Download the Careers360 App on your Android phone

Regular exam updates, QnA, Predictors, College Applications & E-books now on your Mobile

Careers360 App
150M+ Students
30,000+ Colleges
500+ Exams
1500+ E-books