Pentaho for ETL & Data Integration Masterclass - PDI 9

By: Udemy
Mode: Online
Fees: ₹ 3,499

Quick Facts

Medium of instruction: English
Mode of learning: Self study
Mode of delivery: Video and text based

Course and certificate fees

Fees: ₹ 3,499
Certificate availability: Yes
Certificate providing authority: Udemy

The syllabus

Introduction

  • Welcome to the course
  • Course Resources

Pentaho Data Integration (PDI) Installation and Setup

  • Setting up environment and installing PDI
  • This is a milestone!
  • Opening Spoon - The Graphical UI

A Simple ETL Demonstration

  • The example problem statement
  • Demonstration of a PDI transformation
  • Demonstration of a PDI Job

Basic Concepts - Theory for foundational understanding

  • What is ETL?
  • Data Warehouse, Operational Database and Data Mart
  • Inmon vs Kimball Architecture
  • ETL vs ELT

The ETL Process: The Practical Part Begins Here

  • Data and the ETL process

DATA EXTRACTION: Extracting Tabular Data

  • Manually entering data into PDI
  • Inputting Data from a TXT (text) file
  • Input from multiple CSV files at the same time
  • Inputting Data from an Excel file
  • Extracting Data from Zipped files

DATA EXTRACTION: Extracting Non-Tabular Data

  • Extracting from XML
  • Extracting from JSON

Extracting from an SQL Table

  • Plan for importing sales Data
  • Installing and setting up PostgreSQL
  • Creating Sales Table in SQL
  • Extracting from an SQL table

Storing and Retrieving Data from Cloud Storage

  • Storing Data on AWS S3
  • Reading data from AWS S3

Merging Data Streams

  • Concepts: Merging Data Streams
  • Sorted Merge Step - Merging customer data
  • Merging product data
  • Time to check your understanding
  • Append data stream - merging sales data
  • Time to check your understanding

Data Cleansing

  • Introduction to Data Cleansing
  • Value Mapper Step
  • Replace in String Step
  • Time to check your understanding
  • Fuzzy Match concepts
  • Fuzzy Match Step in PDI
  • Fuzzy Match Algorithms
  • Time to check your understanding
  • Formula Step and changing data format
  • Common Data Cleaning Steps
  • Quiz

Data Validation

  • Introduction to Data validation
  • Data validation 1 - String-to-Int and integer range validations
  • Data validation 2 - Checking Reference Values using stream look-up
  • Data validation 3 - Order date < shipping date using calculator step
  • Common Data Validation steps

Error Handling

  • Correcting the errors and merging with main stream
  • Time to check your understanding
  • Writing the errors to the log
  • Time to check your understanding
  • Writing the errors to a separate file
  • Time to check your understanding

Transformation and Analytics Steps

  • Concatenating Address Fields
  • Data Aggregation using Group-by
  • Normalization and Denormalization
  • Number Range Step

PDI - SQL connection

  • Introduction to PDI - SQL connection
  • Reading and filtering data from DB into PDI
  • Updating and Inserting data into DB from PDI
  • Deleting data from SQL DB using PDI

Conceptual Understanding for Loading Data

  • Facts and Dimensions tables
  • Time to check your understanding
  • Surrogate Keys in Dimension tables
  • Type 1 & 2 Slowly Changing Dimensions
  • Time to check your understanding
  • Schemas

Loading Data into Data Mart

  • Creating tables in DB
  • Loading customer data using combination lookup/update step
  • Loading product data using dimension lookup step
  • Loading sales data after database lookup steps

Running Java and JavaScript

  • Scripting Steps

PDI Jobs

  • PDI Jobs vs Transformation
  • Controlling the flow of execution
  • Setting variables using set variables step
  • File and Folder Management
  • Sending Email Step
  • Abort Job Step
  • Time to check your understanding

Scheduling a job for production environment

  • Running using command prompt and scheduling

Metadata Injection

Regex Notation

  • Regular Expressions for advanced String Matching

Bonus Section

  • Alternative to Pentaho
  • The final milestone!
  • Bonus Lecture
