Data Engineering, Serverless ETL & BI on Amazon Cloud

BY
Udemy

Mode

Online

Fees

₹ 2999

Quick Facts

particular details
Medium of instructions English
Mode of learning Self study
Mode of Delivery Video and Text Based

Course and certificate fees

Fees information
₹ 2,999
certificate availability

Yes

certificate providing authority

Udemy

The syllabus

About the Course & Introduction

  • Instructor Introduction
  • Course & Project Overview
  • AWS Billing Component and Precautions to be taken

Getting Started with Redshift and MySql RDS

  • Redshift Overview
  • Redshift vs BigQuery
  • Redshift - Data Consistency
  • Lab:Setup Mysql RDS Instance on AWS Cloud
  • Lab: Mysql RDS Database Import
  • Load Data into Mysql RDS using DBeaver
  • Lab : Redshift Cluster Setup
  • Lab : Sql Client for Redshift and RDS Mysql

ETL and Syncing Traditional Data with Redshift DWH

  • Introduction - Flow of Data
  • Understanding the different Components and their Roles
  • Designing your Data Warehouse - Basic Concepts
  • Lab - AWS DataPipeline - Getting Started with the first import Job
  • Lab : One-time Load Historical Data into Redshift Tables using Copy Command
  • AWS Glue - Overview and Walkthrough
  • Lab - AWS DataPipeline - Setup our first Hourly Jobs for Incremental Data Loads
  • Lab - AWS Glue - First Python Shell Job for incremental Data loads into Redshift
  • Lab - AWS Lambda Function to Trigger our Glue Job
  • Lab - AWS DataPipeline - Second import Job
  • Lab : One-time Load Historical Data into Redshift Tables using Copy Command
  • Lab - AWS Glue - Python Shell Job for incremental Data loads into Redshift
  • AWS Glue Python - Capacity
  • Important - Data Syncing Approach and the Bigger picture
  • Redshift - Cluster Snapshot and restoring
  • Sync the Other Mysql Tables

Data Lakes & Handling External Data Sources

  • Section Overview and Introduction
  • Lab - AWS Glue Crawler Setup
  • Lab - Athena - Data and Table Scan Explanation
  • Lab - Pyspark Development Local
  • Lab - Port Local Pyspark Script to AWS Glue
  • Lab - AWS Glue Pyspark - Parquet File Format & Snappy Compression
  • Lab - AWS Lambda to Trigger Glue Jobs
  • Lab - Glue Crawler Run - Populate Partitions in Data Catalog

Redshift Spectrum

  • Introduction to Redshift Spectrum
  • Lab - Redshift Spectrum | Create External Schema
  • Lab - Redshift Spectrum | Cross Database Joins

Quicksight - BI Reporting and Visualization

  • Quicksight - Introduction
  • Lab - Connecting with Redshift and Create Dashboards/Analyses
  • Lab - Run Custom Sql Queries for QuickSight Analyses and Dashboards

Redshift Optimization Techniques and Fine Tuning

  • Redshift - Sort Keys and Compound Sort Keys
  • Redshift - Interleaved Sort Keys
  • Redshift - Vacuum Operations
  • Redshift - Choosing Keys
  • Redshift - Distribution Keys
  • Lab - Parameter Group | Redshift Cluster Modification
  • Lab - Sort and Dist Keys & Vacuuming | Alter Table Commands

Bonus - Do more with AWS Glue

  • Lab - AWS Glue Pyspark - Insert External Data into Redshift
  • Lab - AWS Glue - Pyspark - Connect to RDS directly

Trending Courses

Popular Courses

Popular Platforms

Learn more about the Courses

Download the Careers360 App on your Android phone

Regular exam updates, QnA, Predictors, College Applications & E-books now on your Mobile

Careers360 App
150M+ Students
30,000+ Colleges
500+ Exams
1500+ E-books