Apache Spark and Scala

By: Udemy

Mode: Online

Fees: ₹ 499 ₹ 1,499

Quick Facts

Particulars: Details
Medium of instruction: English
Mode of learning: Self study
Mode of delivery: Video and text based

Course and certificate fees

Fees information: ₹ 499 ₹ 1,499
Certificate availability: Yes
Certificate providing authority: Udemy

The syllabus

Module 1 - Introduction to Big Data, Spark and Hadoop

  • Overview of Big Data
  • Introduction to Apache Hadoop
  • Hadoop Distributed File System
  • Hadoop MapReduce
  • Introduction to Apache Spark
  • Characteristics of Apache Spark
  • Users and Use Cases of Apache Spark
  • Job Execution Flow and Spark Execution
  • Spark Unified Stack
  • Complete Picture of Apache Spark
  • Why Spark with Scala
  • Apache Spark Architecture

Module 2 - Introduction to Scala Programming Language

  • Introduction to Scala
  • Scala Basic Syntax
  • Scala Class and Objects
  • If-Else Statements in Scala
  • Loops in Scala

Module 3 - Advanced Scala Programming

  • Functions and Procedures in Scala
  • Access Modifiers
  • Strings and Arrays
  • Scala Collections
  • Scala Traits
  • Pattern Matching
  • Scala Extractors
  • Scala Exception Handling
  • Scala File I/O

Module 4 - Apache Spark RDDs

  • Programming with RDDs
  • Starting with Spark
  • Creating RDDs
  • RDD Operations
  • Lifecycle of Spark

Module 5 - Apache Spark RDDs-II

  • Spark Caching
  • Common Transformations and Actions
  • Spark Functions
  • Some more Spark functions

Module 6 - Working with Key Value Pairs

  • Key Value Pairs
  • Aggregate Functions
  • Working with Aggregate Functions
  • Joins in Spark
  • Practical on Word count example

Module 7 - Advanced Spark Programming

  • Spark Shared Variables
  • Spark and Fault Tolerance
  • Broadcast variables
  • Numeric RDD Operations
  • Per-Partition Operations

Module 8 - Running Spark Jobs on Cluster

  • Spark Runtime Architecture
  • Spark Driver
  • Executors
  • Cluster Managers
  • Cluster Managers II

Module 9 - Spark SQL

  • Introduction to Spark SQL
  • Starting Point: SQLContext
  • Hive with Spark SQL
  • Spark SQL Caching

Module 10 - Spark Streaming

  • People.json, Employee.json

Module 11 - Machine Learning in Spark

  • Machine Learning with MLlib
  • MLlib Data Types
  • Labeled Point Data Types
  • Local Matrices in MLlib
  • MLlib Algorithms
  • Classification and Regression
  • Clustering

Module 12 - GraphX in Spark

  • GraphX Introduction
  • Creating Graphs
  • Graph Operators
  • Subgraph Transformation
  • Computation with mapReduceTriplets
