Apache Spark (TM) SQL for Data Analysts

BY
Databricks via Coursera

If you are a data analyst and want to learn Apache Spark and Delta Lake, join Coursera.

Lavel

Intermediate

Mode

Online

Duration

13 Hours

Quick Facts

particular details
Medium of instructions English
Mode of learning Self study
Mode of Delivery Video and Text Based

Course overview

Apache Spark (TM) SQL for Data Analysts is a 13 hours completion time online programme. The courses are developed for intermediate-level candidates who have familiarity with SQL.  Apache Spark (TM) SQL for Data Analysts Certification Course is curated by Kate Sullivan, the technical curriculum developer. The online programme is one of the courses of the Data Science with Databricks for Data Analysts Specialization that is offered by Databricks, the data and AI firm.

Apache Spark (TM) SQL for Data Analysts Training, administered by Coursera, will walk you through the data analysis, SQL and Apache Spark which is one of the most sought-after tools in big data analytics.  Apache Spark (TM) SQL for Data Analysts Certification by Coursera will also provide the learners with practical exposure to Apache Spark through various practical exercises. 

The highlights

  • Provided by Coursera
  • Offered by Databricks 
  • Intermediate Level Course
  • Self-Paced Learning Option
  • 100% Online Course
  • Around 13 Hours to Complete 
  • Flexible Deadlines
  • Shareable Certificate
  • Financial Aid Available

Program offerings

  • English videos with multiple subtitles
  • Shareable certificate
  • Financial aid available
  • Shareable certificates
  • Self-paced learning option
  • Course videos & readings
  • Practice quizzes
  • Graded assignments with peer feedback
  • Graded quizzes with feedback

Course and certificate fees

certificate availability

Yes

certificate providing authority

Coursera

Who it is for

Apache Spark (TM) SQL for Data Analysts Classes is an ideal certification programme for the professionals like  Big Data developers, Big Data engineers, and Big Data Analytics engineers.

Eligibility criteria

Certification Qualifying Details

After the completion of the  Apache Spark (TM) SQL for Data Analysts online course, Coursera will award certificates only to the learners who have covered the course completely and paid the fee specified by Coursera.  

What you will learn

Knowledge of apache spark

At the end of the course, the students will have the capacity to make use of Spark SQL and Delta Lake to ingest, query, and transform data for valuable insights extraction. Some other learnings the learners can earn from the Apache Spark (TM) SQL for Data Analysts Certification Syllabus:

  • Data Analysis
  • Spark SQL
  • SQL
  • Delta Lake

The syllabus

Week 1: Welcome to Apache Spark SQL for Data Analysts

Video
  • Course goals
Reading
  • Before you begin
Practice Exercise
  • End of module knowledge check

Week 2: Spark makes big data easy

Videos
  • Introduction to module 2
  • What is big data?
  • Common struggles with big data
  • Big Data Needs
  • Apache Spark Intro
  • Spark SQL
Practice Exercise
  • Module 2 Concept Review

Week 3: Using Spark SQL on Databricks

Videos
  • Introduction to Module 3
  • Signing up for Databricks Community Edition
  • Preparing your workspace
  • Working with notebooks
  • Using course materials
  • Basic queries with Spark SQL reading introduction
  • Data Visualization on Databricks reading introduction
  • Data visualization tools
  • Exploratory Data Analysis lab introduction
Readings
  • Course Materials
  • Basic Queries reading activity
  • Data Visualization reading activity
  • Your turn! Exploratory Data Analysis lab
Practice Exercises
  • Module 3 Concept Review
  • 3.3 Exploratory Data Analysis Quiz

Week 4: Spark Under the Hood

Videos
  • Introduction to module 4
  • Understanding optimizations
  • The physical cluster
  • The SparkUI and SQL tab
  • Optimizing query logic
  • Impact of Caching
  • Optimizing with selective data loading
Practice Exercise
  • Module 4 Concept Review

Week 5: Complex Queries

Videos
  • Introduction to module 5
  • What is nested data? 
  • Introduction to managing nested data
  • Introduction to Manipulating Data 
  • Introduction to Data Munging
Readings
  • Managing Nested Data reading activity
  • Manipulating Data reading activity
  • 5.3 Data Munging Lab
Practice Exercises
  • Module 5 Concept Review
  • Lab 5.3 Quiz

Week 6: Applied Spark SQL

Videos
  • Introduction to module 6
  • Complex data - common strategies
  • About higher-order functions
  • Higher-order functions introduction
  • Introducing Aggregating and Summarizing Data
  • Partitioning Tables Introduction
  • Sharing Insights Lab Introduction
Readings
  • Higher Order Functions reading activity
  • Aggregating and Summarizing Data reading activity
  • Partitioning Tables
  • Sharing Insights
Practice Exercises
  • Module 6 concept review
  • Lab 6.4 Quiz

Week 7: Data Storage and Optimization

Videos
  • Introduction to module 7
  • A quick refresher
  • Introducing a new data management paradigm
  • Introduction to the lesson
  • What is Delta Lake
Readings
  • Data Warehouses
  • Data Lakes
  • Data Lakes vs Data Warehouses
  • The Lakehouse

Week 8: Delta Lake with Spark SQL

Videos
  • Introduction to the module
  • Intro to Using Delta reading
  • Managing Records in a Delta table
  • Delta Engine Optimization Introduction
  • Delta Lake Lab Introduction
Readings
  • 8.1 Using Delta
  • 8.2 Managing records
  • 8.3 Optimizing Delta
  • Delta Lab
Practice Exercise
  • 8.4 Delta Lab

Week 9: SQL Coding Challenges

Reading
  • SQL coding challenges
Practice Exercise
  • Final Exam

Admission details

Step 1 -Browse the official URL : https://www.coursera.org/learn/apache-spark-sql-for-data-analysts

Step 2- Join the online course by choosing the option ‘Enroll Now’. 

Scholarship Details

The Apache Spark (TM) SQL for Data Analysts Certification learners who cannot afford the Coursera course fee can apply for financial aid. The scholarship will be rendered to the learners purely based on their financial background.

How it helps

The Apache Spark (TM) SQL for Data Analysts Certification benefits includes that the learner can have a thorough understanding of Apache Spark along with practical exposure. Plus, the learners will be provided with a shareable certificate after the completion of the programme. 

Instructors

Ms Kate Sullivan

Ms Kate Sullivan
Technical Curriculum Developer
Databricks

FAQs

Which AI company is working with Coursera to offer the Apache Spark (TM) SQL for Data Analysts online course?

The AI company that collaborated with Coursera to provide the online programme is Databricks. 

Who is the person who curated and tutors the Apache Spark (TM) SQL for Data Analysts online certification?

The online certificate programme is curated and tutored by Kate Sullivan who is a technical curriculum developer.

Who is the intended audience of the online course? Is there any pre-requirement to be able to join the programme?

The intended audience of the online certification course is intermediate-level students and Coursera recommends that learners have the familiarity with SQL to pursue the online course. 

What is the name of the Coursera-offered specialization that includes this online course?

The Coursera-offered specialization which includes this programme is Data Science with Databricks for Data Analysts Specialization. 

How many horses will be enough at minimum to cover the online course fully?

The online programmes will need approximately 13 hours to complete the course successfully. 

Similar Courses

Perform data science with Azure Databricks

Perform data science with Azure Databricks

Microsoft Corporation via Coursera

3 Weeks Online
Intermediate
Big Data Analysis with Scala and Spark

Big Data Analysis with Scala and Spark

Swiss Federal Institute of Technology Lausanne via Coursera

4 Weeks Online
Intermediate
Introduction to Big Data with Spark and Hadoop

Introduction to Big Data with Spark and Hadoop

IBM via Coursera

7 Weeks Online
Intermediate

Scalable Machine Learning on Big Data using Apache...

IBM via Coursera

Online
Intermediate
Free

Courses of your Interest

Salesforce Administrator and App Builder

Salesforce Administrator and App Builder

SkillUp Online via Simplilearn

16 Hours Online
Intermediate
Free
Introduction to Medical Software

Introduction to Medical Software

Yale University, New Haven via Coursera

3 Weeks Online
Intermediate
Free

Google Cloud Architect Program

Google Cloud via SkillUp Online

11 Weeks Online
Intermediate
₹ 54,999

Google Cloud Architect Program

Google via SkillUp Online

11 Weeks Online
Intermediate
₹ 54,999
Information Security Design and Development

Information Security Design and Development

Coventry University, Coventry via Futurelearn

10 Weeks Online
Intermediate
Ethics Laws and Implementing an AI Solution on Mic...

Ethics Laws and Implementing an AI Solution on Mic...

CloudSwyft Global Systems, Inc via Futurelearn

14 Weeks Online
Intermediate
Network Security and Defence

Network Security and Defence

Coventry University, Coventry via Futurelearn

10 Weeks Online
Intermediate

Cyber Security Foundations Start Building Your Car...

EC-Council via Futurelearn

15 Weeks Online
Intermediate
Applied Data Analysis

Applied Data Analysis

CloudSwyft Global Systems, Inc via Futurelearn

14 Weeks Online
Intermediate
₹ 900

Trending Courses

Popular Courses

Popular Platforms

Learn more about the Courses

Download the Careers360 App on your Android phone

Regular exam updates, QnA, Predictors, College Applications & E-books now on your Mobile

Careers360 App
150M+ Students
30,000+ Colleges
500+ Exams
1500+ E-books