Introduction to Big Data with Spark and Hadoop

BY
IBM via Coursera

Get familiar with Big Data with Spark, Big Data with Hadoop, and related topics through this online course offered on Coursera.

Level

Intermediate

Mode

Online

Duration

7 Weeks

Quick Facts

Particulars             Details
Medium of instruction   English
Mode of learning        Self study
Mode of delivery        Video and text based

Course overview

Introduction to Big Data with Spark and Hadoop is a beginner-level online course offered by Coursera in collaboration with IBM. The course, which can be completed in about 19 hours, introduces learners to Big Data with Spark and Hadoop. It is part of two programmes or specializations offered by IBM: the IBM Data Engineering Professional Certificate and the NoSQL, Big Data, and Spark Foundations Specialization.

The Introduction to Big Data with Spark and Hadoop training on Coursera covers Big Data with Hadoop, the implications of Big Data for Big Data Analytics, Resilient Distributed Datasets (RDDs), and more. The certification calls for about 7 weeks of study. The course gives candidates an insight into Big Data processing tools and their features, limitations, benefits, and applications.

The highlights

  • Provided by Coursera
  • Offered by IBM
  • Beginner Level Course
  • Self-Paced Learning Option
  • 100% Online Course
  • Around 19 Hours to Complete 
  • Flexible Deadlines
  • Shareable Certificate
  • Financial Aid Available

Program offerings

  • English videos with multiple subtitles
  • Shareable certificate
  • Financial aid available
  • Self-paced learning option
  • Course videos & readings
  • Practice quizzes
  • Graded assignments with peer feedback
  • Graded quizzes with feedback

Course and certificate fees

Coursera sets the Introduction to Big Data with Spark and Hadoop certification fee according to the number of months a learner takes to complete the course. Learners can pay the monthly course fee in EMIs. Coursera also offers a 14-day refund period.

Introduction to Big Data with Spark and Hadoop Fee Structure

Duration    Amount in INR
1 Month     INR 4,117
3 Months    INR 8,234 (INR 2,745/month)
6 Months    INR 12,352 (INR 2,059/month)
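
The per-month figures in the fee structure above appear to be each plan's total divided by its duration. The short sketch below reproduces that arithmetic, assuming the result is rounded to the nearest rupee. The names `plans` and `per_month` are illustrative only and are not part of any Coursera tool.

```python
# Plan totals in INR, copied from the fee structure above (months -> total).
plans = {1: 4117, 3: 8234, 6: 12352}


def per_month(total_inr: int, months: int) -> int:
    """Return the monthly cost, rounded to the nearest rupee (assumed rounding)."""
    return round(total_inr / months)


for months, total in plans.items():
    print(f"{months} month(s): INR {total:,} total, about INR {per_month(total, months):,}/month")
```

Under this assumption, the 3-month and 6-month plans work out to the INR 2,745 and INR 2,059 monthly amounts listed above.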

Certificate availability          Yes
Certificate providing authority   Coursera

Who it is for

Introduction to Big Data with Spark and Hadoop Classes are highly recommended for professionals who work in Big Data-related jobs such as Big Data Developer, Big Data Engineer, Big Data Analytics Engineer, and the like. 

Eligibility criteria

Certification Qualifying Details

To earn the certificate after completing the Introduction to Big Data with Spark and Hadoop online course, candidates must finish all the required components of the programme. Learners must also pay the course fee prescribed by Coursera.

What you will learn

Knowledge of Big Data, Knowledge of Apache Spark

Introduction to Big Data with Spark and Hadoop Certification Syllabus will help the learners to have a thorough knowledge of the following: 

  • Big Data with Hadoop
  • SparkML
  • Big Data
  • HDFS, HBase, Spark, and MapReduce
  • Apache Hadoop architecture
  • Apache Spark
  • Impact of Big Data, processing methods, use cases, and tools

The syllabus

Module 1: What is Big Data?

Videos
  • What is Big Data?
  • Impact of Big Data
  • Parallel Processing, Scaling, and Data Parallelism
  • Big Data Tools and Ecosystem
  • Open Source and Big Data
  • Beyond the Hype
  • Big Data Use Cases
Readings
  • Course Introduction
  • Summary & Highlights
Practice Exercises
  • Practice Quiz: Introduction to Big Data
  • Graded Quiz: Introduction to Big Data

Module 2: Introduction to the Hadoop Ecosystem

Videos
  • Introduction to Hadoop
  • Intro to MapReduce
  • Hadoop Ecosystem 
  • HDFS 
  • Hive
  • HBase
Reading
  • Summary & Highlights
Practice Exercises
  • Practice Quiz: Introduction to Hadoop
  • Graded Quiz: Introduction to Hadoop

Module 3: Apache Spark

Videos
  • Why use Apache Spark?
  • Functional Programming Basics
  • Parallel Programming using Resilient Distributed Datasets 
  • Scale out / Data Parallelism in Apache Spark
  • Dataframes and SparkSQL
Reading
  • Summary & Highlights
Practice Exercises
  • Practice Quiz: Introduction to Apache Spark
  • Graded Quiz: Introduction to Apache Spark

Module 4: DataFrames and SparkSQL

Videos
  • RDDs in Parallel Programming and Spark
  • Data-frames and Datasets
  • Catalyst and Tungsten
  • ETL with DataFrames
  • Real-world usage of SparkSQL
Reading
  • Summary & Highlights
Practice Exercises
  • Practice Quiz: Introduction to Data-Frames & SparkSQL
  • Graded Quiz: Introduction to Data-Frames & SparkSQL

Module 5: Development and Runtime Environment Options

Videos
  • Apache Spark Architecture
  • Overview of Apache Spark Cluster Modes
  • How to Run an Apache Spark Application
  • Using Apache Spark on IBM Cloud
  • Setting Apache Spark Configuration
  • Running Spark on Kubernetes
Readings
  • Summary & Highlights
  • Summary & Highlights
Practice Exercises
  • Practice Quiz: Spark Architecture
  • Graded Quiz: Spark Architecture
  • Practice Quiz: Spark Runtime Environments
  • Graded Quiz: Spark Runtime Environments

Module 6: Monitoring & Tuning

Videos
  • The Apache Spark User Interface
  • Monitoring Application Progress
  • Debugging Apache Spark Application Issues
  • Understanding Memory Resources
  • Understanding Processor Resources
Readings
  • Summary & Highlights
  • Instructions for the Final Exam
  • Congrats & Next Steps
  • Team & Acknowledgements
Practice Exercises
  • Practice Quiz: Introduction to Monitoring & Tuning
  • Graded Quiz: Introduction to Monitoring & Tuning
  • Final Exam

Admission details

Step 1 - Visit the official course URL:

https://www.coursera.org/learn/introduction-to-big-data-with-spark-hadoop

Step 2 - Start the online certification programme by clicking ‘Enroll Now’.

Scholarship Details

Coursera provides financial aid to students of the Introduction to Big Data with Spark and Hadoop certification who cannot afford the course fee. They can apply by submitting the required information through the link on the course page. Aid is awarded based on the applicant's financial background.

How it helps

The Introduction to Big Data with Spark and Hadoop certification helps learners understand Big Data with Spark and Hadoop, and the related concepts, in greater depth. Candidates also receive a certificate of completion at the end of the programme.

Instructors

Ms Aije Egwaikhide
Senior Data Scientist
IBM

Other Bachelors, Other Masters

Mr Romeo Kienzler
Data Scientist
IBM

Other Masters

Mr Rav Ahuja
Global Program Director
IBM

B.E /B.Tech, MBA

FAQs

Who are the instructors who developed the Introduction to Big Data with Spark and Hadoop online course?

The certification programme on Big Data is prepared and supervised by Karthik Muthuraman, a software engineer, and Aije Egwaikhide, a Senior Data Scientist at IBM.

Which level of audience and learners can join the Introduction to Big Data with Spark and Hadoop online certification?

The online certificate programme is recommended for intermediate-level students who want to explore Big Data Analytics. 

The online course is part of two different programmes offered by IBM; what are they?

The online programme is part of the IBM Data Engineering Professional Certificate and the NoSQL, Big Data, and Spark Foundations Specialization.

What are the enrollment options for the students to pursue the course?

There are two options for students to enroll in the programme, namely, the audit mode which requires no fee, and the enrolment mode by paying the fee. 

