PySpark Training

By Mindmajix Technologies

Mode: Online

Quick Facts

Medium of instruction: English
Mode of learning: Self study, Virtual Classroom
Mode of delivery: Video and Text Based
Frequency of classes: Weekdays, Weekends

Course and certificate fees

Certificate availability: Yes
Certificate providing authority: Mindmajix Technologies

The syllabus

Getting started with PySpark

  • Introduction to Spark
  • Using Spark in Python
  • Using DataFrames
  • How to create a SparkSession
  • Viewing tables
  • Add some Spark to your data

Data manipulation

  • Create columns
  • SQL overview
  • Filtering Data
  • Aggregating
  • Grouping and Aggregating 
  • Joining
  • Model tuning and selection

Getting to know machine learning pipelines

  • Machine Learning Pipelines introduction
  • Join the DataFrames
  • Data types
  • String to integer
  • Create a new column
  • Making a Boolean
  • Assemble a vector
  • Create the pipeline
  • Data transformation
  • Split the data

Model tuning and selection

  • What is logistic regression and how does it work?
  • Construct the modeler
  • Cross-validation
  • Make an evaluator
  • Create a grid
  • Create a validator
  • Fit the model(s)
  • Evaluating binary classifiers
  • Examine the model
