Reinforcement Learning beginner to master AI in Python

BY
Udemy

Level

Beginner

Mode

Online

Fees

₹ 3,099

Quick Facts

Particulars: Details
Medium of instruction: English
Mode of learning: Self study
Mode of delivery: Video and text based

Course and certificate fees

Fees information: ₹ 3,099
Certificate availability: Yes
Certificate providing authority: Udemy

The syllabus

Welcome module

  • [IMPORTANT] English captions available for sections 1-4
  • Welcome
  • Course Structure
  • Environment setup [Important]
  • Setup - Mac

The Markov decision process (MDP)

  • The Markov decision process (MDP)
  • Types of Markov decision process
  • Trajectory vs episode
  • Reward vs Return
  • Discount factor
  • Policy
  • State values v(s) and action values q(s,a)
  • Bellman equations
  • Solving a Markov decision process
  • Setup - MDP in code
  • MDP in code - Part 1
  • MDP in code - Part 2
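
As a rough illustration of the "Reward vs Return" and "Discount factor" topics above, the discounted return of one episode can be sketched as follows. This is a minimal sketch, not code from the course: the function name `discounted_return` and the sample reward list are assumptions made for illustration.

```python
def discounted_return(rewards, gamma=0.99):
    """Return G_0 = sum over k of gamma**k * rewards[k] for one episode.

    `rewards` is the list of rewards collected along the episode
    (hypothetical input; not taken from the course material).
    """
    g = 0.0
    # Work backwards so each step applies G_t = r_t + gamma * G_{t+1}.
    for r in reversed(rewards):
        g = r + gamma * g
    return g


if __name__ == "__main__":
    # Three rewards of 1 with gamma = 0.5: 1 + 0.5 + 0.25
    print(discounted_return([1.0, 1.0, 1.0], gamma=0.5))
```

With gamma close to 0 the return is dominated by the first reward; with gamma equal to 1 it is the plain sum of rewards.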

Dynamic Programming

  • Introduction to Dynamic Programming
  • Value iteration
  • Setup - Value iteration
  • Coding - Value iteration 1
  • Coding - Value iteration 2
  • Coding - Value iteration 3
  • Coding - Value iteration 4
  • Coding - Value iteration 5
  • Policy iteration
  • Setup - Policy iteration
  • Coding - Policy iteration 1
  • Policy evaluation
  • Coding - Policy iteration 2
  • Policy Improvement
  • Coding - Policy iteration 3
  • Coding - Policy iteration 4
  • Policy iteration in practice
  • Generalized Policy Iteration (GPI)
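
To give a feel for the value iteration lessons listed above, here is a minimal sketch on a three-state chain. The environment, the `transitions` layout and the function name `value_iteration` are all assumptions for illustration; the course may use a different environment and code structure.

```python
def value_iteration(transitions, gamma=0.9, theta=1e-8):
    """Tabular value iteration.

    transitions[s][a] is a list of (prob, next_state, reward, done) tuples.
    A state with no actions is treated as terminal. This layout is a
    hypothetical one chosen for the sketch.
    """
    values = {s: 0.0 for s in transitions}
    while True:
        delta = 0.0
        for s, actions in transitions.items():
            if not actions:  # terminal state keeps value 0
                continue
            # Bellman optimality backup: best expected one-step return.
            best = max(
                sum(p * (r + gamma * (0.0 if done else values[s2]))
                    for p, s2, r, done in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < theta:
            return values


# Hypothetical chain 0 -> 1 -> 2, where reaching state 2 pays reward 1.
CHAIN = {
    0: {"left": [(1.0, 0, 0.0, False)], "right": [(1.0, 1, 0.0, False)]},
    1: {"left": [(1.0, 0, 0.0, False)], "right": [(1.0, 2, 1.0, True)]},
    2: {},
}

if __name__ == "__main__":
    print(value_iteration(CHAIN))
```

In this toy chain the state next to the goal converges to a value of 1, and the state one step further away to gamma times that value.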

Monte Carlo methods

  • Monte Carlo methods
  • Solving control tasks with Monte Carlo methods
  • On-policy Monte Carlo control
  • Setup - On-policy Monte Carlo control
  • Coding - On-policy Monte Carlo control 1
  • Coding - On-policy Monte Carlo control 2
  • Coding - On-policy Monte Carlo control 3
  • Setup - Constant alpha Monte Carlo
  • Coding - Constant alpha Monte Carlo
  • Off-policy Monte Carlo control
  • Setup - Off-policy Monte Carlo control
  • Coding - Off-policy Monte Carlo 1
  • Coding - Off-policy Monte Carlo 2
  • Coding - Off-policy Monte Carlo 3

Temporal difference methods

  • Temporal difference methods
  • Solving control tasks with temporal difference methods
  • Monte Carlo vs temporal difference methods
  • SARSA
  • Setup - SARSA
  • Coding - SARSA 1
  • Coding - SARSA 2
  • Q-Learning
  • Setup - Q-Learning
  • Coding - Q-Learning 1
  • Coding - Q-Learning 2
  • Advantages of temporal difference methods
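
The difference between the SARSA and Q-Learning lessons above comes down to the bootstrap target used in the update. The following is a minimal tabular sketch under assumed names (`sarsa_update`, `q_learning_update`, a dictionary-based Q-table); it is not the course's implementation.

```python
from collections import defaultdict


def sarsa_update(q, s, a, r, s2, a2, done, alpha=0.1, gamma=0.99):
    """On-policy update: bootstrap from the action a2 actually taken in s2."""
    target = r if done else r + gamma * q[(s2, a2)]
    q[(s, a)] += alpha * (target - q[(s, a)])


def q_learning_update(q, s, a, r, s2, actions, done, alpha=0.1, gamma=0.99):
    """Off-policy update: bootstrap from the greedy action in s2."""
    target = r if done else r + gamma * max(q[(s2, b)] for b in actions)
    q[(s, a)] += alpha * (target - q[(s, a)])


if __name__ == "__main__":
    # Hypothetical Q-table keyed by (state, action); unseen pairs default to 0.
    q = defaultdict(float)
    q[(1, 0)], q[(1, 1)] = 1.0, 2.0
    sarsa_update(q, 0, 0, 0.0, 1, 0, False, alpha=0.5, gamma=1.0)
    q_learning_update(q, 0, 1, 0.0, 1, [0, 1], False, alpha=0.5, gamma=1.0)
    print(q[(0, 0)], q[(0, 1)])
```

Both functions move the estimate a fraction alpha toward the target; only the choice of next-state value differs.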

N-step bootstrapping

  • N-step temporal difference methods
  • Where do n-step methods fit?
  • Effect of changing n
  • N-step SARSA
  • N-step SARSA in action
  • Setup - n-step SARSA
  • Coding - n-step SARSA

Continuous state spaces

  • Setup - Classic control tasks
  • Coding - Classic control tasks
  • Working with continuous state spaces
  • State aggregation
  • Setup - Continuous state spaces
  • Coding - State aggregation 1
  • Coding - State aggregation 2
  • Coding - State aggregation 3
  • Tile coding
  • Coding - Tile coding 1
  • Coding - Tile coding 2
  • Coding - Tile coding 3

Brief introduction to neural networks

  • Function approximators
  • Artificial Neural Networks
  • Artificial Neurons
  • How to represent a Neural Network
  • Stochastic Gradient Descent
  • Neural Network optimization

Deep SARSA

  • Deep SARSA
  • Neural Network optimization (Deep Q-Network)
  • Experience Replay
  • Target Network
  • Coding - Deep SARSA 1
  • Coding - Deep SARSA 2
  • Coding - Deep SARSA 3
  • Coding - Deep SARSA 4
  • Coding - Deep SARSA 5
  • Coding - Deep SARSA 6
  • Coding - Deep SARSA 7
  • Coding - Deep SARSA 8
  • Coding - Deep SARSA 9
  • Coding - Deep SARSA 10

Deep Q-Learning

  • Deep Q-Learning
  • Setup - Deep Q-Learning
  • Coding - Deep Q-Learning 1
  • Coding - Deep Q-Learning 2
  • Coding - Deep Q-Learning 3

REINFORCE

  • Policy gradient methods
  • Representing policies using neural networks
  • Policy performance
  • The policy gradient theorem
  • REINFORCE
  • Parallel learning
  • Entropy regularization
  • REINFORCE 2
  • Coding - REINFORCE 1
  • Coding - REINFORCE 2
  • Coding - REINFORCE 3
  • Coding - REINFORCE 4
  • Coding - REINFORCE 5

Advantage Actor-Critic (A2C)

  • A2C
  • Setup - A2C
  • Coding - A2C 1
  • Coding - A2C 2
  • Coding - A2C 3
  • Coding - A2C 4

Outro

  • Looking back
  • Next steps
