Skip to content

Latest commit

 

History

History
24 lines (16 loc) · 1.29 KB

README.md

File metadata and controls

24 lines (16 loc) · 1.29 KB

Reinforcement Learning: Practical Multi-Armed Bandit Algorithms in Python

This repository contains my code and notes from the Udemy course, "Reinforcement Learning: Practical Multi-Armed Bandit Algorithms in Python".

Course Overview

This course provides an introduction to the field of reinforcement learning with a focus on multi-armed bandit problems. The course covers the practical implementation of various algorithmic strategies for balancing between exploration and exploitation. By the end of the course, I was equipped with the knowledge and skills to build and deploy AI agents that can handle critical business operations under uncertainties.

Course Topics

The following topics were covered in this course:

  • Introduction to reinforcement learning
  • Multi-armed bandit problems
  • Epsilon Greedy algorithm
  • Softmax Exploration algorithm
  • Optimistic Initialization algorithm
  • Upper Confidence Bounds (UCB) algorithm
  • Thompson Sampling algorithm
  • Application of MAB algorithms in robotics using EV3 Mindstorm

Acknowledgements

I would like to thank the instructor of this course for providing clear and concise explanations of the concepts and algorithms covered. The course materials and code examples were instrumental in my learning of reinforcement learning and multi-armed bandit problems.