Reinforcement Learning: Practical Multi-Armed Bandit Algorithms in Python

This repository contains my code and notes from the Udemy course, "Reinforcement Learning: Practical Multi-Armed Bandit Algorithms in Python".

Course Overview

This course introduces reinforcement learning with a focus on multi-armed bandit problems. It covers the practical implementation of several algorithmic strategies for balancing exploration and exploitation. By the end of the course, I had the knowledge and skills to build and deploy AI agents that can handle critical business operations under uncertainty.
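
The exploration/exploitation trade-off can be illustrated with a minimal epsilon-greedy sketch on a simulated Bernoulli bandit. This is not code from the course materials; the function name, the arm probabilities, and the default parameters are all assumptions chosen for illustration.

```python
import random


def run_epsilon_greedy(true_probs, epsilon=0.1, steps=5000, seed=0):
    """Simulate an epsilon-greedy agent on a Bernoulli bandit.

    true_probs: hypothetical success probability of each arm (unknown to the agent).
    Returns (estimated value per arm, pull count per arm).
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    counts = [0] * n_arms      # how often each arm was pulled
    values = [0.0] * n_arms    # running mean reward per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate
            # (ties resolve to the lowest index).
            arm = max(range(n_arms), key=lambda a: values[a])

        # Simulated environment: reward 1 with probability true_probs[arm].
        reward = 1.0 if rng.random() < true_probs[arm] else 0.0

        # Incremental update of the sample mean.
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]

    return values, counts


if __name__ == "__main__":
    values, counts = run_epsilon_greedy([0.2, 0.5, 0.8])
    print("estimates:", [round(v, 2) for v in values])
    print("pulls:", counts)
```

Setting epsilon to 0 makes the agent purely greedy, so it can lock onto whichever arm it tries first. Setting it to 1 makes the agent purely exploratory. Values in between trade off the two behaviours.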

Course Topics

The following topics were covered in this course:

Introduction to reinforcement learning
Multi-armed bandit problems
Epsilon Greedy algorithm
Softmax Exploration algorithm
Optimistic Initialization algorithm
Upper Confidence Bounds (UCB) algorithm
Thompson Sampling algorithm
Application of MAB algorithms in robotics using LEGO Mindstorms EV3
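
To make one of the listed strategies concrete, here is a minimal sketch of Thompson Sampling for Bernoulli rewards, assuming a uniform Beta(1, 1) prior on each arm. It is an illustration under those assumptions rather than the course's implementation; the function name and arm probabilities are hypothetical.

```python
import random


def run_thompson_sampling(true_probs, steps=5000, seed=0):
    """Simulate Thompson Sampling on a Bernoulli bandit.

    true_probs: hypothetical success probability of each arm (unknown to the agent).
    Returns (success count per arm, failure count per arm).
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [0] * n_arms
    failures = [0] * n_arms

    for _ in range(steps):
        # Draw one sample per arm from its Beta posterior.
        # The +1 terms encode the assumed Beta(1, 1) prior.
        samples = [
            rng.betavariate(successes[a] + 1, failures[a] + 1)
            for a in range(n_arms)
        ]
        # Play the arm whose sampled success rate is highest.
        arm = max(range(n_arms), key=lambda a: samples[a])

        # Simulated environment: success with probability true_probs[arm].
        if rng.random() < true_probs[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1

    return successes, failures


if __name__ == "__main__":
    successes, failures = run_thompson_sampling([0.2, 0.5, 0.8])
    pulls = [s + f for s, f in zip(successes, failures)]
    print("pulls per arm:", pulls)
```

Unlike epsilon-greedy, there is no explicit exploration parameter here: arms with few observations have wide posteriors, so they are still occasionally sampled as the best arm and get tried.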

Acknowledgements

I would like to thank the instructor of this course for providing clear and concise explanations of the concepts and algorithms covered. The course materials and code examples were instrumental in my understanding of reinforcement learning and multi-armed bandit problems.
