connect4-AlphaZero

Connect 4 trained under a Reinforcement Learning paradigm, using MCTS + Deep Learning (AlphaGo Zero style), learning via pure self-play without feature engineering or expert input

Based on David Foster's code and model (https://github.com/AppliedDataSciencePartners/DeepReinforcementLearning)

Trained and tested under Windows 7 and Python 3.6

Feel free to clone and play a few games against the trained models (in the folder run0001, trained over 4 days using 1 Nvidia 1050 GPU) using the run.ipynb!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

connect4-AlphaZero

Files

README.md

Latest commit

History

README.md

File metadata and controls

connect4-AlphaZero