SimpleAlphaZero

A simplified implementation of DeepMind's AlphaZero, including Monte Carlo Tree Search, that can be used to train agents to play simple board games such as Connect 4, Tic-Tac-Toe, and Five in a Row (Gomoku).

Implementation

Connect 4 Board and Rules

Create a board
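
As an illustration of what a Connect 4 board with its rules might look like, here is a minimal sketch. The class name `Connect4Board`, its methods, and the 6x7 grid with players encoded as `1` and `-1` are assumptions made for this example, not necessarily the names or layout used in this repository.

```python
ROWS, COLS = 6, 7  # standard Connect 4 dimensions (assumed)


class Connect4Board:
    """Hypothetical board: 0 = empty, 1 and -1 = the two players."""

    def __init__(self):
        self.grid = [[0] * COLS for _ in range(ROWS)]
        self.player = 1  # player to move next

    def legal_moves(self):
        # A column is playable while its top cell is still empty.
        return [c for c in range(COLS) if self.grid[0][c] == 0]

    def play(self, col):
        if self.grid[0][col] != 0:
            raise ValueError(f"column {col} is full")
        # Drop the piece into the lowest empty row of the column.
        for r in range(ROWS - 1, -1, -1):
            if self.grid[r][col] == 0:
                self.grid[r][col] = self.player
                break
        self.player = -self.player

    def winner(self):
        """Return 1 or -1 if that player has four in a row, else 0."""
        directions = ((0, 1), (1, 0), (1, 1), (1, -1))
        for r in range(ROWS):
            for c in range(COLS):
                p = self.grid[r][c]
                if p == 0:
                    continue
                for dr, dc in directions:
                    end_r, end_c = r + 3 * dr, c + 3 * dc
                    if not (0 <= end_r < ROWS and 0 <= end_c < COLS):
                        continue
                    if all(self.grid[r + i * dr][c + i * dc] == p
                           for i in range(1, 4)):
                        return p
        return 0
```

A game is then played by calling `play(col)` with a column from `legal_moves()` until `winner()` is non-zero or no legal moves remain.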

Self Play

Use Monte Carlo Tree Search with self play
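
The self-play loop can be sketched as follows. To stay short and self-contained, this example uses Tic-Tac-Toe and plain UCT with random rollouts in place of AlphaZero's neural-network policy and value estimates, so it is a simplification and not the method used in this repository. All names (`Node`, `search`, `self_play_game`) are hypothetical.

```python
import math
import random

LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8), (0, 3, 6),
         (1, 4, 7), (2, 5, 8), (0, 4, 8), (2, 4, 6)]


def winner(board):
    """Return 1 or -1 if that player has three in a row, else 0."""
    for a, b, c in LINES:
        if board[a] != 0 and board[a] == board[b] == board[c]:
            return board[a]
    return 0


def legal_moves(board):
    return [i for i, v in enumerate(board) if v == 0]


def is_terminal(board):
    return winner(board) != 0 or not legal_moves(board)


def apply_move(board, move, player):
    cells = list(board)
    cells[move] = player
    return tuple(cells)


class Node:
    def __init__(self, board, player):
        self.board = board    # tuple of 9 ints
        self.player = player  # player to move at this node
        self.children = {}    # move -> Node
        self.visits = 0
        self.value_sum = 0.0  # from the view of the player who moved here


def rollout(board, player, rng):
    """Play random moves to the end and return the winner (1, -1, or 0)."""
    while not is_terminal(board):
        board = apply_move(board, rng.choice(legal_moves(board)), player)
        player = -player
    return winner(board)


def simulate(node, rng, c=1.4):
    """Run one simulation from `node` and return the game outcome."""
    if is_terminal(node.board):
        outcome = winner(node.board)
    elif node.visits == 0:
        outcome = rollout(node.board, node.player, rng)
    else:
        untried = [m for m in legal_moves(node.board)
                   if m not in node.children]
        if untried:  # expand one new child
            move = rng.choice(untried)
            child = Node(apply_move(node.board, move, node.player),
                         -node.player)
            node.children[move] = child
        else:        # all children visited: pick by the UCT score
            child = max(node.children.values(),
                        key=lambda ch: ch.value_sum / ch.visits
                        + c * math.sqrt(math.log(node.visits) / ch.visits))
        outcome = simulate(child, rng, c)
    node.visits += 1
    node.value_sum += outcome * (-node.player)
    return outcome


def search(board, player, n_sims, rng):
    """Return a move -> visit-fraction policy for the given position."""
    root = Node(board, player)
    for _ in range(n_sims):
        simulate(root, rng)
    total = sum(ch.visits for ch in root.children.values())
    return {m: ch.visits / total for m, ch in root.children.items()}


def self_play_game(n_sims=200, seed=0):
    """Play one game against itself; return (board, policy, value) examples."""
    rng = random.Random(seed)
    board, player, history = (0,) * 9, 1, []
    while not is_terminal(board):
        policy = search(board, player, n_sims, rng)
        history.append((board, player, policy))
        moves, weights = zip(*policy.items())
        move = rng.choices(moves, weights=weights)[0]
        board = apply_move(board, move, player)
        player = -player
    z = winner(board)
    # The value target is the final result from the mover's point of view.
    return [(b, pi, z * p) for b, p, pi in history]
```

Each returned example pairs a position with the search's visit distribution and the eventual game result. In an AlphaZero-style setup these would serve as training targets for the policy and value heads of the network.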

References: