In this project, you will implement value iteration and Q-learning. You will test your agents first on Gridworld (from class), then apply them to a simulated robot controller (Crawler) and Pacman.
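As a rough orientation for the two algorithms named above, here is a minimal sketch in Python. It does not use the project's own classes: the toy MDP, the `TRANSITIONS` layout, and the function names `value_iteration` and `q_learning_update` are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical toy MDP (not from the project): TRANSITIONS[state][action] is a
# list of (probability, next_state, reward) triples. "end" is terminal.
TRANSITIONS = {
    "start": {
        "safe": [(1.0, "end", 1.0)],
        "risky": [(0.5, "end", 4.0), (0.5, "end", 0.0)],
    },
    "end": {},
}


def value_iteration(transitions, gamma=0.9, iterations=50):
    """Batch value iteration: each sweep reads only the previous sweep's values."""
    values = {state: 0.0 for state in transitions}
    for _ in range(iterations):
        new_values = {}
        for state, actions in transitions.items():
            if not actions:
                # Terminal state: no actions, so its value stays 0.
                new_values[state] = 0.0
                continue
            # Bellman backup: best expected (reward + discounted next value).
            new_values[state] = max(
                sum(p * (r + gamma * values[s2]) for p, s2, r in outcomes)
                for outcomes in actions.values()
            )
        values = new_values
    return values


def q_learning_update(q, state, action, reward, next_state, next_actions,
                      alpha=0.5, gamma=0.9):
    """One tabular Q-learning step from a single observed transition."""
    # Value of the best action in the next state (0 if there are none).
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    sample = reward + gamma * best_next
    # Move the old estimate toward the new sample by learning rate alpha.
    q[(state, action)] = (1 - alpha) * q[(state, action)] + alpha * sample


values = value_iteration(TRANSITIONS)

q = defaultdict(float)  # unseen (state, action) pairs default to 0.0
q_learning_update(q, "start", "risky", 4.0, "end", [])
```

Value iteration needs the full transition model up front, while the Q-learning step only needs one observed `(state, action, reward, next_state)` sample, which is why the latter suits settings where the agent must learn by acting.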
atang020/reinforcement