Training a Pong AI with PyTorch in Google Colab

Overview

This notebook trains a Deep Q-Network (DQN) model using PyTorch to play the game Pong. The training process is designed to run in Google Colab and requires mounting Google Drive to save checkpoints and the final trained model.

Demo

Here is a demo of the trained Pong model (PongNoFrameskip-v4-model.pth, green) playing against the opponent AI provided by the Gym library (orange). The trained model wins 21 to 15.

Setup Instructions

Run in Google Colab
- Upload the notebook to your Google Drive.
- Open the notebook in Google Colab.
- Ensure your runtime is set to GPU for faster training.
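
To confirm that the GPU runtime is active before starting a long training run, a quick check along these lines may help. This is a minimal sketch, not code from the notebook: the helper name `pick_device` is hypothetical, and the `importlib` guard is only there so the snippet also runs on machines without PyTorch installed.

```python
import importlib.util


def pick_device():
    """Return 'cuda' when PyTorch sees a GPU, otherwise 'cpu'.

    Hypothetical helper; the notebook may select its device differently.
    """
    if importlib.util.find_spec("torch") is None:
        # PyTorch is preinstalled in Colab; this branch is for other setups.
        return "torch-not-installed"
    import torch
    return "cuda" if torch.cuda.is_available() else "cpu"


print(pick_device())
```

If this prints `cpu` in Colab, the runtime type is probably not set to GPU yet.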

Mount Google Drive
- The notebook saves and loads models from Google Drive. Run the following cell in the notebook to mount your drive:
```
from google.colab import drive
drive.mount('/content/drive')
```
- Modify the pthname variable to point to the location in your Google Drive of the partially trained (pretrained) model that training should start from.
- Modify the saved_path variable to point to the location in your Google Drive where the fully trained model should be stored.
- Modify the path in the line torch.save(model.state_dict(), f"/content/drive/My Drive/Colab_Notebooks/Assignments/Deep_Q_Network/saves_2/{frame_idx}_model.pth") to point to the folder in your Google Drive where the model checkpoints should be stored.
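
One way to keep these three paths consistent is to derive them from a single project folder. The sketch below is illustrative only: `PROJECT_DIR`, `CHECKPOINT_DIR`, and `checkpoint_path` are hypothetical names, and placing model_pretrained.pth and the final model directly in the project folder is an assumption. The folder itself is taken from the torch.save line above.

```python
import posixpath  # Colab runs on Linux, so forward-slash paths are used

# Assumed layout; change DRIVE_ROOT / PROJECT_DIR to match your own Drive.
DRIVE_ROOT = "/content/drive/My Drive"
PROJECT_DIR = posixpath.join(
    DRIVE_ROOT, "Colab_Notebooks", "Assignments", "Deep_Q_Network"
)

# Model to load at the start of training (location is an assumption).
pthname = posixpath.join(PROJECT_DIR, "model_pretrained.pth")

# Where the fully trained model is written (location is an assumption).
saved_path = posixpath.join(PROJECT_DIR, "PongNoFrameskip-v4-model.pth")

# Folder that receives the periodic checkpoints.
CHECKPOINT_DIR = posixpath.join(PROJECT_DIR, "saves_2")


def checkpoint_path(frame_idx):
    """Build the checkpoint file path for a given frame index."""
    return posixpath.join(CHECKPOINT_DIR, f"{frame_idx}_model.pth")


print(checkpoint_path(20000))
```

With this in place, the save call could read `torch.save(model.state_dict(), checkpoint_path(frame_idx))`, so only `PROJECT_DIR` needs editing when the files move.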

Training Process

The training process involves playing 1 million frames of Pong.
A partially trained model is saved every 20,000 frames.
If you want to continue training from a previous checkpoint, update the pthname variable to use the last saved model instead of model_pretrained.pth. Additionally, update the for loop in the training section to start from the correct frame count. For example, if resuming from 200,000 frames, modify:
```
for frame_idx in range(0, num_frames + 1):
```
to:
```
for frame_idx in range(200000, num_frames + 1):
```

Example:

pthname = '/content/drive/MyDrive/pong_model_checkpoint_200000.pth'  # Load partially trained model and resume training from 200K frames
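
To avoid editing the loop by hand on every resume, the starting frame could be read from the checkpoint filename. The following is a sketch under assumptions: the helper `start_frame_from` and its regular expression are hypothetical, and they accept both filename styles that appear in this README (`pong_model_checkpoint_<frames>.pth` and `<frames>_model.pth`). Any other filename, such as model_pretrained.pth, is treated as a start from frame 0.

```python
import re

# Matches ".../pong_model_checkpoint_200000.pth" or ".../200000_model.pth".
CHECKPOINT_RE = re.compile(
    r"(?:^|/)(?:pong_model_checkpoint_)?(\d+)(?:_model)?\.pth$"
)


def start_frame_from(pthname):
    """Return the frame count encoded in a checkpoint filename, else 0."""
    match = CHECKPOINT_RE.search(pthname)
    return int(match.group(1)) if match else 0


pthname = '/content/drive/MyDrive/pong_model_checkpoint_200000.pth'
num_frames = 1_000_000

start_frame = start_frame_from(pthname)
frame_range = range(start_frame, num_frames + 1)

# The training loop would then be: for frame_idx in frame_range: ...
print(start_frame, len(frame_range))
```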

Testing Your Model

The "Test Your Model" section generates an MP4 video showing the trained model playing Pong.
Ensure that the correct model file is loaded before running this section.
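
A small guard before the loading step can catch a wrong or missing path early, before the video is generated. This is a sketch only; `require_model_file` is a hypothetical helper and is not part of the notebook.

```python
import os


def require_model_file(path):
    """Return path if it is a non-empty file, otherwise raise an error."""
    if not os.path.isfile(path):
        raise FileNotFoundError(f"Model file not found: {path}")
    if os.path.getsize(path) == 0:
        # An empty file usually means an interrupted save or upload.
        raise ValueError(f"Model file is empty: {path}")
    return path
```

It could be called as `require_model_file(saved_path)` just before the model weights are loaded.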

Model Files

Partially trained models: Saved every 20K frames (e.g., pong_model_checkpoint_200000.pth).
Final trained model: The completely trained model is saved as PongNoFrameskip-v4-model.pth.
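
When many checkpoints accumulate, the most recent one has to be chosen by frame count rather than by filename order, because a plain string sort would place 1000000 before 200000. The sketch below assumes the `pong_model_checkpoint_<frames>.pth` naming from the example above; `latest_checkpoint` is a hypothetical helper, and the file list is made up for illustration (in practice it could come from `os.listdir`).

```python
import re

FINAL_MODEL_NAME = "PongNoFrameskip-v4-model.pth"
CHECKPOINT_RE = re.compile(r"^pong_model_checkpoint_(\d+)\.pth$")


def latest_checkpoint(filenames):
    """Return (frame_count, filename) for the newest checkpoint, or None."""
    best = None
    for name in filenames:
        match = CHECKPOINT_RE.match(name)
        if match is None:
            continue  # skips the final model and any unrelated files
        frames = int(match.group(1))
        if best is None or frames > best[0]:
            best = (frames, name)
    return best


# Illustrative folder contents, not real results.
files = [
    "pong_model_checkpoint_200000.pth",
    "pong_model_checkpoint_1000000.pth",
    "pong_model_checkpoint_40000.pth",
    FINAL_MODEL_NAME,
    "model_pretrained.pth",
]
print(latest_checkpoint(files))  # (1000000, 'pong_model_checkpoint_1000000.pth')
```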

Notes

Modify the path variables (pthname, saved_name) and the path in the line torch.save(model.state_dict(), f"/content/drive/My Drive/Colab_Notebooks/Assignments/Deep_Q_Network/saves_2/{frame_idx}_model.pth") as needed, depending on where the notebook and model files are stored in your Google Drive.
Training may take several hours depending on available GPU resources in Colab.
