sailbench

Sailbench is an end-to-end sailing physics simulator made by Cornell Autonomous Sailboat Team. It is to be used for testing and RL model training for autonomous sailboats.

First-Time Setup

sailbench uses the uv project manager for dependency and environment management. Follow the steps below to set up a local development environment.

Prerequisites

Python 3.12+
UV project manager

Setup Instructions

Install Python
If you don’t already have Python installed, download it from https://www.python.org/downloads/.
Install uv
Follow the installation instructions here: https://docs.astral.sh/uv/getting-started/installation/.
Install project dependencies

In the terminal, navigate to sailbench/

For users who just want to run the simulation, run:
```
uv sync
```
For developers (linters/type-checking), install the dev dependency group:
```
uv sync --group dev
```
If you’re working on RL training/evaluation, also install the rl group:
```
uv sync --group dev --group rl
```
[Optional but recommended] Setup VSCode Extensions
I would recommend utilizing VSCode for developing in this project. The two extensions to install are Ruff (Python formatter and linter) as well as MyPy (Python type checker). This will help keep consistent code quality and style across sailbench.

Running the Web Simulation

Once you've installed the packages, you can play sailbench with manual control with the following instructions.

Start the sim backend (from sailbench/):

uv run python -m sailbench.sim.web_runner --config basic_sailbot.yaml --fps 60

Start the frontend (in a separate terminal):
```
cd web
python -m http.server 8000
```
Open http://localhost:8000 in your browser.

Watch a trained RL policy in the web simulation

To run a trained RL model in sailbench, perform the following.

Start the backend with an RL checkpoint:

uv run python -m sailbench.sim.web_runner \
  --config basic_sailbot.yaml \
  --policy-model runs/<run_name>/best_model/best_model.zip \
  --policy-config configs/rl_waypoint_sb3.yaml

In a second terminal:
```
cd web
python -m http.server 8000
```
Open http://localhost:8000. The red/yellow marker shows the current waypoint.

RL Training (Gymnasium + SB3)

SailBench includes a Gymnasium continuous-control task (sailbench.rl.envs.WaypointEnv) and Stable-Baselines3 scripts for training/evaluating PPO on waypoint navigation.

Install the optional RL dependencies:

uv sync --group rl

Train (PPO)

Use the default waypoint RL config (configs/rl_waypoint_sb3.yaml):

uv run python scripts/train_waypoint_sb3.py --config configs/rl_waypoint_sb3.yaml

To watch training live in the web visualizer, enable the training websocket stream:

uv run python scripts/train_waypoint_sb3.py \
  --config configs/rl_waypoint_sb3.yaml \
  --watch-web

Then run the frontend in another terminal and open http://localhost:8000:

cd web
python -m http.server 8000

Notes:

The live stream is served on ws://127.0.0.1:8765/sim by default (same frontend URL as the normal backend).
Training visualization publishes from env-0 only, so it works with vectorized training (train.n_envs > 1).
You can reduce browser update load with --watch-stride <N> (or train.watch_stride in YAML).

To continue a stopped run from a checkpoint:

uv run python scripts/train_waypoint_sb3.py \
  --config configs/rl_waypoint_sb3.yaml \
  --resume-from runs/<run_name>/checkpoints/ppo_waypoint_<steps>_steps.zip

When --resume-from is provided, training continues from that model state and keeps timestep counting continuous.

Training outputs are written under runs/ by default:

runs/waypoint_ppo_<timestamp>/best_model/best_model.zip: best checkpoint per eval callback
runs/waypoint_ppo_<timestamp>/checkpoints/: periodic checkpoints
runs/waypoint_ppo_<timestamp>/final_model.zip: final model after training
runs/waypoint_ppo_<timestamp>/vecnormalize.pkl: VecNormalize stats (only if enabled via config)
runs/waypoint_ppo_<timestamp>/config_used.yaml: the exact config used for the run

Notes for resume:

Keep --config consistent with the original training setup (especially env settings and train.n_envs).
If normalization is enabled, the trainer automatically attempts to load checkpoint stats from the matching file .../<checkpoint_stem>_vecnormalize.pkl.

TensorBoard

If you keep train.tensorboard_log: runs/tensorboard (the default), you can launch TensorBoard with:

uv run tensorboard --logdir runs/tensorboard

When training, logs will be written into subdirectories under runs/tensorboard/, one per run. You can view your training progress, hyperparameters, and evaluation metrics in TensorBoard at http://localhost:6006 after launching the command above.

Evaluate a trained checkpoint

uv run python scripts/eval_waypoint_sb3.py \
  --config configs/rl_waypoint_sb3.yaml \
  --model runs/<run_name>/best_model/best_model.zip \
  --vecnormalize runs/<run_name>/vecnormalize.pkl

Notes:

Pass --vecnormalize only if your training run produced vecnormalize.pkl (i.e., you enabled observation/reward normalization in the training config).
eval_waypoint_sb3.py prints a JSON blob of summary metrics to stdout; use --output-json <path> to save them.

Name		Name	Last commit message	Last commit date
Latest commit History 124 Commits
configs		configs
runs		runs
sailbench		sailbench
scripts		scripts
test		test
web		web
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

sailbench

First-Time Setup

Prerequisites

Setup Instructions

Running the Web Simulation

Watch a trained RL policy in the web simulation

RL Training (Gymnasium + SB3)

Train (PPO)

TensorBoard

Evaluate a trained checkpoint

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

sailbench

First-Time Setup

Prerequisites

Setup Instructions

Running the Web Simulation

Watch a trained RL policy in the web simulation

RL Training (Gymnasium + SB3)

Train (PPO)

TensorBoard

Evaluate a trained checkpoint

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages