Dreamer-pixels lab: notebook execution + 3 asset figures by ChatGPU · Pull Request #20 · ChatGPU/Autonomous-Driving-Learning-Atlas

ChatGPU · 2026-05-27T17:57:30Z

Dreamer-pixels reproduction lab final run.

End-to-end notebook ~5 min 10 s on CPU. World-model pretraining ~70 s; full 8-cycle Dreamer loop ~232 s. Three assets generated:

reconstruction_grid.png — WM recovers cart/pole visual structure from cycle 1 (slight ghosting on frame 0 where h_0 = 0).
latent_vs_real_rollout.png — imagination tracks the real env for ~5 steps then drifts; the pixel-MSE subplot exhibits the ~2× jump that bounds the trustworthy imagination horizon.
return_vs_steps.png — return hovers near random baseline (~25 vs ~20). README documents this honestly: at the chosen CPU budget the imagination horizon is too short to credit-assign a balance policy, but the architecture is faithful to DreamerV1 (encoder + RSSM with deterministic h + stochastic Gaussian z + decoder + reward + continue heads, KL with balancing α = 0.8, λ-returns in latent imagination).

https://claude.ai/code/session_017Ez7KNKDCGRRLjEnJi9TW7

Generated by Claude Code

Trimming the world model's training context so the lab finishes on CPU well under the 8-minute ceiling without losing the latent imagination story. https://claude.ai/code/session_017Ez7KNKDCGRRLjEnJi9TW7

The Dreamer-pixels reproduction lab finished its full pipeline: - 12-cell notebook runs ~5 min 10 s on CPU. - World-model pretraining ~70 s; full Dreamer 8-cycle loop ~232 s. - assets/reconstruction_grid.png shows the WM recovers cart/pole visual structure from the first cycle onward (slight ghosting only on frame 0 where h_0 = 0). - assets/latent_vs_real_rollout.png shows the imagination tracks the real env for ~5 steps then drifts; the pixel-MSE subplot exhibits the ~2x jump that puts a hard cap on imagination depth. - assets/return_vs_steps.png hovers near the random baseline (~25 vs ~20). README documents this honestly: at the chosen CPU budget the imagination horizon is too short to credit-assign a balance policy, but the architecture is faithful to DreamerV1 (encoder + RSSM with det h + stochastic Gaussian z + decoder + reward + continue heads, KL with balancing alpha=0.8, lambda-returns in latent imagination). https://claude.ai/code/session_017Ez7KNKDCGRRLjEnJi9TW7

claude added 2 commits May 27, 2026 17:35

Dreamer: shrink batch_size 16->8 and seq_len 32->20

6eef119

Trimming the world model's training context so the lab finishes on CPU well under the 8-minute ceiling without losing the latent imagination story. https://claude.ai/code/session_017Ez7KNKDCGRRLjEnJi9TW7

ChatGPU merged commit a45e397 into main May 27, 2026

ChatGPU mentioned this pull request May 28, 2026

Revert "Merge pull request #21" — roll main back to the previous merge (a45e397) #22

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dreamer-pixels lab: notebook execution + 3 asset figures#20

Dreamer-pixels lab: notebook execution + 3 asset figures#20
ChatGPU merged 2 commits into
mainfrom
claude/epic-ritchie-A7YtN

ChatGPU commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ChatGPU commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants