Releases: Axect/pytorch_template
v0.3.0 — Dual Logging, TUI Monitor & Migration Skill
New Features
- Dual logging:
logging: wandb(default) orlogging: tuifor agent-friendly terminal output - CSVLoggingCallback (always active): writes
metrics.csvevery epoch with dynamic column expansion - TUILoggingCallback: structured per-epoch terminal output replacing W&B
- LatestModelCallback (always active): saves
latest_model.ptevery epoch - Rust TUI monitor (
tools/monitor/): real-time loss curve visualization frommetrics.csv - Provenance tracking:
env_snapshot.yaml+run_metadata.yamlper run doctorCLI command: system environment health checkmonitorCLI command: launch TUI monitor with auto-detection
Improvements
- Loss prediction: shifted exponential decay
L(t) = a·exp(-b·t) + c - Complete migration skill documentation (M1–M6) with template-clone approach
- Pre-push hook enforcing migration doc updates on source changes
Migration
Existing users can update their projects:
# Install the migration skill globally (once)
mkdir -p ~/.claude/skills
cp -r .claude/skills/pytorch-migrate ~/.claude/skills/
# In any pytorch_template-based project:
/pytorch-migrateFull changelog: v0.2.0...v0.3.0
v0.2.0 — Full Research Pipeline with AI Agent Skills
What's New
Pre-flight Check
Run 1 batch forward+backward before training to catch config errors in seconds:
python -m cli preflight configs/run_template.yaml --device cuda:0Detects shape mismatches, NaN/Inf gradients, scheduler param issues, and estimates GPU memory.
HPO Analysis
After HPO, understand what Optuna found:
python -m cli hpo-report --opt-config configs/my_opt.yamlShows parameter importance (fANOVA), boundary warnings, and top-K trial comparison.
Training Diagnostics
Two always-on callbacks logged to W&B:
- GradientMonitorCallback — exploding gradient detection
- OverfitDetectionCallback — train/val loss divergence warning
Pluggable Data Loading
New data field in config YAML points to any load_data() function via importlib:
data: recipes.regression.data.load_data3-Tier Config Validation
Structural → Runtime → Semantic (catches upper_bound < total_steps, negative lr, duplicate seeds, etc.)
AI Agent Skills
- pytorch-train — Guides the full experiment lifecycle
- pytorch-migrate — Detects project version and applies migrations
Documentation
Rewritten as "Human Skill Guide" — 5 chapters. The human counterpart to the AI agent's SKILL.md.
Migration
Existing users: add data: util.load_data to YAML configs, or run /pytorch-migrate with Claude Code.
No breaking changes — the data field defaults to util.load_data.