Tip
Any agent can follow a plan. The hard part is writing steps that don't assume the agent read the ones before. Blueprint generates plans from a one-line objective — each step carries enough context to be executed cold, by a different agent in a different session.
Built for developers working on projects too large for a single agent session — multi-step refactors, plugin extractions, architecture migrations, phased feature rollouts. The generated plans are plain Markdown files that any agent can read and execute.
A Markdown plan file designed so that a fresh agent in a new session can pick up any step and execute it without reading the rest of the plan:
- Steps — each one-PR sized, with task list, rollback strategy, verification commands, and exit criteria
- Dependency graph — which steps can run in parallel, which must be serial
- Design decisions — rationale for key choices, locked against re-litigation
- Invariants — properties verified after every step (build passes, no regressions)
- Progress log — single source of truth for execution state across sessions
- Review log — findings from an independent review that stress-tests the plan for gaps and flaws
Example: one step from a 9-step plugin extraction plan
### Step 04: Redirect core scheduler calls through plugin registry [x]
**Branch**: `taskflow-step-04-redirect-calls`
**Size**: M
**Isolation**: worktree
**Agent**: strongest
**Context** (cold-start brief):
The core currently calls the scheduler directly via
`import { Scheduler } from './scheduler/index.js'` in 3 files.
The plugin registry now has a `scheduler` slot with `getScheduler()`.
This step replaces all direct imports with registry calls — the critical
bridge that decouples core from the scheduler implementation.
**Tasks**:
1. [guided] Find all files importing from `./scheduler/`
2. [guided] Replace direct imports with `getScheduler()` + descriptive error
3. [guided] Update bootstrap to warn if no scheduler is registered
4. [guided] Update tests to register a mock scheduler in beforeEach
5. [exact] Run: `pnpm -r build && pnpm -r test && pnpm -r typecheck`
**Rollback**: Discard worktree. Main tree is unaffected.
**Verification**:
- [ ] `pnpm -r build && pnpm -r test` — all tests pass
- [ ] `grep -r "from.*scheduler" packages/core/src/ | wc -l` — returns 0
**Exit criteria**: No file in core directly imports from `./scheduler/`.
All tests pass with mock scheduler registration.The Context field is the key — it tells a cold-start agent exactly what state the codebase is in and what this step does, without reading prior steps. Task markers: [exact] = run this command verbatim; [guided] = agent decides the approach; [open] = goal only, full autonomy.
Example: dependency graph with parallel groups
Step 01 (interface)
├──→ Step 02 (registry slot)
│ └──→ Step 04 (redirect calls) ──→ Step 05a (move files)
│ └──→ Step 05b (tests)
└──→ Step 03 (package scaffold) ──────────────→ Step 05a
Parallelizable:
Group A (after Step 01): Step 02 and Step 03 — no shared files
Full examples: small-plan.md (3 steps, direct workflow) · large-plan.md (9 steps, branch workflow with step split)
mkdir -p ~/.claude/skills
git clone https://github.com/antbotlab/blueprint.git ~/.claude/skills/blueprintClaude Code discovers the skill automatically — /blueprint is ready to use.
The generated plans are standard Markdown files. Any agent that can read files can execute a Blueprint plan. Point the agent to SKILL.md as a system prompt or instruction file, and it will know how to generate and execute plans.
Claude Code for the /blueprint slash command. Git and GitHub CLI unlock the full branch/PR/CI workflow, but Blueprint works without them — it detects their absence and automatically switches to direct mode.
/blueprint myapp "migrate database to PostgreSQL"
This scans your codebase, designs a step-by-step plan, reviews it with a strongest-model sub-agent (falls back to default model if unavailable), and saves it to plans/myapp-migrate-database-to-postgresql.md.
Cold-start execution — Every step includes a self-contained context brief. A fresh agent in a new session can execute any step without reading prior steps or conversation history.
Cross-session resumption — Plans track execution state in a Progress Log. A new session reads the log, checks branch/file state, and picks up exactly where the last session stopped.
Living documents — Plans mutate during execution. Steps can be split, inserted, skipped, reordered, or abandoned — all with formal protocols and an audit trail.
Adversarial review — Every plan is reviewed by a strongest-model sub-agent (e.g., Opus) against a checklist covering completeness, dependency correctness, cold-start viability, anti-pattern detection, and more. Falls back to the default model if the strongest model is unavailable.
Parallel step detection — The dependency graph identifies steps with no shared files or output dependencies, so they can run concurrently.
Graceful degradation — Strongest model unavailable? Falls back to default. No GitHub CLI? Switches to direct mode. No project memory? Relies on codebase scanning. Blueprint adapts instead of failing.
Claude Code's built-in /plan works well for straightforward tasks. Persistent-planning tools (like planning-with-files) solve context loss across sessions by writing state to disk. Blueprint is designed for a different problem: multi-session, multi-agent projects where each step must be independently executable by a fresh agent that has never seen the conversation history.
/plan |
Persistent planners | /blueprint |
|
|---|---|---|---|
| Zero setup | Yes — built in | Install required | Install required |
| Persisted plan file | Yes — ~/.claude/plans/ |
Yes — markdown files | Yes — project plans/ directory |
| Cold-start steps | No — shared context | Partial — session-catchup scripts | Yes — each step has a self-contained context brief |
| Adversarial review | No | No | Yes — strongest-model review gate |
| Branch/PR/CI workflow | No | No | Yes — built into every step |
| Dependency graph | No | No | Yes — parallel groups and serial constraints |
| Plan mutation protocol | No | No | Yes — split, insert, skip, reorder, abandon |
| Model tier assignment | No | No | Yes — strongest vs default per step |
| Anti-pattern detection | No | No | Yes — catalog scanned before finalization |
| Autonomy boundaries | No | No | Yes — exact, guided, and open task markers |
| Zero runtime risk | Yes | Hooks execute on every tool call | Yes — pure markdown, no hooks, no scripts |
/plan— single-session tasks, quick prototyping, exploratory work.- Persistent planners — repeated context loss is the bottleneck and you want automatic session recovery.
/blueprint— the project is too large for one session, involves multiple agents or team members, and each step must be executable without reading the rest of the plan.
Blueprint generates standard Agent Skills format (SKILL.md). The generated plans are plain Markdown — any agent that can read files can execute them:
- Claude Code — native
/blueprintslash command - Cursor, GitHub Copilot, VS Code — point to
SKILL.mdas instruction file - OpenAI Codex CLI — load as system prompt
- Gemini CLI — include in context
- OpenClaw — add to workspace skills directory
- Any SKILL.md-compatible agent — 30+ tools support the format
Blueprint is a pure-instruction skill — it contains no executable code, no hooks, no shell scripts, and no runtime dependencies. The entire skill is markdown files that tell the agent what to do.
This matters because 13.4% of public agent skills contain serious vulnerabilities (Snyk, February 2026). Blueprint avoids this attack surface entirely:
- No
PreToolUse/PostToolUsehooks that execute on every tool call - No shell scripts that could be modified or injected
- No network calls, no file system writes outside the plan file
- No dependencies to supply-chain-attack
What you install is what you read — markdown instructions, reference templates, and examples.
- Research — Scans the codebase, reads project memory if available, runs pre-flight checks (git repo, gh auth, default branch).
- Design — Breaks the objective into one-PR-sized steps, identifies parallelism, assigns model tiers (strongest for architecture, default for implementation), assigns isolation strategy (main-tree preferred; worktree only for destructive or exploratory steps).
- Draft — Generates the plan from a structured template. Includes branch workflow rules, CI policy, invariants, and rollback strategies — all inline, fully self-contained.
- Review — Delegates adversarial review to a strongest-model sub-agent. Issues are fixed and re-reviewed until the plan passes.
- Register — Saves the plan and updates project memory.
blueprint/
├── SKILL.md Main skill definition
├── references/
│ ├── plan-template.md Full plan template (branch workflow)
│ ├── plan-template-light.md Lightweight template (direct workflow)
│ ├── review-checklist.md Phase 4 review checklist
│ └── anti-patterns.md Planning anti-patterns & rebuttals
├── workflows/
│ ├── plan-mutation.md Protocol for modifying plans mid-execution
│ └── resumption.md How to resume a partially-executed plan
└── examples/
├── small-plan.md 3-step direct-workflow example
└── large-plan.md 9-step branch-workflow example
Bug reports and feature requests: GitHub Issues.
Pull requests welcome — please open an issue first to discuss non-trivial changes.
MIT © 2026 antbotlab
