Skip to content

antbotlab/blueprint

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Blueprint — plans for coding agents

Tip

Any agent can follow a plan. The hard part is writing steps that don't assume the agent read the ones before. Blueprint generates plans from a one-line objective — each step carries enough context to be executed cold, by a different agent in a different session.

License: MIT

Built for developers working on projects too large for a single agent session — multi-step refactors, plugin extractions, architecture migrations, phased feature rollouts. The generated plans are plain Markdown files that any agent can read and execute.

What You Get

A Markdown plan file designed so that a fresh agent in a new session can pick up any step and execute it without reading the rest of the plan:

  • Steps — each one-PR sized, with task list, rollback strategy, verification commands, and exit criteria
  • Dependency graph — which steps can run in parallel, which must be serial
  • Design decisions — rationale for key choices, locked against re-litigation
  • Invariants — properties verified after every step (build passes, no regressions)
  • Progress log — single source of truth for execution state across sessions
  • Review log — findings from an independent review that stress-tests the plan for gaps and flaws
Example: one step from a 9-step plugin extraction plan
### Step 04: Redirect core scheduler calls through plugin registry [x]

**Branch**: `taskflow-step-04-redirect-calls`
**Size**: M
**Isolation**: worktree
**Agent**: strongest

**Context** (cold-start brief):
The core currently calls the scheduler directly via
`import { Scheduler } from './scheduler/index.js'` in 3 files.
The plugin registry now has a `scheduler` slot with `getScheduler()`.
This step replaces all direct imports with registry calls — the critical
bridge that decouples core from the scheduler implementation.

**Tasks**:
1. [guided] Find all files importing from `./scheduler/`
2. [guided] Replace direct imports with `getScheduler()` + descriptive error
3. [guided] Update bootstrap to warn if no scheduler is registered
4. [guided] Update tests to register a mock scheduler in beforeEach
5. [exact] Run: `pnpm -r build && pnpm -r test && pnpm -r typecheck`

**Rollback**: Discard worktree. Main tree is unaffected.

**Verification**:
- [ ] `pnpm -r build && pnpm -r test` — all tests pass
- [ ] `grep -r "from.*scheduler" packages/core/src/ | wc -l` — returns 0

**Exit criteria**: No file in core directly imports from `./scheduler/`.
All tests pass with mock scheduler registration.

The Context field is the key — it tells a cold-start agent exactly what state the codebase is in and what this step does, without reading prior steps. Task markers: [exact] = run this command verbatim; [guided] = agent decides the approach; [open] = goal only, full autonomy.

Example: dependency graph with parallel groups
Step 01 (interface)
  ├──→ Step 02 (registry slot)
  │      └──→ Step 04 (redirect calls) ──→ Step 05a (move files)
  │                                              └──→ Step 05b (tests)
  └──→ Step 03 (package scaffold) ──────────────→ Step 05a

Parallelizable:
  Group A (after Step 01): Step 02 and Step 03 — no shared files

Full examples: small-plan.md (3 steps, direct workflow) · large-plan.md (9 steps, branch workflow with step split)

Installation

Claude Code

mkdir -p ~/.claude/skills
git clone https://github.com/antbotlab/blueprint.git ~/.claude/skills/blueprint

Claude Code discovers the skill automatically — /blueprint is ready to use.

Other agents

The generated plans are standard Markdown files. Any agent that can read files can execute a Blueprint plan. Point the agent to SKILL.md as a system prompt or instruction file, and it will know how to generate and execute plans.

Requirements

Claude Code for the /blueprint slash command. Git and GitHub CLI unlock the full branch/PR/CI workflow, but Blueprint works without them — it detects their absence and automatically switches to direct mode.

Quick Start

/blueprint myapp "migrate database to PostgreSQL"

This scans your codebase, designs a step-by-step plan, reviews it with a strongest-model sub-agent (falls back to default model if unavailable), and saves it to plans/myapp-migrate-database-to-postgresql.md.

Key Features

Cold-start execution — Every step includes a self-contained context brief. A fresh agent in a new session can execute any step without reading prior steps or conversation history.

Cross-session resumption — Plans track execution state in a Progress Log. A new session reads the log, checks branch/file state, and picks up exactly where the last session stopped.

Living documents — Plans mutate during execution. Steps can be split, inserted, skipped, reordered, or abandoned — all with formal protocols and an audit trail.

Adversarial review — Every plan is reviewed by a strongest-model sub-agent (e.g., Opus) against a checklist covering completeness, dependency correctness, cold-start viability, anti-pattern detection, and more. Falls back to the default model if the strongest model is unavailable.

Parallel step detection — The dependency graph identifies steps with no shared files or output dependencies, so they can run concurrently.

Graceful degradation — Strongest model unavailable? Falls back to default. No GitHub CLI? Switches to direct mode. No project memory? Relies on codebase scanning. Blueprint adapts instead of failing.

Comparison

Claude Code's built-in /plan works well for straightforward tasks. Persistent-planning tools (like planning-with-files) solve context loss across sessions by writing state to disk. Blueprint is designed for a different problem: multi-session, multi-agent projects where each step must be independently executable by a fresh agent that has never seen the conversation history.

/plan Persistent planners /blueprint
Zero setup Yes — built in Install required Install required
Persisted plan file Yes — ~/.claude/plans/ Yes — markdown files Yes — project plans/ directory
Cold-start steps No — shared context Partial — session-catchup scripts Yes — each step has a self-contained context brief
Adversarial review No No Yes — strongest-model review gate
Branch/PR/CI workflow No No Yes — built into every step
Dependency graph No No Yes — parallel groups and serial constraints
Plan mutation protocol No No Yes — split, insert, skip, reorder, abandon
Model tier assignment No No Yes — strongest vs default per step
Anti-pattern detection No No Yes — catalog scanned before finalization
Autonomy boundaries No No Yes — exact, guided, and open task markers
Zero runtime risk Yes Hooks execute on every tool call Yes — pure markdown, no hooks, no scripts

When to use what

  • /plan — single-session tasks, quick prototyping, exploratory work.
  • Persistent planners — repeated context loss is the bottleneck and you want automatic session recovery.
  • /blueprint — the project is too large for one session, involves multiple agents or team members, and each step must be executable without reading the rest of the plan.

Cross-Platform Compatibility

Blueprint generates standard Agent Skills format (SKILL.md). The generated plans are plain Markdown — any agent that can read files can execute them:

  • Claude Code — native /blueprint slash command
  • Cursor, GitHub Copilot, VS Code — point to SKILL.md as instruction file
  • OpenAI Codex CLI — load as system prompt
  • Gemini CLI — include in context
  • OpenClaw — add to workspace skills directory
  • Any SKILL.md-compatible agent30+ tools support the format

Security

Blueprint is a pure-instruction skill — it contains no executable code, no hooks, no shell scripts, and no runtime dependencies. The entire skill is markdown files that tell the agent what to do.

This matters because 13.4% of public agent skills contain serious vulnerabilities (Snyk, February 2026). Blueprint avoids this attack surface entirely:

  • No PreToolUse / PostToolUse hooks that execute on every tool call
  • No shell scripts that could be modified or injected
  • No network calls, no file system writes outside the plan file
  • No dependencies to supply-chain-attack

What you install is what you read — markdown instructions, reference templates, and examples.

How It Works

  1. Research — Scans the codebase, reads project memory if available, runs pre-flight checks (git repo, gh auth, default branch).
  2. Design — Breaks the objective into one-PR-sized steps, identifies parallelism, assigns model tiers (strongest for architecture, default for implementation), assigns isolation strategy (main-tree preferred; worktree only for destructive or exploratory steps).
  3. Draft — Generates the plan from a structured template. Includes branch workflow rules, CI policy, invariants, and rollback strategies — all inline, fully self-contained.
  4. Review — Delegates adversarial review to a strongest-model sub-agent. Issues are fixed and re-reviewed until the plan passes.
  5. Register — Saves the plan and updates project memory.

Directory Structure

blueprint/
├── SKILL.md                     Main skill definition
├── references/
│   ├── plan-template.md         Full plan template (branch workflow)
│   ├── plan-template-light.md   Lightweight template (direct workflow)
│   ├── review-checklist.md      Phase 4 review checklist
│   └── anti-patterns.md         Planning anti-patterns & rebuttals
├── workflows/
│   ├── plan-mutation.md         Protocol for modifying plans mid-execution
│   └── resumption.md            How to resume a partially-executed plan
└── examples/
    ├── small-plan.md            3-step direct-workflow example
    └── large-plan.md            9-step branch-workflow example

Contributing

Bug reports and feature requests: GitHub Issues.

Pull requests welcome — please open an issue first to discuss non-trivial changes.

License

MIT © 2026 antbotlab

About

Turn a one-line objective into a step-by-step plan any coding agent can execute cold — each step carries its own context, no prior session needed.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors