AI Workbench

Reusable skills, harnesses, agent patterns, frameworks, and resources for practical agentic AI work.
A selective working collection for keeping agents scoped, verifiable, and useful outside one private workspace.

Status: AI Workbench is a public, evolving collection. The artifacts are usable and Apache-2.0-licensed, but this is not a packaged product; individual skills, tools, and examples may change as the patterns mature.

Built and maintained by Jarel Remick.

What is AI Workbench?

AI Workbench collects reusable AI operating artifacts: agent skills, project harnesses, workflow patterns, model-council tools, adoption frameworks, starter kits, diagrams, fixtures, and public-safe examples.

Most of it came from repeated use: keeping agents scoped, designing project harnesses, managing context, choosing verification paths, and moving deterministic work out of prompts and into code or checks.

This is a selective working collection, not a prompt dump.

Why this exists

Reusable agent work - turn repeated operating patterns into skills, templates, checks, and small tools.
Public-safe examples - keep the reusable pattern while removing private workspace details, secrets, local paths, and raw session history.
Verification-first habits - pair agent workflows with the smallest credible proof path: evals, validators, read-backs, fixtures, screenshots, or documented manual checks.
Higher-level project harnesses - make broad agentic work concrete enough to start, delegate, inspect, and finish.

Quick start

Clone the repo and start from the catalog that matches what you want to adapt:

git clone https://github.com/jremick/ai-workbench.git
cd ai-workbench

# Optional sanity checks for the package families with validators.
python3 scripts/validate_model_council_package.py
python3 scripts/validate_model_manager_package.py

Then browse by category:

Group	What's in it	Where to look
Frameworks	Models and worksheets for thinking about AI adoption, maturity, and operating constraints.	SMB AI Maturity Model
Patterns	Reusable workflow shapes for splitting, routing, verifying, and repeating agent work.	Agent Workflow Patterns
Skills	Reusable instructions for recurring agent work: writing, triage, diagramming, auth handling, MCP work, model routing, model councils, research, and context boundaries.	skills and docs/skills.md
Harnesses	Operating patterns for starting projects, composing nested work, routing verification, and keeping larger agent tasks coherent.	harness-first-project-coach, project-harness-designer, harness-composer, verification-harness-router
Agents and plugins	Patterns for delegation, sidecar agents, MCP servers, tool boundaries, and context packets.	nested-agent-orchestrator, mcp-build, context-boundary-designer
Benchmarks	Dataset prep and scoring harnesses for evaluating skills and agent workflows.	Model Council DRACO Benchmark
Resources	Starter kits, examples, diagrams, eval fixtures, and reference docs that make the patterns easier to adapt.	AGENTS example, Codex sync workflow, resources

Most artifacts have their own README with usage notes, examples, and the smallest useful check or fixture.

Notable

Project Harness Designer

Project Harness Designer turns a fuzzy project start into a compact operating frame: intent, success evidence, risks, work mode, verification loop, and first path. It is the pattern I reach for when a request is bigger than a single edit but does not need heavyweight project planning.

Harness-First Project Coach

Harness-First Project Coach is the earlier coaching layer for substantial starts. It clarifies material questions, reframes the goal, maps support skills, defines context boundaries, and chooses the first evidence-backed lane before implementation.

Agent Memory

Agent Memory Starter is a source-backed memory pattern for agents. It uses curated pages, timeline evidence, searchable chunks, update proposals, fake fixtures, and a retrieval eval so memory can be inspected and tested instead of becoming a transcript pile.

Model Council and Deep Research

Model Council runs independent model workers and a separate synthesis pass, with local CLI routes for Codex, Claude Code, Antigravity, and Grok Build plus a Vercel AI Gateway option. Deep Research keeps source-backed research disciplined and escalates difficult synthesis to the council pattern. The companion runner supports dry-run planning, manifests, and route validation. Model Council DRACO Benchmark is a separate benchmark package for evaluating the council skill.

Model Manager

Model Manager is a public-alpha skill and deterministic CLI for choosing when model delegation is worthwhile, selecting a role-stack route, and preserving parent-owned execution. It includes sanitized benchmark-derived recommendation values, Artificial Analysis attribution notes, DeepSWE-aware long-horizon coding policy, evals, tests, and a package validator.

War Council

War Council is a decision harness for uncomfortable tradeoffs. It uses advisor personas, weighted scoring, forced $100 allocation, and a deterministic aggregate script to preserve agreements, disagreements, risks, kill criteria, and the final decision ledger.

Meta-Harnesses

The meta-harness pieces are for shaping larger agent workflows: Harness Composer for parent and child workstreams, Nested Agent Orchestrator for delegation, Verification Harness Router for choosing checks, and Context Boundary Designer for deciding what context belongs where.

Agent Workflow Patterns

Agent Workflow Patterns is a diagram-backed catalog for choosing classify-and-act, fan-out-and-synthesize, adversarial verification, generate-and-filter, tournament, loop-until-done, and quarantine-and-act workflows.

Deterministic Controls

Deterministic Controls helps decide when model judgment is the wrong tool. It pushes exact formats, permission gates, routing, retries, release checks, and auditability into schemas, state machines, validators, tests, or other deterministic controls.

Codex Operating Resources

AGENTS example is a cleaned-up global instruction template for pragmatic coding-agent defaults. Codex sync workflow covers the live-home versus versioned-mirror pattern for keeping reusable Codex instructions, skills, agents, config templates, and setup scripts aligned across machines.

Documentation

Skills catalog - installable public skills and starting points.
Patterns - reusable workflow shapes.
Resources - starter kits, templates, and reference material.
Model Council and Deep Research - council workflow, routing, and benchmark notes.
Model Manager - role-based model routing, benchmark-aware policy, and validation commands.

Community and support

Issues - bugs, broken links, unclear docs, and concrete improvement ideas.
Contributing - how to propose public-safe changes.
Security policy - how to report private or sensitive findings.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github		.github
assets		assets
benchmarks		benchmarks
docs		docs
frameworks/smb-ai-maturity-model		frameworks/smb-ai-maturity-model
patterns		patterns
resources		resources
scripts		scripts
skills		skills
tools		tools
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Workbench

What is AI Workbench?

Why this exists

Quick start

Notable

Project Harness Designer

Harness-First Project Coach

Agent Memory

Model Council and Deep Research

Model Manager

War Council

Meta-Harnesses

Agent Workflow Patterns

Deterministic Controls

Codex Operating Resources

Documentation

Community and support

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Workbench

What is AI Workbench?

Why this exists

Quick start

Notable

Project Harness Designer

Harness-First Project Coach

Agent Memory

Model Council and Deep Research

Model Manager

War Council

Meta-Harnesses

Agent Workflow Patterns

Deterministic Controls

Codex Operating Resources

Documentation

Community and support

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages