ringi-driven-harness

A human-at-the-gates harness for Claude. Coding agents build production software — beside you, while you're away, or while you sleep — and every change converges on one gate where a human keeps final control over what ships. As agent fleets get cheap, this governance layer is the scarce part.

English · 日本語

See the visual tour → — the whole harness at a glance. For depth, read the methodology.

What this is

ringi-driven-harness is a methodology — a user harness for building real software with Claude Code. A single developer orchestrates autonomous coding agents through GitHub, with a human approval gate (稟議, ringi) in front of every change that reaches main. It is the asynchronous evolution of ringi-driven-dev: v1 sat you at the keyboard; v2 runs the work while you are away and shows you the results to approve.

Why it exists

Coding agents can build entire features on their own — and, left unsupervised, drift from intent, mistake the scope, and ship UI that does not work. The question is no longer can the agent build it, but how a human stays in control without sitting in the loop all day. A harness answers that: the human designs and approves; the cloud builds and verifies. As agents get cheaper, the fleet stops being the constraint — what stays scarce is governance: an independent check on the work, a human gate on what merges, and decisions that hold across sessions. This harness is that layer, and it rides on top of any way the agent runs — whether you drive it yourself, dispatch it remotely, or let it work on a schedule.

The workflow

Four independent Claude sessions touch each change — the one that designs, the one that builds, the one that reviews, the one that evaluates — and none of them sees the others' conclusions. That mutual blindness is the cross-check.

The merge gate is the heart of the harness: everything above it can be automated, but whether a change ships is one human click. That click sits behind an automated gate: tests and a secret scan run as required checks, so a change that is not green never reaches it. That is human at the gates, not in the loop.

The diagram above is the steady-state path, where the build runs in the cloud while you are away. But the human can stand in different places without moving the gate. The work runs in one of three modes — synchronous (you drive the agent at the terminal), remote dispatch (a bounded task you hand off to run while you are away), or autonomous (a schedule or trigger builds and opens a pull request, then stops) — and all three converge on the same gate.

The mode changes where the human stands and how the work is triggered; it does not change what ships or who approves it. For the full treatment, see the methodology.

Repository layout

docs/        Methodology, written as a user harness
diagrams/    SVG visualizations of the workflow
templates/   Reusable harness components (CLAUDE.md, issue template, CI)

Lineage

v1 ringi-driven-dev — synchronous. Human at the keyboard.
v2 ringi-driven-harness — asynchronous harness. Autonomous builds that open pull requests for review.

Status

Work in progress. This repository is built using the workflow it describes.

Related work

This harness draws on written practice from the broader Claude Code / agentic-development community, adapted to its own human-at-the-gates philosophy:

andrej-karpathy-skills — the implementer-discipline principles (think before coding, simplicity, surgical changes, goal-driven execution) build on its distillation of common LLM coding pitfalls.
ECC — grading review findings by severity (what blocks vs. what merely informs) is adapted from its security-review and verification practice; its end-to-end testing agent also informs the stance that a runnable surface is gated by running it, not by reading it; its verification loop and eval practice inform the staged build/type/test checks and the pass@k grading of model output.
gstack — the "the AI proposes, the human decides" framing of bounded autonomy echoes its builder ethos; its QA agent that drives a real browser and turns each bug it finds into a regression test is the prior art for putting eyes on the running UI before the human does; its debugging and review passes inform the root-cause-first and survive-a-green-build principles.
Anthropic's guide to evaluating AI agents — the LLM-as-judge discipline in Quality control (calibrating a judge against human-labeled examples, an explicit "unknown" verdict, one rubric-graded dimension at a time) follows its guidance.

Where those projects largely automate the review gates, this harness keeps a human at the final gate by design — the ringi stamp. The pieces are borrowed as written norms only; none of their runtime systems (skill packs, hooks, agents) are vendored here. The three open-source projects above are MIT-licensed; the principles above are adapted in our own words, not copied.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.github/workflows		.github/workflows
diagrams		diagrams
docs		docs
scripts		scripts
templates		templates
.gitattributes		.gitattributes
.gitignore		.gitignore
.nojekyll		.nojekyll
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.ja.md		README.ja.md
README.md		README.md
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ringi-driven-harness

What this is

Why it exists

The workflow

Repository layout

Lineage

Status

Related work

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ringi-driven-harness

What this is

Why it exists

The workflow

Repository layout

Lineage

Status

Related work

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages