A Local Mission Runtime for Coding Agents

A Local Mission Runtime for Coding Agents

Your coding agent stops starting blind.
Local-first. MCP-native. Graph memory, trust, and change reasoning for agent hosts.

m1nd is a local mission runtime for coding agents — it governs the operating loop, not just retrieval.

grep finds text. Vector search finds similar chunks. m1nd gives agents a local graph of what connects, what changed, what breaks, what drifted, and where to resume.

Three things here exist together in no other tool:

Causal code graph — impact before you edit shows the blast radius you didn't read; ghost_edges surfaces files that always change together but share no import.
Self-verifying memory — memorize anchors findings to real code nodes; cross_verify flags them stale when that code changes.
A trust / recovery layer — every result carries a trust mode; trust_selftest and recovery_playbook tell the agent when the workspace binding is wrong and how to recover.

Quick Start

The minimal happy path — install from source (always current), check health, wire your host:

git clone https://github.com/maxkle1nz/m1nd.git && cd m1nd
npm install -g .
m1nd doctor
m1nd install-skills codex          # or: claude / gemini / antigravity / generic
m1nd mcp-config codex --project /your/project

Or from the npm beta channel: npm install -g @maxkle1nz/m1nd@beta.

Full install map, host packs, native runtime build, and update flags: docs/AGENT-PACKS.md · client-by-client setup: integration matrix.

Agent Entry Point

Agents parse this README. When the host MCP session is stale, bound to the wrong repo, or not loaded yet, use the host-neutral CLI — it launches an isolated runtime, binds it to the repo, and returns one machine-readable envelope:

m1nd agent first-minute --repo /your/project --query "understand this system" --json

m1nd agent first-minute is the safest first contact for a new repo. It scopes the repo, establishes trust, ingests if needed, runs one bounded orientation pass, returns candidate anchors, and then tells the agent to prove directly from source, tests, compiler/runtime output, logs, or probes.

Inside an MCP session, the doctrine is this trust loop — establish trust before believing any retrieval:

// 0. Trust the binding in one call (verdict before retrieval)
{"method":"tools/call","params":{"name":"trust_selftest","arguments":{"agent_id":"dev"}}}

// 1. If the verdict is not full_trust, ask for the deterministic recovery path
{"method":"tools/call","params":{"name":"recovery_playbook","arguments":{"agent_id":"dev"}}}

// 2. Build graph truth
{"method":"tools/call","params":{"name":"ingest","arguments":{"path":"/your/project","agent_id":"dev"}}}

// 3. Ask a structural question — empty results say *why*, never just "no results"
{"method":"tools/call","params":{"name":"activate","arguments":{"query":"authentication flow","agent_id":"dev"}}}

First-session loop, in four moves: trust_selftest → ingest → seek/audit → memorize the durable finding so the next session starts ahead.

What m1nd Is Not

m1nd is not just:

a code search tool with a larger index
a repo RAG layer that only retrieves files or chunks
a graph database that leaves workflow decisions to the client
a static analysis replacement for the compiler, tests, or security tooling
an MCP bundle of unrelated utilities

It is the layer that turns those surfaces into an operational system an agent can reason over and act through. Not for one-file lookups, simple grep, or compiler truth — use plain tools there.

Why Agents Need It

Without m1nd, every session starts with grep loops and manual re-orientation; last week's findings are gone, and an empty search result is indistinguishable from a wrong-workspace bind. With m1nd, the session starts with a trust verdict, past findings auto-load already anchored to the code that backs them, and empty results say why.

Agents on real codebases do not fail because they cannot search. They fail because they have no operating model. They rebuild context from scratch every session, edit without knowing the blast radius, and cannot tell an empty result that means "nothing exists" from one that means "wrong repo."

That works for small codebases. It falls apart when the project has generated artifacts, specs, docs, hidden co-change history, multiple agents, and long handoffs. The problem is not only the agent's reasoning — the agent has no durable model of the codebase's structure. m1nd gives it one: a causal code graph with spreading activation across structural, semantic, temporal, and causal dimensions, plus Hebbian plasticity that compounds per-agent across sessions.

Compounding Memory (L1GHT)

Most tools give an agent better retrieval. m1nd also lets an agent author durable, machine-legible knowledge that compounds across sessions and stays honest against the code. L1GHT turns authored knowledge into graph-native structure that self-flags when the code it cites changes — confident claims spread more activation than uncertain ones.

The loop, end to end:

Conclude — the agent reaches something durable (a decision, a verified finding, why code is the way it is) and calls memorize with structured claims and evidence paths.

memorize({
  "agent_id": "dev",
  "node_label": "AuthTokenFlow",
  "claims": [
    { "label": "TokenValidator", "text": "validates JWTs via HMAC",
      "confidence": "high", "evidence": ["src/auth/token.rs"] }
  ]
})

Anchor — m1nd writes a graph-native .light.md under <runtime>/agent-memory/, ingests it (adapter=light mode=merge), and resolves each evidence path to the real code node via a grounded_in edge — so the knowledge lives in the same activation space as code and surfaces in seek / activate / impact.
Auto-load — on every future session start, m1nd ingests agent-memory/ automatically and reports it in session_handshake.agent_memory. Past findings survive a mode=replace ingest and are just there.
Self-flag staleness — cross_verify(check: ["evidence_freshness"]) re-hashes every cited file and names which claims have gone stale because their code changed — so memory tells you when it lies instead of misleading you.

This loop has been proven live end-to-end: memorize → grounded_in edge → freshness flag on an edited file → survives mode=replace → boot auto-load. Closing a bounded mission? Pass write_light_memory: true to mission_close to persist its verified claims the same way. The habit is documented in the server instructions every MCP client receives at initialize — host-agnostic, no client-specific plugin required.

The Trust / Honesty Layer

This is the most defensible thing m1nd does, and no competitor ships it. The doctrine: credibility comes from honesty, not from always winning.

trust_selftest returns a verdict before any retrieval: full_trust, needs_ingest, wrong_workspace_binding, stale_binding_suspected, or degraded_host_tool_surface. The agent knows whether to proceed, ingest, rebind, or fall back.
agent_runtime_contract rides on every retrieve response, carrying a trust_mode. An empty result is disambiguated — bound to the wrong repo vs. genuinely nothing there — never silently reported as "no results."
non_claims arrays ship on every mission tool. m1nd tells the agent what it did not prove.
mission_verify can say no — and does, in tested code. It rejects graph-only evidence: a claim cannot close without a file read, a test run, or a runtime probe. The test is literally named graph_only_evidence_is_not_enough.
recovery_playbook returns a deterministic, ordered step list to repair the binding.

The proof of the commitment is what was killed for it: savings and resonate were pulled from the advertised surface in beta.7 because a tool that always claims to win is not credible. No competitor — not mem0, Zep, Letta, Sourcegraph, or any code-graph MCP — ships a layer that tells the agent what not to trust and how to recover.

Language Coverage

Graph reasoning (impact, why, predict, trace, taint_trace) is only as good as the extractor. m1nd resolves both calls edges (call graph) and cross-file imports (file→file dependency resolution) per language. The matrix below was proven live in a single polyglot ingest:

Language	`calls`	cross-file imports
Rust	✅	✅ (`mod`/`use crate::`)
Python	✅	✅
JavaScript / TypeScript	✅	✅
Go	✅	✅ (package)
Java	✅	✅ (FQCN + wildcard)
C / C++	✅	✅ (`#include "..."`)
Kotlin	✅	✅ (package)
PHP	✅	✅ (PSR-4)
Scala	✅	✅ (package)
Ruby	⏳	✅ (`require_relative`)
C#	✅	— (namespaces don't map 1:1 to files)
Swift	✅	—

All ✅ rows are verified end-to-end (a caller→callee import resolves and the caller emits call edges). Other languages fall back to the generic extractor (contains only). Unresolvable imports (external packages, gems, stdlib, system headers) are honestly left unresolved rather than guessed.

Capability Map

The live MCP surface evolves with releases. Use tools/list for the exact tool count and names in your current build.

Area	What it enables	Representative tools
Graph foundation	ingest code, maintain graph state, diagnose session continuity, reinforce useful paths, and detect cross-session weight drift	`trust_selftest`, `session_handshake`, `recovery_playbook`, `ingest`, `health`, `doctor`, `learn`, `warmup`, `drift`
Retrieval and orientation	search by text, path, intent, structure, or relationship before manual file reads	`audit`, `search`, `glob`, `seek`, `activate`, `why`, `trace`
Docs and knowledge binding	ingest universal docs or graph-native `L1GHT`, then link concepts back to code	`ingest(adapter="universal"\|"light")`, `document_resolve`, `document_provider_health`, `document_bindings`, `document_drift`, `auto_ingest_*`
Navigation and continuity	keep stateful routes, handoffs, baselines, and investigation memory across sessions	`perspective_`, `trail_`, `coverage_session`, `boot_memory`, `persist`
Mission control and proof discipline	keep a bounded route, record events, switch from graph orientation to direct proof, hand off, and close with explicit gaps	`mission_start`, `mission_event`, `mission_next`, `mission_verify`, `mission_handoff`, `mission_close`
Change planning and proof	reason about impact, co-change, missing steps, failure paths, and structural claims	`impact`, `predict`, `validate_plan`, `missing`, `hypothesize`, `counterfactual`, `differential`
Quality, security, and architecture	detect patterns, taint paths, trust boundaries, duplication, layer violations, type flows, and refactor targets	`scan`, `scan_all`, `heuristics_surface`, `antibody_*`, `taint_trace`, `type_trace`, `trust`, `layers`, `layer_inspect`, `twins`, `fingerprint`, `flow_simulate`, `epidemic`, `tremor`, `refactor_plan`
Time, runtime, and multi-repo work	inspect git history, drift, hidden co-change edges, runtime overlays, and cross-repo references	`timeline`, `diverge`, `ghost_edges`, `runtime_overlay`, `external_references`, `federate`, `federate_auto`
Operations and monitoring	audit repo state, verify graph-vs-disk truth, run daemon watches, persist state, and surface durable alerts	`audit`, `cross_verify`, `daemon_`, `alerts_`, `panoramic`, `metrics`, `report`, `persist`, `diagram`, `help`
Surgical edit prep and execution	pull compact connected context, preview writes, and apply graph-aware edits	`surgical_context`, `surgical_context_v2`, `view`, `batch_view`, `edit_preview`, `edit_commit`, `apply`, `apply_batch`

Tiering: 27 essential tools are advertised by default to reduce tool-selection cost; set M1ND_TOOL_TIER=full to advertise the full surface (100+ tools: RETROBUILDER, perspectives, federation, daemon). A few tools (resonate, savings, lock_*) remain callable by name but are not on the advertised surface. Hidden tools are always callable via tools/call — tiering only controls what tools/list surfaces.

The Operating Loops

The agent pack is part of the product, not decorative documentation. m1nd is strongest when the agent receives the operating loop, not merely a graph endpoint. Five named protocols ship in the pack:

Session Start — trust_selftest → recovery_playbook if trust is not full → ingest if needed → seek/audit.
Research — ingest → activate(query) → why(source, target) → missing(topic) → learn(feedback) → memorize any durable finding.
Code Change — impact(node) for blast radius → predict(node) → counterfactual(nodes) → surgical_context_v2 → memorize the decision and why.
Deep Analysis — fingerprint, diverge, ghost_edges, taint_trace, twins, refactor_plan, runtime_overlay (the RETROBUILDER lens) for hidden coupling, security paths, structural duplicates, and runtime heat.
Memory — persist durable conclusions with memorize, carrying confidence and evidence paths.

Mission Control is proof discipline, not a feature list. mission_next returns exactly one move plus do_not guardrails; mission_verify rejects graph-only claims; mission_close always nudges the agent to persist verified knowledge and records gaps and non-claims. In bug_hunt mode, MC0 requires a final direct direct_sweep after verified findings before close, so agents check negative space.

Caveat: predict has structural fallback only until ghost_edges loads the git co-change matrix — run ghost_edges first when you need real co-change likelihood.

Evidence

Every row is hedged to exactly what was measured. m1nd does not lead with savings or ROI numbers — that is the point.

Claim	Result	Source / hedge
`activate` / `impact` latency	sub-µs `activate`, sub-ms `impact`	Criterion benchmarks in `m1nd-core/benches/` on a 1K-node synthetic graph — methodology; treat as order-of-magnitude.
Language matrix	calls + cross-file imports for 10 languages (+ Ruby cross-file)	Verified end-to-end in a single polyglot ingest; per-language tests in `m1nd-ingest`. See Language Coverage.
Post-write validation sample	12/12 classified correctly	Internal runtime check.
Seeded bug-hunt	16/20 in the first accepted `humanize` seeded-defect round (m1nd-trained); `m1nd-basic` and direct each 8/15	Internal product evidence, `public_claim_worthy=false` — not a universal benchmark.
Memory self-verification	proven live end-to-end	`memorize` → `grounded_in` → freshness flag on edited file → survives replace → boot auto-load.

Limits

m1nd complements rather than replaces your LSP, compiler, test runner, security scanners, and observability stack. It is most useful before search, review, or change, and whenever docs, impact, or continuity matter.

It is less useful when:

exact text search already answers the question
compiler or runtime truth is the only thing you need
the task is a trivial local file action with no structural uncertainty

Needs feeding: trust and tremor start with neutral priors until learn feedback / ghost_edges data accumulates, and predict needs ghost_edges loaded first before its co-change signal is meaningful. These improve with use; they are honest about being uninformed at boot.

Architecture At A Glance

Three core Rust crates plus one auxiliary bridge:

m1nd-mcp — the MCP server and operational runtime surface.
m1nd-core — the graph engine: a WavefrontEngine doing spreading activation, Hebbian plasticity, CSR adjacency, and git-derived ghost edges.
m1nd-ingest — extraction, routing, and graph construction adapters (code, universal docs, L1GHT).
m1nd-openclaw — auxiliary OpenClaw bridge (Unix-socket lane, independently versioned).

Current crate versions: m1nd-core, m1nd-ingest, m1nd-mcp all 0.9.0-beta.8.

For federation, perspectives, RETROBUILDER, multi-agent coordination, and the full agent-pack and operator reference, see the canonical wiki, docs/AGENT-PACKS.md, and EXAMPLES.md.

Contributing

Contributions are welcome across extractors and adapters, MCP/runtime tooling, benchmarks, docs, and graph algorithms. See CONTRIBUTING.md.

License

MIT. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 522 Commits
.agents/workflows		.agents/workflows
.github		.github
docs		docs
i18n		i18n
m1nd-core		m1nd-core
m1nd-demo		m1nd-demo
m1nd-ingest		m1nd-ingest
m1nd-mcp		m1nd-mcp
m1nd-openclaw		m1nd-openclaw
m1nd-ui		m1nd-ui
m1nd-viz		m1nd-viz
npm		npm
scripts		scripts
skills		skills
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
EXAMPLES.md		EXAMPLES.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Local Mission Runtime for Coding Agents

Quick Start

Agent Entry Point

What m1nd Is Not

Why Agents Need It

Compounding Memory (L1GHT)

The Trust / Honesty Layer

Language Coverage

Capability Map

The Operating Loops

Evidence

Limits

Architecture At A Glance

Contributing

License

About

Uh oh!

Releases 14

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

A Local Mission Runtime for Coding Agents

Quick Start

Agent Entry Point

What m1nd Is Not

Why Agents Need It

Compounding Memory (L1GHT)

The Trust / Honesty Layer

Language Coverage

Capability Map

The Operating Loops

Evidence

Limits

Architecture At A Glance

Contributing

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 14

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages