Memory Mission

A governed context engine for agents: turn a firm's scattered knowledge into one queryable, auditable layer that AI agents can act on safely.

Python infrastructure for pulling data from external sources (email + calendar + transcripts + documents + venture CRMs + Slack), distilling it into git-versioned markdown the firm owns, and surfacing it through a hybrid-search retrieval interface — with every extraction, promotion, and retrieval logged for compliance audit.

Two planes (personal and firm) separated by a PR-model review gate. Every fact traces to a source, a reviewer, and a rationale. Nothing lands on firm-plane memory without an explicit human decision.

Where to start

| For | Read |
| --- | --- |
| The thesis and who it's for | `docs/VISION.md` |
| The shipped architecture + module walkthrough | `docs/ARCHITECTURE.md` |
| Every Pydantic model, predicate, tier in one place | `docs/ABSTRACTIONS.md` |
| How we measure whether the system is right | `docs/EVALS.md` |
| Load-bearing decisions with rationale | `docs/adr/` |
| Operator recipes (hot-cache hooks, Bases dashboard) | `docs/recipes/` |
| How agents should navigate this repo | `docs/AGENTS.md` |
| Per-step chronology | `BUILD_LOG.md` |

Quickstart

git clone https://github.com/SvenWell/memory-mission.git
cd memory-mission

python3 -m venv .venv && source .venv/bin/activate
pip install -e '.[dev]'

make check          # ruff + format + mypy --strict + 903 tests
python -m memory_mission info

A working slice you can run today (in-memory, no external services):

from pathlib import Path
from memory_mission.memory import HashEmbedder, InMemoryEngine, new_page
from memory_mission.observability import observability_scope

with observability_scope(observability_root=Path("./.observability"),
                         firm_id="acme"):
    engine = InMemoryEngine(embedder=HashEmbedder())
    engine.put_page(
        new_page(
            slug="sarah-chen", title="Sarah Chen", domain="people",
            compiled_truth="CEO of [[acme-corp]]. Direct, numbers-heavy.",
        ),
        plane="firm",
    )
    hits = engine.query("CEO of acme")
    # Each query writes a RetrievalEvent to .observability/<firm>/events.jsonl
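
To check what the slice above logged, the audit file can be read back line by line. The following is a minimal sketch that assumes only that `events.jsonl` holds one JSON object per line; the `event_type` key and the `count_events` helper are hypothetical names chosen for illustration and are not confirmed by the source.

```python
import json
from collections import Counter
from pathlib import Path


def count_events(log_path: Path, key: str = "event_type") -> Counter:
    """Count audit records by the value under `key` (a hypothetical field name)."""
    counts: Counter = Counter()
    if not log_path.exists():
        return counts  # nothing logged yet
    with log_path.open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # tolerate blank lines in the log
            record = json.loads(line)
            counts[str(record.get(key, "unknown"))] += 1
    return counts
```

With the quickstart paths, a call might look like `count_events(Path("./.observability/acme/events.jsonl"))`. Check the real event schema in `docs/ABSTRACTIONS.md` before relying on any field name.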

End-to-end with the full stack (connectors → extract → review → promote → meeting-prep):

from memory_mission.synthesis import compile_agent_context

# Assuming a populated KG + identity resolver (see ARCHITECTURE.md for setup)
context = compile_agent_context(
    role="meeting-prep",
    task="Prep the Q3 review with Acme Corp",
    attendees=["p_alice_abc123"],
    kg=kg,
    engine=engine,
    identity_resolver=resolver,
    plane="firm",
    tier_floor="policy",          # only authoritative doctrine + policy
)
briefing = context.render()       # markdown for the host-agent LLM

What shipped

V1 complete + Step 18 MCP surface shipped + MemPalace personal substrate adopted + P2 capability-based connector manifest + venture-pack (Calendar + Affinity + Outlook + OneDrive/SharePoint + Attio + Notion + Slack) + P7-A venture overlay (constitution + page templates + lifecycle predicate vocabulary + 3 workflow skills). 18 build steps + six-move polish pass + a three-reviewer security-response pass (21 fixes across B1-B28). 837 tests passing, mypy --strict clean on 80 source files. P0–P2 merged to main at 3749034.

| Layer | What you can do today |
| --- | --- |
| Foundations | Append-only observability log per firm; checkpointed durable execution with resume-on-crash; PII-redacted middleware around every LLM call |
| Connectors | `Connector` Protocol + `invoke()` harness. Composio-backed adapters (SDK stub; host wires the client) for Gmail, Outlook, Google Calendar, Granola, Google Drive, OneDrive/SharePoint, Affinity, Attio, Notion, Slack: 10 apps across 6 capability roles (`email` / `calendar` / `transcript` / `document` / `workspace` / `chat`). Per-firm `firm/systems.yaml` binds roles → apps; envelope helpers normalize each app's raw payload to `NormalizedSourceItem` with fail-closed visibility mapping (ADR-0007 + ADR-0011) |
| Memory | Compiled-truth + timeline page format (Obsidian-compatible); SQLite temporal knowledge graph with Bayesian corroboration (Noisy-OR, 0.99 cap); hybrid search (RRF + cosine + compiled-truth boost); tier-aware authority hierarchy; read-only SQL surface for ad-hoc queries |
| Identity | `IdentityResolver` Protocol + SQLite-backed local resolver; stable `p_<id>` / `o_<id>` across email / LinkedIn / Twitter / phone; `merge_entities` with reviewer gate |
| Extraction | Six-bucket `ExtractedFact` Pydantic union with mandatory support-quote; `EXTRACTION_PROMPT` markdown template; ingest-time canonicalization to stable IDs; zero LLM-SDK imports in-repo |
| Permissions | Per-firm `Policy` with scopes + employees; `can_read` + `can_propose` with no-escalation; read-path enforcement inside `BrainEngine` |
| Promotion | `Proposal` + `ProposalStore`; `create_proposal` / `promote` / `reject` / `reopen` with required rationale; coherence check emits structured warnings; opt-in constitutional mode blocks on contradictions |
| Federated | Cross-employee pattern detector with distinct-source-file independence check; stages firm-plane proposals from N≥3 employees' personal planes |
| Synthesis | `compile_agent_context(role, task, attendees, ...)` returns a structured `AgentContext` package; Tolaria Neighborhood-mode shape; Obsidian `[!contradiction]` callouts on rendered pages |
| MCP surface | FastMCP server (`memory-mission/v1`, 14 tools: 8 read + 6 write). One process per employee. Per-firm `mcp_clients.yaml` manifest with NFKC + dup-key + symlink rejection. Every mutating tool opens an `observability_scope`, so audit trail coverage is complete over MCP, not just over the Python API |
| Scope enforcement | Fail-closed when no policy configured. `viewer_scopes` filter on KG triples. `can_propose` no-escalation on `create_proposal`. Scope column on every triple; pre-flight scope scan in `_apply_facts` so scope mismatches never leave partial KG writes. WAL + busy_timeout on all per-firm SQLite DBs |
| Skills | 14 shipped. Personal-source backfills (employee plane): `backfill-gmail`, `backfill-outlook`, `backfill-granola`, `backfill-calendar`. Firm-source backfills (firm plane, admin-run): `backfill-firm-artefacts` (Drive), `backfill-onedrive` (OneDrive + SharePoint), `backfill-affinity`, `backfill-attio`, `backfill-notion`. Mixed-plane (per-message split via helper override): `backfill-slack`. Workflow: `extract-from-staging`, `review-proposals`, `detect-firm-candidates`, `meeting-prep` |
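
The Memory row names RRF (reciprocal rank fusion) as one ingredient of hybrid search. As an illustration of the general technique only, here is a minimal sketch of textbook RRF, where each document scores the sum of `1 / (k + rank)` over the lists it appears in. The `rrf_fuse` name, the sample slugs, and the conventional `k = 60` are assumptions; the repo's `search.py` also applies cosine similarity and a compiled-truth boost, which this sketch leaves out.

```python
from collections import defaultdict


def rrf_fuse(rankings: list[list[str]], k: int = 60) -> list[tuple[str, float]]:
    """Fuse several ranked lists of page slugs with reciprocal rank fusion."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        # Ranks are 1-based: the top hit in each list contributes 1 / (k + 1).
        for rank, slug in enumerate(ranking, start=1):
            scores[slug] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by slug so the order is deterministic.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical result lists from a keyword search and a vector search.
keyword_hits = ["sarah-chen", "acme-corp", "q3-review"]
vector_hits = ["sarah-chen", "board-notes", "acme-corp"]
fused = rrf_fuse([keyword_hits, vector_hits])
```

A page that ranks well in both lists rises above one that appears in only a single list, which is the property that makes RRF useful for combining keyword and embedding results without calibrating their raw scores.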

Full per-step chronology in BUILD_LOG.md.

How it composes

Each layer reads from the one below without rewriting it.

A connector pulls from any of the 10 supported apps through the invoke() harness — PII-scrubbed, durable-checkpointed, logged as a ConnectorInvocationEvent.
The connector's raw payload feeds an envelope helper (gmail_message_to_envelope, slack_message_to_envelope, etc.) which consults firm/systems.yaml for the firm-shaped target_scope and target_plane. Staging then writes the NormalizedSourceItem envelope to staging/<plane>/<source>/<id>.md plus a raw sidecar via StagingWriter.write_envelope. Visibility mapping is fail-closed by default — items whose visibility metadata can't map to a firm scope are rejected, not silently defaulted.
The extract-from-staging skill runs the host agent's LLM against EXTRACTION_PROMPT. Response JSON parses into ExtractionReport; ingest_facts canonicalizes entity names via the IdentityResolver and writes structured facts to fact-staging.
create_proposal groups facts by entity into a Proposal. Deterministic proposal_id — idempotent under re-extraction.
The review-proposals skill surfaces each proposal to a human reviewer. Coherence warnings on tier conflicts surface as forcing questions. Approve / reject / reopen always requires rationale.
promote() corroborates an existing currently-true triple (Bayesian update) or adds a new one. Every source appends to triple_sources. Firm-plane writes are the only way facts leave staging.
The detect-firm-candidates skill (admin) scans personal planes for cross-employee patterns; distinct-source-file threshold defeats the "three people sharing one Granola transcript" failure mode. Candidates route through the same proposal pipeline.
The meeting-prep skill calls compile_agent_context — distilled doctrine + per-attendee neighborhoods with inline provenance + coherence callouts. The host-agent LLM drafts the briefing; Memory Mission doesn't own the generation.
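
The corroboration performed by promote() is described elsewhere in this README as Noisy-OR with a 0.99 cap. A minimal sketch of that arithmetic, assuming each source contributes an independent confidence in [0, 1]: the combined confidence is `1 - prod(1 - p_i)`, clamped to the cap. The `noisy_or` function name is hypothetical, and how the repo assigns per-source confidences is not stated in the source.

```python
import math


def noisy_or(confidences: list[float], cap: float = 0.99) -> float:
    """Combine independent source confidences as 1 - prod(1 - p_i), capped."""
    for p in confidences:
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"confidence out of range: {p}")
    combined = 1.0 - math.prod(1.0 - p for p in confidences)
    # The cap keeps repeated corroboration from ever reaching certainty.
    return min(combined, cap)
```

Two sources at 0.6 and 0.5 combine to 1 - (0.4 × 0.5) = 0.8, while three sources at 0.9 would reach 0.999 uncapped and are therefore held at 0.99.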

Every link in the chain is auditable. Every write is reviewed. No link is silent.
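
The idempotency claim for proposal_id in the chain above is worth making concrete. The source does not say how the ID is derived; one common approach, sketched here purely as an assumption, is to hash a canonical serialization of the entity and its facts so that re-extracting the same facts in any order yields the same ID. The `deterministic_proposal_id` name, the `prop_` prefix, and the dict-shaped facts are all hypothetical.

```python
import hashlib
import json


def deterministic_proposal_id(entity_id: str, facts: list[dict]) -> str:
    """Derive a stable ID from an entity and its facts, independent of fact order."""
    # Canonical form: serialize each fact with sorted keys, then sort the facts.
    canonical_facts = sorted(json.dumps(fact, sort_keys=True) for fact in facts)
    payload = json.dumps({"entity": entity_id, "facts": canonical_facts})
    digest = hashlib.sha256(payload.encode("utf-8")).hexdigest()
    return f"prop_{digest[:16]}"  # hypothetical prefix and length
```

Because the ID depends only on content, a store keyed on it can treat a repeated extraction as a no-op rather than creating a duplicate proposal.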

Open the vault in Obsidian

The on-disk format is vault-native. Point Obsidian at <firm_root>/ and the whole governed memory appears as a working vault — graph view, linked mentions, search, tag panel, all of it.

The Bases dashboard (Obsidian ≥ v1.9.10) gives partners a native database view:

cp src/memory_mission/memory/templates/dashboard.base <firm_root>/firm/dashboard.base

Five views out of the box: Recent changes, Low confidence, Stale or unreviewed, Constitution + doctrine, By domain. Install notes in docs/recipes/vault-dashboard.md.

For employees running a host-agent session against their personal plane, the hot-cache hook recipe makes session memory persistent across restarts: docs/recipes/personal-hot-cache.md.

Repo layout

src/memory_mission/
├── observability/          # append-only JSONL audit, per-firm scoped
├── durable/                # checkpointed runs, resume-on-crash
├── middleware/             # LLM-call chain + PII redaction
├── ingestion/              # connectors (Composio harness + Gmail/Granola/Drive),
│                           # systems_manifest, envelopes, staging, mentions
├── memory/
│   ├── pages.py            # compiled-truth + timeline format
│   ├── schema.py           # MECE domains + plane paths
│   ├── engine.py           # BrainEngine Protocol + InMemoryEngine
│   ├── knowledge_graph.py  # SQLite temporal KG + corroborate + coherence
│   ├── tiers.py            # constitution / doctrine / policy / decision
│   ├── search.py           # RRF + cosine + compiled-truth boost
│   └── templates/          # dashboard.base
├── personal_brain/         # PersonalMemoryBackend Protocol + MemPalaceAdapter
├── extraction/             # 6-bucket ExtractedFact + ingest_facts
├── identity/               # IdentityResolver Protocol + LocalIdentityResolver
├── permissions/            # Policy + can_read / can_propose
├── promotion/              # Proposal + PR-model review gate
├── federated/              # cross-employee pattern detector
├── synthesis/              # compile_agent_context + AgentContext
└── mcp/                    # FastMCP server — 14 tools over stdio (Step 18)

skills/                     # 18 shipped, markdown + YAML frontmatter
tests/                      # 903 passing
docs/                       # VISION + ARCHITECTURE + ABSTRACTIONS + EVALS + AGENTS + adr/ + recipes/
BUILD_LOG.md                # per-step record

Operational notes

Configuration is environment-driven via MM_* vars (see src/memory_mission/config.py):

| Variable | Default | Purpose |
| --- | --- | --- |
| `MM_WIKI_ROOT` | `./wiki` | Root for firm content (curated pages + staging) |
| `MM_OBSERVABILITY_ROOT` | `./.observability` | Append-only audit log root |
| `MM_DATABASE_URL` | (empty) | Unused pre-pilot; SQLite-per-firm is the default (see ADR-0005). Placeholder for a future hosted option if a pilot demands it |
| `MM_LLM_PROVIDER` | `anthropic` | `anthropic` / `openai` / `gemini` (the host agent uses this; we don't import the SDK) |
| `MM_LLM_MODEL` | `claude-sonnet-4-6` | Default model identifier |
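
For orientation, the variables and defaults in the table can be read with nothing but the standard library. This is a sketch under stated assumptions, not the contents of `src/memory_mission/config.py`: the `Settings` dataclass and `load_settings` function are hypothetical names, and the real module may validate or structure these values differently.

```python
import os
from dataclasses import dataclass
from pathlib import Path
from typing import Mapping


@dataclass(frozen=True)
class Settings:
    """Hypothetical container for the MM_* variables listed in the table."""
    wiki_root: Path
    observability_root: Path
    database_url: str
    llm_provider: str
    llm_model: str


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read MM_* variables from `env`, falling back to the documented defaults."""
    return Settings(
        wiki_root=Path(env.get("MM_WIKI_ROOT", "./wiki")),
        observability_root=Path(env.get("MM_OBSERVABILITY_ROOT", "./.observability")),
        database_url=env.get("MM_DATABASE_URL", ""),
        llm_provider=env.get("MM_LLM_PROVIDER", "anthropic"),
        llm_model=env.get("MM_LLM_MODEL", "claude-sonnet-4-6"),
    )
```

Passing the environment in as a mapping keeps the loader easy to test: `load_settings({"MM_LLM_PROVIDER": "openai"})` overrides one value and leaves the rest at their defaults.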

Per-firm isolation is filesystem-based: each firm gets its own subdirectory and its own SQLite files (KG, identity, proposals, mentions, durable). WAL and busy_timeout=5000 are set on all stores so that the per-employee MCP processes of a firm can share those files without lock failures. No hosted DB pre-pilot (ADR-0005).
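
The two SQLite settings named above can be applied with the standard `sqlite3` module. A minimal sketch, assuming a per-firm database file; the `open_firm_db` helper name is hypothetical and the repo's own connection code may differ.

```python
import sqlite3
from pathlib import Path


def open_firm_db(db_path: Path) -> sqlite3.Connection:
    """Open a SQLite file with write-ahead logging and a 5-second busy timeout."""
    conn = sqlite3.connect(db_path)
    # WAL lets readers keep reading while a single writer appends to the log.
    conn.execute("PRAGMA journal_mode=WAL")
    # Wait up to 5000 ms for a lock instead of failing immediately with SQLITE_BUSY.
    conn.execute("PRAGMA busy_timeout=5000")
    return conn
```

WAL mode is persistent once set on a database file, while `busy_timeout` is per connection, so each process needs to set it when it opens the file.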

Day-to-day:

make check           # full pre-commit check
make lint-fix        # auto-apply ruff fixes
pytest -k <pattern>  # run a subset

Next chapter — venture-first pilot

See /Users/svenwellmann/.claude/plans/we-ve-built-this-and-curious-unicorn.md for the full plan. Summary:

P0 — DONE. MemPalace is the adopted personal-layer substrate behind PersonalMemoryBackend (ADR-0004).
P2 — DONE. Capability-based connector manifest + fail-closed visibility mapping. firm/systems.yaml binds logical roles (email, calendar, transcript, document, workspace) to concrete apps; NormalizedSourceItem envelope + per-app helpers + StagingWriter.write_envelope are the single staging entry path (ADR-0007).
P3 — Personal-source ingestion. Wire Gmail + Granola + Calendar through the envelope helpers into the personal substrate. Pilot-task acceptance scenarios run against real-shape data.
P4 — Firm-source ingestion + bridge. Notion / Attio / Drive as document_system / workspace_system. Promotion review preserves source-side scope.
P5 — Typed sync-back for approved facts. Reviewed-mode default; per-app allowed_mutation_kinds (ADR-0008 when landed).
P6 — Evidence-pack retrieval + firm auto-wiring at promote time (ADRs 0006 + 0009).
P7 — Venture reference overlay. PE + wealth as thinner overlays on the same core.
P8 — Pilot rehearsal + benchmarks. Employee-memory-on-firm-tasks primary; LongMemEval secondary.

Post-V1 roadmap (parked)

Deferred items, each with a real-data trigger. See project_post_v1_roadmap.md for triggers.

Step 19: Legislative amendment cycle (batched promotions triggered by evidence pressure).
Step 20: Constitution bootstrap skill (cold-start firm truth from existing strategy docs).
Step 21: Relationship-strength view + Graph One adapter (needs real interaction volume).
Identity channel extension (Slack/Telegram/WhatsApp/phone/E.164 normalization).
50-scenario federated eval harness (per EVALS § 2.6).
Distillation coherence eval (per EVALS § 2.7).
CoherenceResolvedEvent so the contradiction callout hides acknowledged conflicts.
/save (conversation → personal-plane note) + /autoresearch (WebSearch + WebFetch loop) as optional skills.
ML-powered query rewriting on hybrid search.

License

Proprietary — see pyproject.toml.
