codex_context_engine

An external memory and context orchestration engine for Codex.

It is designed to help Codex work on complex projects without repeatedly rediscovering the same information.

Instead of reconstructing context from scratch on every task, the engine progressively builds, optimizes and connects contextual knowledge across executions.

The result is a system that becomes more efficient, more aware of project structure, and less prone to repeating mistakes over time.

Canonical Runtime

codex_context_engine is the canonical runtime.

Operational entrypoints currently shipped in this repository:

ruby scripts/install_cross_project_for_all_repos.rb
./scripts/install_launch_agent.sh
./scripts/uninstall_launch_agent.sh

In cross-project mode, task packets now write telemetry both to the shared root .context_metrics/ layer and to .context_metrics/projects/<repo>/, so savings reports can be attributed per repository.
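The dual write can be pictured roughly as follows. This is a minimal sketch, not the engine's actual code: the file name task_packets.jsonl, the JSON Lines format, and the record fields are all assumptions made for illustration.

```python
import json
import tempfile
from pathlib import Path


def write_task_telemetry(shared_root: Path, repo: str, record: dict) -> list[Path]:
    """Append one telemetry record to the shared layer and to the per-repo layer."""
    metrics = shared_root / ".context_metrics"
    targets = [
        metrics / "task_packets.jsonl",                      # hypothetical file name
        metrics / "projects" / repo / "task_packets.jsonl",  # per-repository copy
    ]
    line = json.dumps({"repo": repo, **record})
    for target in targets:
        target.parent.mkdir(parents=True, exist_ok=True)
        with target.open("a", encoding="utf-8") as handle:
            handle.write(line + "\n")
    return targets


# Demo in a temporary directory so nothing real is touched.
demo_root = Path(tempfile.mkdtemp())
written = write_task_telemetry(
    demo_root, "my_repo", {"task": "fix-login", "tokens_saved": 1200}
)
```

Because every record carries the repository name and also lands under projects/<repo>/, a report can be built either from the shared file or from one repository's folder alone.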

Core Idea

Large coding tasks often fail not because of model capability, but because of context loss.

Each run typically starts with limited awareness of:

project structure
previous decisions
past failures
task-specific knowledge

This leads to repeated exploration, unnecessary context loading, and duplicated reasoning.

codex_context_engine addresses this by introducing an autoincremental context layer that persists and evolves contextual knowledge across runs.

Instead of starting from zero, the engine gradually accumulates understanding of the project.

How the Engine Works

The engine follows a layered contextual workflow.

Task
↓
Context Planner
↓
Context Cost Optimizer
↓
Execution
↓
Communication Compression Layer
↓
Failure Memory
↓
Task-Specific Memory
↓
Memory Graph
↓
Granular Telemetry
↓
Knowledge Mods
↓
Knowledge Processing Pipeline
↓
Knowledge Retrieval Engine
↓
Reference-Based Ingestion
↓
Remote Knowledge Ingestion

Each layer improves how context is selected, used and remembered.

Engine Architecture

flowchart TD
    A[Task] --> B[Context Planner]
    B --> C[Context Cost Optimizer]
    C --> D[Execution]
    D --> E[Communication Compression Layer]

    E --> F[Failure Memory]
    E --> G[Task-Specific Memory]

    F --> H[Memory Graph]
    G --> H
    H --> I[Granular Telemetry]

    I --> J[Knowledge Mods]
    J --> K[Knowledge Processing Pipeline]
    K --> L[Knowledge Retrieval Engine]
    L --> M[Reference-Based Ingestion]
    M --> N[Remote Knowledge Ingestion]

Context Planner

Determines which contextual resources should be loaded for a given task.

Responsibilities:

detect task type
select relevant context sources
control context depth
avoid unnecessary exploration

Goal: load the right context before execution begins.
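A planner along these lines could be sketched as below. The keyword table, the source names, and the max_sources depth limit are hypothetical; the real planner may classify tasks very differently.

```python
from dataclasses import dataclass

# Hypothetical keyword -> task type mapping (first match wins).
TASK_KEYWORDS = {
    "bugfix": ("fix", "bug", "error", "crash"),
    "testing": ("test", "spec", "coverage"),
    "refactor": ("refactor", "rename", "cleanup"),
}

# Hypothetical task type -> context sources, ordered by importance.
SOURCES_BY_TYPE = {
    "bugfix": ["failure_memory", "task_memory", "memory_graph"],
    "testing": ["task_memory", "failure_memory"],
    "refactor": ["memory_graph", "task_memory"],
    "general": ["task_memory"],
}


@dataclass
class ContextPlan:
    task_type: str
    sources: list[str]


def plan_context(task: str, max_sources: int = 2) -> ContextPlan:
    """Detect the task type, then pick a depth-limited list of context sources."""
    words = [word.strip(".,:;") for word in task.lower().split()]
    task_type = "general"
    for candidate, keywords in TASK_KEYWORDS.items():
        if any(word in keywords for word in words):
            task_type = candidate
            break
    return ContextPlan(task_type, SOURCES_BY_TYPE[task_type][:max_sources])


plan = plan_context("Fix crash in the login form")
```

Here max_sources stands in for "control context depth": a lower value loads fewer sources before execution starts.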

Context Cost Optimizer

Reduces token usage and latency by filtering context before it reaches the model.

Responsibilities:

deduplicate context blocks
score contextual relevance
filter oversized or low-value entries
prioritize useful context

Goal: ensure only high-value context is sent to the model.
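The four responsibilities above can be illustrated with a small filter. This is a sketch under stated assumptions: relevance is approximated by word overlap with the task, and the thresholds (max_chars, min_score, limit) are invented for the example.

```python
import hashlib


def optimize_context(blocks: list[str], task: str, max_chars: int = 2000,
                     min_score: float = 0.1, limit: int = 5) -> list[str]:
    """Deduplicate, score, filter, and prioritize context blocks."""
    task_terms = set(task.lower().split())
    seen: set[str] = set()
    scored: list[tuple[float, str]] = []
    for block in blocks:
        # Deduplicate on whitespace- and case-normalized content.
        normalized = " ".join(block.split()).lower()
        digest = hashlib.sha256(normalized.encode()).hexdigest()
        if digest in seen:
            continue
        seen.add(digest)
        # Drop oversized entries.
        if len(block) > max_chars:
            continue
        # Score: share of task words that also appear in the block.
        overlap = task_terms & set(normalized.split())
        score = len(overlap) / len(task_terms) if task_terms else 0.0
        if score >= min_score:
            scored.append((score, block))
    # Highest-scoring blocks first; keep at most `limit`.
    scored.sort(key=lambda item: item[0], reverse=True)
    return [block for _, block in scored[:limit]]


selected = optimize_context(
    [
        "login form validation lives in app/forms/login.py",
        "Login form validation lives in  app/forms/login.py",  # duplicate after normalization
        "billing export runs nightly",                          # irrelevant to the task
        "x" * 5000,                                             # oversized
    ],
    task="fix login form validation",
)
```

Only the first block survives: the second is a duplicate, the third scores zero, and the fourth exceeds the size limit.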

Communication Compression Layer (Iteration 16)

Iteration 16 introduces a communication compression layer inspired by Caveman-style agent communication.

The purpose of this layer is to reduce token waste in execution-time communication without degrading reasoning quality, code quality, or technical precision.

The engine already optimizes:

what context gets loaded
what context gets filtered
what knowledge gets retrieved

Iteration 16 adds optimization for:

how the agent reports progress
how implementation findings are communicated
how execution summaries are delivered
how much output-token waste is spent on filler and repeated phrasing

Default behavior:

no intermediate execution updates while work is in progress
final output only by default
minimal filler
direct reporting of findings, files, tests, risks, and decisions
no decorative formatting in runtime results
compressed communication for the execution loop only

Important boundary:

this layer affects runtime communication
it does not automatically rewrite repository prose, docs, marketing copy, or narrative content into caveman style

Core principle:

reason full, speak lean
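As a rough illustration of "speak lean", a final report could be assembled from structured fields with filler removed. The filler prefixes, section labels, and output format below are assumptions for the sketch, not the layer's actual rules.

```python
# Hypothetical filler phrases to strip from the start of a sentence.
FILLER_PREFIXES = ("i have ", "i've ", "basically, ", "as requested, ")
SECTION_ORDER = ("findings", "files", "tests", "risks", "decisions")


def strip_filler(sentence: str) -> str:
    """Remove one leading filler phrase and the trailing period."""
    lowered = sentence.lower()
    for prefix in FILLER_PREFIXES:
        if lowered.startswith(prefix):
            sentence = sentence[len(prefix):]
            break
    return sentence.strip().rstrip(".")


def lean_report(sections: dict[str, list[str]]) -> str:
    """Emit one plain line per non-empty section, with no decoration."""
    lines = []
    for label in SECTION_ORDER:
        items = [strip_filler(item) for item in sections.get(label, []) if item.strip()]
        if items:
            lines.append(f"{label}: " + "; ".join(items))
    return "\n".join(lines)


report = lean_report({
    "findings": ["Basically, the token refresh ran twice."],
    "files": ["auth/session.py"],
    "tests": ["12 passed"],
    "risks": [],
})
```

The function only reshapes what is reported; it does not touch the reasoning that produced the findings, which matches the boundary described above.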

Failure Memory

Captures knowledge about what did not work.

Responsibilities:

record failed attempts
detect recurring friction points
surface relevant failure patterns

Goal: prevent the engine from repeating ineffective strategies.
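A failure memory with these three responsibilities might look like the sketch below. The record fields (task_type, strategy, reason) and the in-memory list are assumptions; the engine presumably persists its records in its own format.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class FailureMemory:
    records: list[dict] = field(default_factory=list)

    def record(self, task_type: str, strategy: str, reason: str) -> None:
        """Record one failed attempt."""
        self.records.append(
            {"task_type": task_type, "strategy": strategy, "reason": reason}
        )

    def recurring(self, min_count: int = 2) -> list[str]:
        """Strategies that failed at least `min_count` times (friction points)."""
        counts = Counter(r["strategy"] for r in self.records)
        return [s for s, n in counts.most_common() if n >= min_count]

    def relevant(self, task_type: str) -> list[dict]:
        """Failure records to surface before starting a task of this type."""
        return [r for r in self.records if r["task_type"] == task_type]


memory = FailureMemory()
memory.record("bugfix", "patch generated file", "overwritten by build")
memory.record("bugfix", "patch generated file", "overwritten by build")
memory.record("testing", "run full suite", "timeout")
```

Before a new bugfix task, relevant("bugfix") would surface both earlier attempts, and recurring() flags "patch generated file" as a strategy to avoid.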

Task-Specific Memory

Stores contextual knowledge associated with specific types of tasks.

Responsibilities:

classify tasks into contextual domains
persist task-related insights
retrieve domain-relevant context

Goal: improve contextual precision for repeated workflows.

Memory Graph

Transforms contextual memory into a connected knowledge structure.

Responsibilities:

link tasks, files and decisions
map contextual relationships
enable graph-based context discovery

Goal: move from isolated memory records to connected contextual knowledge.
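Graph-based discovery can be sketched with an undirected adjacency map and a bounded breadth-first search. The node naming scheme (task:, file:, decision: prefixes) and the max_hops parameter are hypothetical.

```python
from collections import defaultdict, deque


class MemoryGraph:
    def __init__(self) -> None:
        self.edges: dict[str, set[str]] = defaultdict(set)

    def link(self, a: str, b: str) -> None:
        """Connect two memory items (tasks, files, decisions) in both directions."""
        self.edges[a].add(b)
        self.edges[b].add(a)

    def discover(self, start: str, max_hops: int = 2) -> list[str]:
        """Breadth-first search: everything reachable within `max_hops` links."""
        seen = {start}
        queue = deque([(start, 0)])
        found = []
        while queue:
            node, hops = queue.popleft()
            if hops == max_hops:
                continue  # do not expand beyond the hop limit
            for neighbor in sorted(self.edges.get(node, ())):
                if neighbor not in seen:
                    seen.add(neighbor)
                    found.append(neighbor)
                    queue.append((neighbor, hops + 1))
        return found


graph = MemoryGraph()
graph.link("task:fix-login", "file:auth/session.py")
graph.link("file:auth/session.py", "decision:use-refresh-tokens")
graph.link("decision:use-refresh-tokens", "task:rotate-keys")
related = graph.discover("task:fix-login")
```

Starting from the login task, the search reaches the file and the decision attached to it, but stops before the unrelated key-rotation task three links away.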

Granular Telemetry

Extends telemetry from whole-task estimates into task-plus-phase observability.

Responsibilities:

preserve backward-compatible task-level savings logs
support optional phase/subtask instrumentation
identify expensive task segments such as repo scan, test loops, or follow-up prompts
expose granular coverage and hot spots in summaries

Goal: explain where context and token cost concentrate inside a task, not only across tasks.
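The sketch below shows one way task-level and phase-level records could coexist in the same log. The event fields (task, phase, tokens) and the summary keys are assumptions chosen for illustration.

```python
from collections import defaultdict


def summarize_phases(events: list[dict]) -> dict:
    """Aggregate token cost per phase and report how many events are granular."""
    totals: dict[str, int] = defaultdict(int)
    task_level_only = 0
    for event in events:
        phase = event.get("phase")  # optional: older task-level records omit it
        if phase is None:
            task_level_only += 1
        else:
            totals[phase] += event["tokens"]
    hot_spots = sorted(totals, key=totals.get, reverse=True)
    coverage = (len(events) - task_level_only) / len(events) if events else 0.0
    return {
        "phase_tokens": dict(totals),
        "hot_spots": hot_spots,
        "granular_coverage": coverage,
    }


summary = summarize_phases([
    {"task": "fix-login", "tokens": 900},                        # task-level only
    {"task": "fix-login", "phase": "repo_scan", "tokens": 400},
    {"task": "fix-login", "phase": "test_loop", "tokens": 1500},
    {"task": "fix-login", "phase": "repo_scan", "tokens": 300},
])
```

Records without a phase stay valid, which is what keeps the format backward compatible; they simply lower the granular coverage figure.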

Knowledge Mods (Iteration 11)

Iteration 11 introduces generic knowledge modules ("mods") and a local learning workspace.

The engine can now create domain-specific knowledge areas on demand, allowing Codex to learn structured information about topics relevant to the project.

Example commands:

learn ux
aprende ux
study accessibility
learn architecture

When a mod is requested for the first time, the engine automatically creates a local workspace:

.codex_library/mods/<mod_id>/

This enables Codex to accumulate domain knowledge across executions, not just project memory.
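First-use workspace creation might be sketched as follows. The verbs come from the example commands above and the folder names from the library layout described in this README; the parsing rule, the mod_id normalization, and the contents of mod.json are assumptions.

```python
import json
import re
import tempfile
from pathlib import Path
from typing import Optional

LEARN_VERBS = ("learn", "aprende", "study")
MOD_DIRS = ("inbox", "notes", "summaries", "indices", "manifests")


def parse_learn_command(command: str) -> Optional[str]:
    """Return a mod id for commands like 'learn ux', or None if not a learn command."""
    parts = command.strip().lower().split(maxsplit=1)
    if len(parts) == 2 and parts[0] in LEARN_VERBS:
        return re.sub(r"[^a-z0-9]+", "_", parts[1]).strip("_")
    return None


def ensure_mod(library: Path, mod_id: str) -> Path:
    """Create the mod workspace on first use; later calls leave it untouched."""
    mod_root = library / "mods" / mod_id
    for name in MOD_DIRS:
        (mod_root / name).mkdir(parents=True, exist_ok=True)
    mod_file = mod_root / "mod.json"
    if not mod_file.exists():
        mod_file.write_text(json.dumps({"id": mod_id}), encoding="utf-8")  # hypothetical schema
    return mod_root


# Demo in a temporary directory.
library = Path(tempfile.mkdtemp()) / ".codex_library"
mod_id = parse_learn_command("aprende ux")
mod_root = ensure_mod(library, mod_id)
```

Calling ensure_mod a second time is a no-op, so repeated learn commands for the same topic reuse the existing workspace.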

Knowledge Processing Pipeline (Iteration 12)

Iteration 12 adds a document processing pipeline that converts raw documents into reusable contextual artifacts.

Documents placed in a mod inbox are processed into compact artifacts such as:

notes/
summaries/
indices/
manifests/

Typical pipeline stages:

detect documents
-> extract text
-> normalize
-> semantic split
-> topic extraction
-> note generation
-> summary generation
-> index generation
-> manifest update

The goal is to avoid loading large source documents repeatedly and instead reuse small structured artifacts.
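The stages can be chained as plain functions. In this sketch the "semantic split" is replaced by a split on blank lines and topic extraction by a most-frequent-word heuristic; both are deliberate simplifications, and the artifact shapes are hypothetical.

```python
import re


def normalize(text: str) -> str:
    """Unify line endings and collapse runs of spaces and tabs."""
    return re.sub(r"[ \t]+", " ", text.replace("\r\n", "\n")).strip()


def split_sections(text: str) -> list[str]:
    """Stand-in for the semantic split: break on blank lines."""
    return [part.strip() for part in re.split(r"\n\s*\n", text) if part.strip()]


def extract_topic(section: str) -> str:
    """Stand-in for topic extraction: most frequent word of five or more letters."""
    words = re.findall(r"[a-z]{5,}", section.lower())
    return max(sorted(set(words)), key=words.count) if words else "misc"


def process_document(name: str, raw: str) -> dict:
    """Run normalize -> split -> topics -> notes -> summary -> index -> manifest."""
    sections = split_sections(normalize(raw))
    notes = [{"topic": extract_topic(s), "note": s} for s in sections]
    # Summary: first sentence of each section.
    summary = " ".join(s.split(".")[0].strip() + "." for s in sections)
    index: dict[str, list[int]] = {}
    for position, note in enumerate(notes):
        index.setdefault(note["topic"], []).append(position)
    manifest = {"source": name, "notes": len(notes)}
    return {"notes": notes, "summary": summary, "index": index, "manifest": manifest}


artifacts = process_document(
    "a11y.txt",
    "Contrast matters. Contrast ratios should meet the guideline.\n\n"
    "Keyboard focus must be visible.   Keyboard traps are a failure.",
)
```

The index maps each topic to note positions, so a later lookup can load one note instead of the whole source document.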

Knowledge Retrieval Engine (Iteration 13)

Iteration 13 introduces a retrieval layer that selects the most relevant artifacts for a given request.

Instead of re-reading entire documents, the engine:

detects relevant knowledge mods
identifies likely topics
consults lightweight indices
loads a minimal set of notes or summaries

Example retrieval flow:

request
-> detect mod
-> detect topic
-> consult index
-> load minimal artifact set
-> assemble context

This ensures that Codex receives high-value contextual knowledge with minimal token cost.
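The retrieval flow can be approximated with plain dictionary lookups. The library structure (keywords, index, notes per mod) and the max_artifacts cap are assumptions for this sketch; matching is by exact word, which a real implementation would likely refine.

```python
def retrieve(request: str, library: dict, max_artifacts: int = 2) -> list[str]:
    """detect mod -> detect topic -> consult index -> load a minimal artifact set."""
    words = set(request.lower().split())
    context: list[str] = []
    for mod in library.values():
        if not words & set(mod["keywords"]):
            continue  # this mod is not relevant to the request
        for topic, note_ids in mod["index"].items():
            if topic not in words:
                continue
            for note_id in note_ids:
                if len(context) < max_artifacts:
                    context.append(mod["notes"][note_id])
    return context


# Hypothetical in-memory view of two mods and their lightweight indices.
library = {
    "accessibility": {
        "keywords": ["accessibility", "a11y", "contrast", "keyboard"],
        "index": {"contrast": [0], "keyboard": [1, 2]},
        "notes": ["Contrast note", "Focus note", "Keyboard trap note"],
    },
    "architecture": {
        "keywords": ["architecture", "layering"],
        "index": {"layering": [0]},
        "notes": ["Layering note"],
    },
}
context = retrieve("improve keyboard navigation", library)
```

A request that matches no mod returns an empty list, so unrelated tasks pay no token cost for the knowledge library.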

Reference-Based Ingestion (Iteration 14)

Iteration 14 lets a mod ingest files without copying them into the inbox.

A mod can track external files through:

.codex_library/mods/<mod_id>/inbox/references.md

The pipeline reads references.md during processing and can ingest supported files from elsewhere in the repository or filesystem.

Key behaviors:

supports file references outside the inbox
tracks changes by modification time
invalidates stale derived artifacts automatically
regenerates notes, summaries, indices, and manifests when needed

This reduces duplication and keeps mod knowledge tied directly to the real source files.
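Change tracking by modification time could work roughly as below. The format of references.md is not described here, so the sketch assumes the references are already parsed into paths; the mtimes.json state file is hypothetical.

```python
import json
import os
import tempfile
from pathlib import Path


def stale_references(references: list[Path], state_file: Path) -> list[Path]:
    """Return references whose mtime differs from the recorded one, then record it."""
    known = json.loads(state_file.read_text()) if state_file.exists() else {}
    stale = []
    for ref in references:
        mtime = ref.stat().st_mtime_ns
        if known.get(str(ref)) != mtime:
            stale.append(ref)  # derived artifacts for this file must be regenerated
            known[str(ref)] = mtime
    state_file.write_text(json.dumps(known))
    return stale


# Demo in a temporary directory.
workdir = Path(tempfile.mkdtemp())
doc = workdir / "guide.md"
doc.write_text("v1")
state = workdir / "mtimes.json"

first = stale_references([doc], state)   # never seen: needs processing
second = stale_references([doc], state)  # unchanged: nothing to do
# Simulate a later edit by moving the modification time forward one second.
info = doc.stat()
os.utime(doc, ns=(info.st_atime_ns, info.st_mtime_ns + 1_000_000_000))
third = stale_references([doc], state)   # modified: artifacts are stale
```

Modification time is cheap to check but can miss edits that preserve the timestamp; a content hash would be a stricter alternative at higher cost.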

Remote Knowledge Ingestion (Iteration 15)

Iteration 15 adds a remote acquisition layer to the engine.

The goal is not to make retrieval depend on live web access. Instead, the engine can:

register documentation URLs
fetch them
snapshot them locally
extract normalized text
emit canonical local documents into the existing pipeline

This keeps the engine aligned with the principle:

remote acquisition, local reasoning

Inside each mod, remote sources live under:

.codex_library/mods/<mod_id>/remote_sources/

with entries such as:

manifest.json
raw/
snapshots/
extracted/

The fetched outputs are then routed back into the normal local learning flow.
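The snapshot and extraction steps can be sketched without any network access by passing in HTML that has already been fetched. The manifest schema, the file naming by URL hash, and the crude tag stripping are all assumptions made for this example.

```python
import hashlib
import json
import re
import tempfile
from html import unescape
from pathlib import Path


def snapshot(remote_dir: Path, url: str, html: str) -> Path:
    """Store the raw page, extract normalized text, and record it in manifest.json."""
    key = hashlib.sha256(url.encode()).hexdigest()[:12]  # hypothetical naming scheme
    for sub in ("raw", "extracted"):
        (remote_dir / sub).mkdir(parents=True, exist_ok=True)
    (remote_dir / "raw" / f"{key}.html").write_text(html, encoding="utf-8")

    # Crude extraction: drop tags, decode entities, collapse whitespace.
    text = unescape(re.sub(r"<[^>]+>", " ", html))
    text = " ".join(text.split())
    extracted = remote_dir / "extracted" / f"{key}.txt"
    extracted.write_text(text, encoding="utf-8")

    manifest_path = remote_dir / "manifest.json"
    manifest = (
        json.loads(manifest_path.read_text()) if manifest_path.exists() else {"sources": {}}
    )
    manifest["sources"][url] = {
        "key": key,
        "sha256": hashlib.sha256(html.encode()).hexdigest(),
    }
    manifest_path.write_text(json.dumps(manifest, indent=2))
    return extracted


# Demo with inline HTML standing in for a fetched page.
remote_dir = Path(tempfile.mkdtemp()) / "remote_sources"
out = snapshot(
    remote_dir,
    "https://example.com/docs",
    "<h1>Focus</h1><p>Keep focus &amp; order visible.</p>",
)
```

Once the extracted text exists on disk, later retrieval reads only local files, which is the "remote acquisition, local reasoning" split in practice.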

Example

Without the engine:

Task
→ Codex explores the repository
→ builds context
→ executes

With codex_context_engine:

Task
↓
Planner loads relevant context
↓
Optimizer filters context
↓
Codex executes with focused context
↓
Communication layer compresses runtime updates
↓
Failure memory records outcome
↓
Memory graph connects new knowledge
↓
Telemetry explains cost distribution
↓
Knowledge mods store domain knowledge
↓
Retrieval engine loads only relevant artifacts

Key Characteristics

Autoincremental

The engine continuously accumulates contextual understanding across tasks.

Each execution improves future executions.

Context-first execution

Instead of immediately running a task, the engine first determines:

what context exists
what context is relevant
what context should be ignored

External memory

Contextual knowledge is stored outside the model, allowing:

persistent project awareness
cross-task learning
reproducible contextual state

Failure-aware learning

The engine records mistakes and avoids repeating them.

This reduces exploration cost and accelerates problem solving.

Knowledge-aware execution

The engine can now also accumulate domain knowledge, not just project context.

This allows Codex to reuse learned knowledge across tasks and projects.

Communication-efficient execution

The engine now also optimizes how runtime information is communicated.

This reduces token waste in long implementation loops by compressing progress updates, findings, and execution summaries while preserving technical precision.

Local Knowledge Library

The knowledge system uses a local workspace:

.codex_library/

Typical structure:

.codex_library/
  registry.json
  mods/
    <mod_id>/
      inbox/
      remote_sources/
      sources/
      processed/
      notes/
      summaries/
      indices/
      manifests/
      mod.json

Users can add source documents to a mod in three ways:

Option A — drop files into the inbox

.codex_library/mods/<mod_id>/inbox/

Option B — reference external files

.codex_library/mods/<mod_id>/inbox/references.md

Option C — register remote URL sources

.codex_library/mods/<mod_id>/remote_sources/manifest.json

Running the learning or ingestion flow again triggers processing through the existing pipeline.

MCP Support

The knowledge system is designed to integrate with MCP servers when available:

filesystem MCP — local document access
git MCP — repository awareness
fetch MCP — optional external enrichment
playwright MCP — UI inspection for UX-related knowledge

All MCP integrations remain optional.

The engine continues to function fully in local-only mode.

What This Is Not

This project is not:

a prompt collection
an AI agent framework
a replacement for Codex

Instead, it is a context orchestration layer designed to help Codex maintain and evolve contextual understanding across tasks.

Why This Exists

Large AI-assisted projects frequently suffer from:

context fragmentation
repeated discovery work
token inefficiency
lack of long-term memory
noisy execution loops

codex_context_engine is an experiment in treating context as a first-class system, not a temporary prompt artifact.

Current Status

The engine currently implements:

context planning
context optimization
communication compression for runtime updates
persistent failure memory
task-specific contextual memory
graph-based contextual relationships
granular task-plus-phase telemetry
generic domain knowledge modules
document ingestion and processing
knowledge retrieval with minimal context loading
reference-based local ingestion
remote knowledge ingestion

Together these components create a layered contextual architecture that progressively improves Codex performance on complex repositories.

Auto-Initialize Across ~/projects

If you want every Git repository under ~/projects to be integrated automatically, use the cross-project installer plus the macOS launchd agent in scripts/.

Requirement:

Ruby must be installed and available in PATH, because the repository scanner is scripts/install_cross_project_for_all_repos.rb.

Manual one-shot integration:

ruby scripts/install_cross_project_for_all_repos.rb

Automatic background integration on macOS:

./scripts/install_launch_agent.sh

What this does:

scans ~/projects for Git repositories
creates or updates .codex_context_engine/state.json in each repository
creates AGENTS.md when missing
appends or refreshes a managed codex_context_engine block inside an existing AGENTS.md
re-runs automatically at login, every 5 minutes, and whenever ~/projects changes
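The "append or refresh a managed block" step is what makes repeated runs safe. The shipped installer is a Ruby script; the Python sketch below only illustrates the idea, and the marker comments are hypothetical rather than the ones the installer actually writes.

```python
import re

BEGIN = "<!-- codex_context_engine:begin -->"  # hypothetical marker
END = "<!-- codex_context_engine:end -->"      # hypothetical marker


def upsert_managed_block(existing: str, body: str) -> str:
    """Replace the managed block if present, otherwise append it."""
    block = f"{BEGIN}\n{body.strip()}\n{END}"
    pattern = re.compile(re.escape(BEGIN) + r".*?" + re.escape(END), re.DOTALL)
    if pattern.search(existing):
        # Refresh in place; content outside the markers is left untouched.
        return pattern.sub(lambda _match: block, existing, count=1)
    if not existing.strip():
        return block + "\n"  # file missing or empty: the block is the whole file
    return existing.rstrip("\n") + "\n\n" + block + "\n"


once = upsert_managed_block("# Agents\n\nProject rules.\n", "engine v1")
twice = upsert_managed_block(once, "engine v2")
```

Because the update is idempotent, a job that re-runs every few minutes does not duplicate the block or disturb hand-written content in AGENTS.md.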

Useful environment variables:

CODEX_PROJECTS_DIR to target a directory other than ~/projects
CODEX_ENGINE_REPO to point to a shared engine repository in a different path
CODEX_LAUNCH_AGENT_LABEL to override the default launchd label

To remove the background agent:

./scripts/uninstall_launch_agent.sh

Philosophy

The engine follows a simple principle:

Context should evolve with the project.

Instead of rebuilding understanding every time, the system gradually accumulates structural and experiential knowledge.

Over time this transforms Codex from a stateless assistant into a context-aware and communication-efficient development collaborator.

License

MIT
