Helixir

Graph-based persistent memory for LLM agents.
Associative recall, causal reasoning, ontology classification — out of the box.

Quick Start · How It Works · Ontology · Graph Schema · MCP Tools · Configuration

What is Helixir?

Helixir gives AI agents memory that persists between sessions. When an agent powered by Helixir starts a new conversation, it recalls past decisions, preferences, goals, and reasoning chains — not from a flat log, but from a graph of interconnected facts.

Every piece of information is LLM-extracted into atomic facts, classified by ontology (8 types), linked to named entities, and stored with vector embeddings for semantic search. Duplicate detection, contradiction tracking, and supersession happen automatically.

Built on HelixDB (graph + vector database) with native MCP support for Cursor, Claude Desktop, and any MCP-compatible client.

Flat memory (key-value / embeddings only)	Helixir (graph + vector + ontology)
Retrieves similar text chunks	Retrieves facts and their connections
No deduplication — grows forever	Smart dedup: ADD / UPDATE / SUPERSEDE / NOOP
No reasoning trail	Causal chains: A BECAUSE B, A IMPLIES C
All memories equal	Ontology: skills vs preferences vs goals
Single-user	Cross-user: shared facts, conflict detection

Quick Start

One-command install

curl -fsSL https://raw.githubusercontent.com/nikita-rulenko/Helixir/main/install.sh | bash

The script will:

Check prerequisites (Rust, Docker)
Clone the repo and build from source
Start HelixDB via Docker
Deploy the graph schema
Generate MCP config for your IDE

Or install manually:

git clone https://github.com/nikita-rulenko/Helixir.git
cd helixir

make build          # Build release binary
make setup          # Start HelixDB + deploy schema
make config         # Print MCP config to paste into your IDE

Prerequisites

Rust 1.83+ — rustup.rs
Docker — for HelixDB (install)
API key — at least one LLM provider:
- Cerebras (free tier, ~3000 tok/s)
- OpenAI
- Ollama (local, no key needed)

How It Works

           Input: "I deployed the server to AWS and prefer using Terraform"
                                      |
                                LLM Extraction
                                      |
                      +---------------+---------------+
                      |                               |
              Memory: "I deployed         Memory: "I prefer
              the server to AWS"          using Terraform"
              type: action                type: preference
                      |                               |
                +-----+-----+                   +-----+-----+
                |           |                   |           |
            Entity:     Entity:            Entity:      Concept:
            "AWS"       "server"           "Terraform"  Preference
                      |
                Phase 1: Personal search (dedup check)
                Phase 2: Cross-user search (shared facts)
                      |
                Decision: ADD / UPDATE / SUPERSEDE / NOOP
                      |
                Store in HelixDB (graph + vector)

Architecture

MCP Server (stdio)                        IDE (Cursor / Claude Desktop)
       |                                           |
  HelixirClient                               MCP Protocol
       |
  ToolingManager ──── FastThinkManager
       |                    |
  +----+----+----+     petgraph (in-memory)
  |    |    |    |          |
Extract Decision Entity  commit to DB
  |    Engine  Manager       |
Search    |    Ontology      |
Engine  Reasoning Manager    |
  |    Engine    |           |
  +----+----+----+-----------+
       |
  HelixDB Client (HTTP)
       |
  HelixDB (graph + vector database)

See full architectural diagrams in helixir/diagrams/.

Ontology

Every memory is classified into one of 8 concept types. The LLM extractor assigns the type during ingestion; search_by_concept retrieves memories by type.

Type	What it captures	Example
fact	Objective knowledge, statements about the world	"Rust compiles to native code"
preference	Likes, dislikes, tastes, favorites	"I prefer dark mode in all editors"
skill	Abilities, competencies, expertise	"I can write fluent Python"
goal	Plans, aspirations, objectives	"I want to learn Japanese this year"
opinion	Subjective beliefs, judgments, viewpoints	"I think remote work is more productive"
experience	Past events, situations lived through	"I lived in Berlin for 3 years"
achievement	Accomplished milestones, completed goals	"I built a working compiler from scratch"
action	Specific tasks performed, operations executed	"I deployed the CI/CD pipeline yesterday"

Ontology hierarchy

The concept types are organized into a tree stored in HelixDB:

Thing
  ├── Attribute
  │     ├── Fact
  │     ├── Preference
  │     ├── Skill
  │     ├── Goal
  │     ├── Opinion
  │     └── Trait
  ├── Event
  │     ├── Action
  │     ├── Experience
  │     └── Achievement
  ├── Entity
  │     ├── Person
  │     ├── Organization
  │     ├── Location
  │     ├── Object
  │     └── Technology
  ├── Relation
  └── State

The hierarchy enables traversal: searching for "Attribute" returns all facts, preferences, skills, goals, and opinions. Entity types (Person, Organization, etc.) are used for extracted named entities.

Graph Schema

Helixir stores everything as a typed graph: 15 node types connected by 33 edge types.

Node types

Node	Purpose	Key fields
Memory	Core unit — one atomic fact	content, memory_type, certainty, importance, user_id
User	Owner of memories	user_id, name
Entity	Named thing extracted from text	name, entity_type, aliases
Concept	Ontology node (Fact, Skill, Goal...)	name, level, parent_id
Context	Situational scope (work, personal...)	name, context_type
Session	Conversation session	session_id, status
Agent	AI agent that created a memory	agent_id, role, capabilities
HistoryEvent	Audit log entry for a memory	action, old_value, new_value, timestamp
MemoryChunk	Fragment of a long memory	content, position, token_count
Reasoning	Reasoning node	reasoning_type, confidence
Constraint	Rule applied in a context	rule, constraint_type, priority
MemoryEmbedding	Vector embedding (search index)	content, created_at
EntityEmbedding	Vector embedding for entity search	name
DocPage / DocChunk / CodeExample / ErrorCode	Documentation pipeline (reserved)	—

Edge types (active)

These 24 edge types are used in the current pipeline:

Edge	From → To	What it means
HAS_MEMORY	User → Memory	User owns this memory
INSTANCE_OF	Memory → Concept	Memory is of this ontology type
BELONGS_TO_CATEGORY	Memory → Concept	Memory belongs to this category
MENTIONS	Memory → Entity	Memory mentions this entity
EXTRACTED_ENTITY	Memory → Entity	Entity was LLM-extracted from this memory
RELATES_TO	Entity → Entity	Two entities are related (typed: works_at, uses, etc.)
VALID_IN	Memory → Context	Memory applies in this context (work, personal...)
OCCURRED_IN	Memory → Context	Memory is about an event in this context
AGENT_CREATED	Agent → Memory	This agent created the memory
HAS_HISTORY	Memory → HistoryEvent	Audit trail: who changed what and when
HAS_CHUNK	Memory → MemoryChunk	Memory split into chunks (long texts)
NEXT_CHUNK	MemoryChunk → MemoryChunk	Sequential chunk ordering
CHUNK_HAS_EMBEDDING	MemoryChunk → MemoryEmbedding	Chunk's vector index
MEMORY_RELATION	Memory → Memory	General relation between memories (typed)
IMPLIES	Memory → Memory	A logically leads to B
BECAUSE	Memory → Memory	A is the reason for B
CONTRADICTS	Memory → Memory	A conflicts with B
SUPERSEDES	Memory → Memory	A replaces outdated B
HAS_EMBEDDING	Memory → MemoryEmbedding	Memory's vector index for semantic search
ENTITY_HAS_EMBEDDING	Entity → EntityEmbedding	Entity's vector index
HAS_SUBTYPE	Concept → Concept	Ontology hierarchy (Attribute → Skill)
PAGE_TO_CHUNK	DocPage → DocChunk	Documentation structure
CHUNK_TO_EMBEDDING	DocChunk → ChunkEmbedding	Documentation vector index
SUPPORTS	Memory → Memory	A provides evidence for B

Edge types (reserved)

These 9 edge types are defined in the schema with HQL queries ready, but not yet called from the Rust pipeline. They are infrastructure for planned features:

Edge	From → To	Planned use
IN_SESSION	User → Session	Session tracking
CREATED_IN	Memory → Session	Which session created this memory
IS_A	Concept → Concept	Dynamic ontology extension
CONCEPT_RELATED_TO	Concept → Concept	Cross-concept links
PART_OF	Entity → Entity	Hierarchical entity relations
APPLIES_IN	Constraint → Context	Constraint scoping
CHUNK_MENTIONS_CONCEPT	DocChunk → Concept	Documentation ↔ ontology links
CONCEPT_HAS_EXAMPLE	Concept → CodeExample	Code examples per concept
ERROR_REFERENCES_CONCEPT	ErrorCode → Concept	Error catalog

MCP Tools

Memory

Tool	What it does
`add_memory`	Extract atomic facts from text, deduplicate, store with entities and relations
`search_memory`	Semantic search with temporal modes: `recent` (4h), `contextual` (30d), `deep` (90d), `full`
`search_by_concept`	Filter by ontology type: skill, preference, goal, fact, opinion, experience, achievement, action
`search_reasoning_chain`	Traverse causal/logical connections: IMPLIES, BECAUSE, CONTRADICTS, SUPPORTS
`get_memory_graph`	Return memory as a graph of nodes and edges
`update_memory`	Modify existing memory content
`search_incomplete_thoughts`	Find auto-saved incomplete FastThink sessions

FastThink (working memory)

Isolated scratchpad for complex reasoning. Nothing pollutes long-term memory until you explicitly commit.

Tool	What it does
`think_start`	Open a new thinking session
`think_add`	Add a reasoning step (types: reasoning, hypothesis, observation, question)
`think_recall`	Pull facts from long-term memory into the session (read-only)
`think_conclude`	Mark a conclusion
`think_commit`	Save the conclusion to long-term memory
`think_discard`	Discard the session without saving
`think_status`	Check session state: thought count, depth, elapsed time

Flow: think_start → think_add (repeat) → think_recall (optional) → think_conclude → think_commit

If a session times out, partial thoughts are auto-saved with an [INCOMPLETE] tag and recoverable via search_incomplete_thoughts.

Integration

Cursor

Add to ~/.cursor/mcp.json:

{
  "mcpServers": {
    "helixir": {
      "command": "/path/to/helixir-mcp",
      "env": {
        "HELIX_HOST": "localhost",
        "HELIX_PORT": "6969",
        "HELIX_LLM_PROVIDER": "cerebras",
        "HELIX_LLM_MODEL": "gpt-oss-120b",
        "HELIX_LLM_API_KEY": "YOUR_KEY",
        "HELIX_EMBEDDING_PROVIDER": "openai",
        "HELIX_EMBEDDING_MODEL": "nomic-embed-text-v1.5",
        "HELIX_EMBEDDING_URL": "https://openrouter.ai/api/v1",
        "HELIX_EMBEDDING_API_KEY": "YOUR_KEY"
      }
    }
  }
}

Claude Desktop

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%\Claude\claude_desktop_config.json

Same JSON structure as above.

Cursor Rules (recommended)

Add to Cursor Settings > Rules so the agent actually uses its memory:

# Core Memory Behavior
- At conversation start, call search_memory to recall relevant context
- After completing tasks, save key outcomes with add_memory
- Use search_by_concept for skill/preference/goal queries
- Use search_reasoning_chain for "why" questions

# FastThink for Complex Reasoning
- Before major decisions, use FastThink to structure your reasoning
- Flow: think_start -> think_add (repeat) -> think_recall -> think_conclude -> think_commit

# What to Save
- ALWAYS save: decisions, outcomes, architecture changes, error fixes, preferences
- NEVER save: grep results, lint output, file contents, temporary data

Configuration

All settings are passed as environment variables.

Required

Variable	Description
`HELIX_HOST`	HelixDB address (default: `localhost`)
`HELIX_PORT`	HelixDB port (default: `6969`)
`HELIX_LLM_API_KEY`	API key for the LLM provider
`HELIX_EMBEDDING_API_KEY`	API key for the embedding provider

Optional

Variable	Default	Description
`HELIX_LLM_PROVIDER`	`cerebras`	`cerebras`, `openai`, `ollama`
`HELIX_LLM_MODEL`	`gpt-oss-120b`	Model name
`HELIX_LLM_BASE_URL`	—	Custom endpoint (for Ollama)
`HELIX_EMBEDDING_PROVIDER`	`openai`	`openai`, `ollama`
`HELIX_EMBEDDING_URL`	`https://openrouter.ai/api/v1`	Embedding API URL
`HELIX_EMBEDDING_MODEL`	`nomic-embed-text-v1.5`	Embedding model
`RUST_LOG`	`helixir=warn`	Log level

Provider presets

Cerebras + OpenRouter (recommended — fast inference, cheap embeddings)

HELIX_LLM_PROVIDER=cerebras
HELIX_LLM_MODEL=gpt-oss-120b
HELIX_LLM_API_KEY=csk-xxx           # https://cloud.cerebras.ai

HELIX_EMBEDDING_PROVIDER=openai
HELIX_EMBEDDING_URL=https://openrouter.ai/api/v1
HELIX_EMBEDDING_MODEL=nomic-embed-text-v1.5
HELIX_EMBEDDING_API_KEY=sk-or-xxx   # https://openrouter.ai/keys

Fully local with Ollama (no API keys, fully private)

# Install Ollama: https://ollama.com
ollama pull llama3:8b
ollama pull nomic-embed-text

HELIX_LLM_PROVIDER=ollama
HELIX_LLM_MODEL=llama3:8b
HELIX_LLM_BASE_URL=http://localhost:11434

HELIX_EMBEDDING_PROVIDER=ollama
HELIX_EMBEDDING_URL=http://localhost:11434
HELIX_EMBEDDING_MODEL=nomic-embed-text

OpenAI only (simple, one API key)

HELIX_LLM_PROVIDER=openai
HELIX_LLM_MODEL=gpt-4o-mini
HELIX_LLM_API_KEY=sk-xxx

HELIX_EMBEDDING_PROVIDER=openai
HELIX_EMBEDDING_MODEL=text-embedding-3-small
HELIX_EMBEDDING_API_KEY=sk-xxx

Development

make build          # Build release binary
make test           # Run all tests
make check          # cargo check + clippy
make run            # Run MCP server locally (debug)
make deploy-schema  # Deploy schema to running HelixDB
make docker-up      # Start HelixDB container
make docker-down    # Stop HelixDB container
make test-e2e-hive  # Hive cross-user E2E (HelixDB + LLM + embeddings; set HELIX_* like MCP)

Hive E2E: make test-e2e-hive runs hive_cross_user_collective_link_e2e (ignored by default in cargo test). It adds the same fact for two user_id values and asserts collective user_count ≥ 2 on the first memory. LLM decisions can be flaky—retry if needed.

Project structure

helixir-rs/
  helixir/
    src/
      bin/
        helixir_mcp.rs          # MCP server entry point
        helixir_deploy.rs       # Schema deployment CLI
      core/                     # Config, client, search modes
      db/                       # HelixDB client
      llm/                      # LLM providers, extractor, decision engine
      mcp/                      # MCP server, params, cognitive protocol
      toolkit/
        tooling_manager/        # Main pipeline (add, search, CRUD, events)
        mind_toolbox/           # Search engine, entity, ontology, reasoning
        fast_think/             # Working memory (petgraph-based)
    schema/
      schema.hx                 # Node/edge definitions (15 nodes, 33 edges)
      queries.hx                # HQL queries (100+)
    diagrams/                   # Architecture diagrams (D2 + PNG)
    Dockerfile
    docker-compose.yml

License

Links

HelixDB — graph + vector database
MCP Specification — Model Context Protocol
Cerebras — fast LLM inference (free tier)
OpenRouter — unified LLM/embedding API

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
.snapshots		.snapshots
helixir		helixir
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
helixir-logo.jpeg		helixir-logo.jpeg
install.sh		install.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Helixir

What is Helixir?

Quick Start

One-command install

Prerequisites

How It Works

Architecture

Ontology

Ontology hierarchy

Graph Schema

Node types

Edge types (active)

Edge types (reserved)

MCP Tools

Memory

FastThink (working memory)

Integration

Cursor

Claude Desktop

Cursor Rules (recommended)

Configuration

Required

Optional

Provider presets

Development

Project structure

License

Links

About

Uh oh!

Releases 9

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Helixir

What is Helixir?

Quick Start

One-command install

Prerequisites

How It Works

Architecture

Ontology

Ontology hierarchy

Graph Schema

Node types

Edge types (active)

Edge types (reserved)

MCP Tools

Memory

FastThink (working memory)

Integration

Cursor

Claude Desktop

Cursor Rules (recommended)

Configuration

Required

Optional

Provider presets

Development

Project structure

License

Links

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 9

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages