Principia Diagnostics

An LLM-operated graph coherence engine for research datasets.

This is not a traditional CLI tool. Principia is designed to be driven by an LLM code assistant — Claude Code, Cursor, Cline, or any agent that can read files and run shell commands. The LLM reads CLAUDE.md on session start, which contains the complete pipeline instructions. The human provides datasets and makes judgment calls; the LLM handles orchestration, interpretation, and reporting.

Give it a structured dataset — metabolic networks, taxonomies, power grids, chemical databases, knowledge graphs — and it produces a diagnostic report showing how internally coherent the dataset is and how well it grounds to known scientific foundations.

How This Is Meant to Be Used

Primary: LLM-Driven (Recommended)

The intended workflow is human + LLM code assistant:

1. Clone repo, run setup.sh
2. Open the project in an LLM code assistant (Claude Code, Cursor, etc.)
3. The LLM auto-loads CLAUDE.md — the full pipeline instruction set
4. Tell the LLM what you want:

   Human: "Analyze this metabolic network for me"
       ↓
   LLM reads CLAUDE.md → runs each pipeline step → interprets results
       ↓
   LLM: "PFD Score 0.973 — internally consistent, well-integrated.
         Top hub: pyruvate (D_eff=11). Strongest reference anchor: CHEM5.
         HTML reports saved — open data/reports/ in your browser."

5. The LLM can also:
   - Write a new parser for your data format
   - Explain why a dataset scored the way it did
   - Compare structural patterns across datasets
   - Walk you through claim extraction from papers (with human review gate)

Why LLM-first? The pipeline involves multi-step reasoning, knowledge graph traversal, and natural language interpretation of structural patterns. An LLM turns raw diagnostic output into actionable insight — "pyruvate is the top hub because it sits at the TCA/glycolysis intersection" rather than just "node met_pyr_c: D_eff=11."

CLAUDE.md is the complete LLM instruction set — it covers every pipeline step, what to check, what to report, and when to pause for human judgment.

Secondary: Manual CLI

Every command also works without an LLM for scripting or manual use:

What It Does

The LLM (or CLI) drives a 6-step diagnostic pipeline:

Your Dataset (CSV / JSON / MATPOWER / PDF / custom)
    |
[1] INGEST         -> entries + links -> rrp_*.db (SQLite)
[2] BUILD GRAPH    -> NetworkX internal graph
[3] TIER-1         -> Fisher effective dimension (D_eff), coherence score,
    |                  regime distribution -> HTML report + D3.js network viz
    |
[4] BUILD BRIDGE   -> Dataset nodes <-> Reference Wiki nodes (semantic similarity)
[5] TIER-2         -> Bridge quality, anchor distribution, domain coverage
    |                  -> HTML report
    |
[6] PFD SCORE      -> 0.0-1.0 combined diagnostic verdict with full reasoning

When driven by an LLM, each step includes natural language interpretation — the LLM explains what the numbers mean and why the dataset scored the way it did. When used manually via CLI, you get the raw diagnostic output.

Key principle: No black-box verdicts. Every score shows its full reasoning chain — what was measured, what thresholds applied, and why.

Example Results

Dataset	Entries	Links	PFD Score	Verdict
E. coli core metabolic network	304	536	0.973	CONSISTENT + WELL-INTEGRATED
Zoo animal taxonomy*	426	437	--	CONSISTENT
Periodic Table*	119	1,671	--	CONSISTENT
IEEE Power Grids* (case14/57/118)	14-118	varies	--	MARGINAL (domain-correct)
CCBH cosmology cluster (3 papers)	22	29	0.882	MARGINAL + WELL-INTEGRATED

Example HTML reports (Tier-1 and Tier-2 for E. coli) are included in data/reports/examples/ — open them in any browser to see what the output looks like.

*Data not included in repo — parsers available in src/ingestion/parsers/ if you want to reproduce these results with your own source files.

Prerequisites

Python 3.11 or higher (tested on 3.11, 3.12, 3.13)
~1GB disk space (repo data + embedding model download on first run)
No API keys needed — everything runs locally
Works on: macOS (Intel or Apple Silicon), Linux, Windows (WSL recommended)

Setup (Mac / Linux)

# 1. Clone the repository
git clone https://github.com/IanD25/principia-diagnostics.git
cd principia-diagnostics

# 2. Run the setup script
#    Creates venv, installs deps, downloads embedding model (~430MB),
#    builds semantic index. First run takes 2-5 minutes.
bash setup.sh

# 3. Activate the virtual environment
source .venv/bin/activate

# 4a. LLM-driven (recommended): Open in your LLM code assistant
#     Claude Code:  claude
#     Cursor:       Open folder in Cursor
#     The LLM auto-loads CLAUDE.md and knows the full pipeline.
#     Just tell it: "Analyze the E. coli dataset for me"

# 4b. Manual CLI:
pfd report --rrp data/rrp/ecoli_core/rrp_ecoli_core.db

The pfd report command will print a diagnostic summary to the terminal and save HTML reports to data/reports/. Open them in a browser to see the interactive D3.js network graph.

Setup (Windows)

# 1. Clone
git clone https://github.com/IanD25/principia-diagnostics.git
cd principia-diagnostics

# 2. Create virtual environment manually
python -m venv .venv
.venv\Scripts\activate

# 3. Install dependencies
pip install -e ".[dev]"

# 4. Build the semantic index (first run downloads the embedding model)
python -m sync

# 5. Run analysis (PYTHONUTF8=1 handles Unicode math symbols in the reference wiki)
set PYTHONUTF8=1
pfd report --rrp data/rrp/ecoli_core/rrp_ecoli_core.db

Alternative: Manual pip install (Mac / Linux)

If you prefer not to use the setup script:

git clone https://github.com/IanD25/principia-diagnostics.git
cd principia-diagnostics
python3 -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"
python3 -m sync          # Build semantic index (~1 min first run)
pfd --help

Commands

# Full PFD report (Tier-1 + Tier-2 + PFD Score)
pfd report --rrp data/rrp/ecoli_core/rrp_ecoli_core.db

# Internal structural analysis only (Tier-1, no reference wiki needed)
pfd internal --rrp data/rrp/ecoli_core/rrp_ecoli_core.db

# Bridge analysis only (Tier-2)
pfd bridge --rrp data/rrp/ecoli_core/rrp_ecoli_core.db

# Reference wiki health check
pfd wiki

# Single node deep-dive
pfd node --node-id CHEM5

# Extract claims from text
pfd extract "We find that k = 3.11 at 90% confidence"

# Resolve a claim against the reference wiki
pfd resolve "entropy increases with dimension"

# Structural alignment on an RRP
pfd align --rrp data/rrp/ccbh/rrp_ccbh_cluster.db

What to Expect

Running pfd report produces:

Terminal output: Summary statistics — entry count, link count, D_eff distribution, regime breakdown, PFD Score
HTML reports: Saved to data/reports/ — interactive D3.js network visualization, coherence charts, bridge histograms
Runtime: 10-30 seconds depending on dataset size and hardware

How It Works

The Reference Wiki

A curated knowledge graph of 209 scientific and logical foundations — conservation laws, thermodynamic bounds, statistical distributions, equilibrium conditions, and more. Each entry has:

Formal description and implications
Typed links to related entries (derives_from, analogous_to, constrains, etc.)
Semantic embeddings for similarity search (generated automatically from entry text)

Fisher Information Matrix (FIM) Diagnostics

Each node in a graph gets analyzed using FIM decomposition:

D_eff (effective dimension) — how many independent information channels a node participates in
Coherence score — how well a node's local structure matches expected behavior
Regime classification — bulk (well-connected), surface (peripheral), bridge (connecting clusters), or isolated

Two-Tier Reports

Tier-1 (Internal): How coherent is the dataset's own internal structure? Includes D_eff distribution, regime breakdown, hub identification, and an interactive network graph.
Tier-2 (Bridge): How well does the dataset connect to known scientific foundations? Includes bridge quality scores, domain coverage heatmap, and anchor distribution.

Claim Extraction

Pattern-based extraction of testable claims from paper prose, with polarity detection (positive/negative/neutral) and mandatory human approval gate before downstream processing. No LLM or API key required.

Adding Your Own Dataset

Principia analyzes datasets by first converting them into an RRP (Research Reference Package) — a standardized SQLite database containing entries (nodes), links (edges), and metadata. The repo includes six parsers for different formats.

Quick version (if your data is already in CSV)

Prepare two CSV files:

entries.csv — one row per entity:

entry_id,title,description,entry_type,domain
NODE1,My First Entry,Description of what this represents,reference_law,physics
NODE2,My Second Entry,Another entity in the dataset,reference_law,chemistry

links.csv — one row per relationship:

source_id,target_id,link_type,confidence_tier
NODE1,NODE2,derives_from,1.5

Then write a parser (see src/ingestion/parsers/ecoli_core_parser.py for a complete example):

from ingestion.rrp_bundle import create_rrp_bundle

# Your parsing logic here — read your data, build entries and links
entries = [...]
links = [...]

create_rrp_bundle("my_dataset.db", entries, links, sections, properties)

Full steps

Create src/ingestion/parsers/your_parser.py (follow existing parser patterns)
Parse your dataset into entries, links, sections, and properties
Call create_rrp_bundle() to produce an RRP SQLite database

Build semantic bridges to the reference wiki:

python scripts/run_entity_catalog_pass.py your_rrp.db data/chroma_db data/ds_wiki.db

Run analysis:
```
pfd report --rrp your_rrp.db
```

Troubleshooting

Problem	Cause	Fix
`python: command not found`	Python not installed or not in PATH	Install Python 3.11+ from python.org or via `brew install python@3.13` (Mac)
`ModuleNotFoundError` after clone	Virtual environment not activated	Run `source .venv/bin/activate` first
First run is slow (~5 min)	Downloading embedding model (~430MB)	Normal — only happens once. Model is cached locally after download.
`sqlite3.OperationalError: no such table`	ChromaDB index not built	Run `python3 -m sync`
Unicode errors on Windows	Python not in UTF-8 mode	Set `PYTHONUTF8=1` before running commands
`pfd: command not found`	Package not installed in editable mode	Run `pip install -e .` from the repo root

Requirements

Python 3.11+ (tested on 3.11, 3.12, 3.13)
~1GB disk (repo data + embedding model)
CPU, CUDA (RTX 2000+), or Apple Silicon — auto-detected, no configuration needed

Core dependencies (installed automatically by setup.sh or pip):

sentence-transformers  # BGE embeddings (auto-downloaded from HuggingFace)
chromadb               # Semantic vector indexing
numpy                  # Numerics
plotly, matplotlib     # Visualization
networkx               # Graph analysis

Project Structure

principia-diagnostics/
|-- CLAUDE.md              # LLM INSTRUCTION SET — auto-loaded by Claude Code / Cursor / Cline
|-- src/
|   |-- config.py              # All paths, model config, thresholds
|   |-- sync.py                # Rebuild ChromaDB from ds_wiki.db
|   |-- cli.py                 # Unified CLI entry point
|   |-- embedder.py            # BGE embedding pipeline
|   |-- analysis/              # Diagnostic tools
|   |   |-- fisher_diagnostics.py   # Core FIM math
|   |   |-- fisher_report.py        # Two-tier PFD report
|   |   |-- claim_extractor.py      # Phase 3 claim extraction
|   |   |-- result_validator.py     # Claim validation + resolution
|   |   +-- structural_alignment.py # Signed polarity scoring
|   |-- ingestion/             # Dataset parsers
|   |   |-- parsers/           # One parser per format (6 included)
|   |   +-- cross_universe_query.py  # Dataset <-> Wiki bridge detection
|   +-- viz/                   # Visualization (D3.js, Plotly)
|       |-- tier1_dashboard.py # Network graph + charts
|       |-- tier1_report.py    # Tier-1 HTML report
|       +-- tier2_report.py    # Tier-2 HTML report
|-- scripts/
|   |-- run_fisher_suite.py    # Main analysis CLI (6 modes)
|   +-- run_entity_catalog_pass.py
|-- data/
|   |-- ds_wiki.db             # Reference knowledge graph (209 entries)
|   +-- rrp/                   # Example datasets
|       |-- ecoli_core/        # E. coli metabolic network (PFD 0.973)
|       +-- ccbh/              # Cosmological coupling papers (PFD 0.882)
+-- tests/                     # Test suite (470+ tests)

Design Philosophy

LLM-first architecture — the system is designed to be operated by an LLM code assistant, with CLAUDE.md as the machine-readable instruction set. The CLI exists for scripting and manual fallback, but the primary interface is natural language via an LLM.
Human-governed — the LLM orchestrates; humans make judgment calls (especially claim review in paper analysis)
Diagnostic, not judge — reports show reasoning, not verdicts
Probabilistic, not boolean — confidence scores (0-1), never VALID/INVALID
Transparent — every output shows full reasoning chain
Reproducible — SQLite databases, deterministic pipelines, no API keys needed
Falsifiable — system is provably wrong when it is; failures are understandable and fixable

License

MIT — see LICENSE.

Contributing

Contributions welcome! See CONTRIBUTING.md for setup instructions and code standards.

The most impactful contribution is adding a parser for a new dataset format — see src/ingestion/parsers/ for examples.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github/workflows		.github/workflows
data		data
docs		docs
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
USER_GUIDE.md		USER_GUIDE.md
pyproject.toml		pyproject.toml
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Principia Diagnostics

How This Is Meant to Be Used

Primary: LLM-Driven (Recommended)

Secondary: Manual CLI

What It Does

Example Results

Prerequisites

Setup (Mac / Linux)

Setup (Windows)

Alternative: Manual pip install (Mac / Linux)

Commands

What to Expect

How It Works

The Reference Wiki

Fisher Information Matrix (FIM) Diagnostics

Two-Tier Reports

Claim Extraction

Adding Your Own Dataset

Quick version (if your data is already in CSV)

Full steps

Troubleshooting

Requirements

Project Structure

Design Philosophy

License

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Principia Diagnostics

How This Is Meant to Be Used

Primary: LLM-Driven (Recommended)

Secondary: Manual CLI

What It Does

Example Results

Prerequisites

Setup (Mac / Linux)

Setup (Windows)

Alternative: Manual pip install (Mac / Linux)

Commands

What to Expect

How It Works

The Reference Wiki

Fisher Information Matrix (FIM) Diagnostics

Two-Tier Reports

Claim Extraction

Adding Your Own Dataset

Quick version (if your data is already in CSV)

Full steps

Troubleshooting

Requirements

Project Structure

Design Philosophy

License

Contributing

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages