llmo-checker

LLMO Score checker — measures AI-retrieval readiness of a URL. Part of the Open LLMO Research Initiative.

llmo-checker is a Lighthouse-style CLI that scores how "AI-retrievable" a given URL is. It fetches the page, runs a small set of static checks, and returns a JSON report with a single LLMO Score (0-100) plus per-check scores and notes.

It is intentionally not a full AI-citation simulator. It measures the substrate — the structured signals an AI crawler can extract without running an LLM — so that page authors get a fast, reproducible signal before paying for citation simulations.

Status

v0.1 — Draft. Score weights, check list, and JSON schema may change in v0.2. Pin a specific version if you depend on the JSON shape.

Install / Run

Note: The npm publish is pending account verification. Until the npm release lands, install directly from GitHub.

Run from GitHub with npx (no clone needed):

npx github:open-llmo/llmo-checker https://example.com
npx github:open-llmo/llmo-checker https://example.com --json

Once published, the same commands work without the github: prefix:

npx llmo-checker https://example.com           # after npm publish
npx llmo-checker https://example.com --json

Or clone and run locally:

git clone https://github.com/open-llmo/llmo-checker.git
cd llmo-checker
npm install
npm run dev https://example.com

Requires Node.js 20+.

What it checks (v0.1)

Check	Weight	What it measures
`llms-txt`	20	Presence and structure of `/llms.txt` per llmstxt.org
`robots-ai`	15	Explicit posture toward known AI crawlers in `/robots.txt` (GPTBot, ClaudeBot, CCBot, Google-Extended, PerplexityBot, etc.)
`canonical`	15	`<link rel="canonical">` correctness and hreflang alternates
`jsonld`	20	JSON-LD structured data presence, parseability, and recognized schema.org `@type`s
`meta`	15	`<title>` / `<meta name="description">` / OpenGraph / `<h1>` / `<html lang>`

Total weight in v0.1 is 85 (scores normalize to 0-100). Citation Visibility and Chunk Readability are planned for v0.2.

Score bands

Band	Score	Meaning
Green	85-100	Well-grounded for AI retrieval
Yellow	65-84	Needs work — several signals missing or weak
Yellow	40-64	Poor — significant grounding gaps
Red	0-39	Critical — page is largely invisible to AI crawlers

JSON output

npx llmo-checker https://example.com --json

{
  "url": "https://example.com/",
  "origin": "https://example.com",
  "timestamp": "2026-05-24T10:00:00.000Z",
  "checkerVersion": "0.1.0",
  "scoreVersion": "0.1",
  "score": 72,
  "checks": [
    {
      "id": "llms-txt",
      "name": "llms.txt presence and structure",
      "status": "pass",
      "score": 100,
      "weight": 20,
      "details": { "...": "..." },
      "notes": []
    }
  ]
}

Each check exits with one of pass / warn / fail / skip. The CLI exits with status 0 if the overall score is ≥ 50, 1 otherwise, and 2 on fetch errors.

Score v0.1 indicator set (Draft)

These are the indicator categories the v0.1 score covers. Inclusion does not imply causation with downstream AI citation — they are necessary substrate signals that have a clear definition.

Citation Visibility — whether AI assistants cite the URL (planned v0.2, requires probing)
Chunk Readability — heuristic readability of extracted chunks (planned v0.2)
Semantic Structure — JSON-LD, OpenGraph, heading hierarchy (covered by jsonld and meta)
AI Crawlability — robots.txt posture toward known AI bots (covered by robots-ai)
llms.txt — covered by llms-txt
Markdown Quality — applies only when the page has a Markdown source (planned v0.2)
Entity Clarity — JSON-LD @type Organization / Person / Book recognition (partially covered by jsonld)

The full draft spec lives at https://llmoframework.com/en/experimental-projects/.

Development

git clone https://github.com/open-llmo/llmo-checker.git
cd llmo-checker
npm install
npm run dev https://example.com
npm test
npm run build

Contributing

This is an early Draft. Issues and PRs welcome at https://github.com/open-llmo/llmo-checker/issues.

If you want to propose a new check, open an issue with:

the signal name and what it measures
why it predicts AI-retrieval readiness (a paper, a public experiment, or a Lighthouse-style argument)
proposed weight and scoring rule

License

MIT — see LICENSE.

Founded and maintained by Ken Imoto as part of the Open LLMO Research Initiative.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.github/workflows		.github/workflows
assets		assets
experiments/external-baseline-2026-05		experiments/external-baseline-2026-05
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

llmo-checker

Status

Install / Run

What it checks (v0.1)

Score bands

JSON output

Score v0.1 indicator set (Draft)

Development

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

llmo-checker

Status

Install / Run

What it checks (v0.1)

Score bands

JSON output

Score v0.1 indicator set (Draft)

Development

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages