raisin: dense Python for LLMs

πŸ‡ raisin

Write Python LLMs can read. ~50% fewer tokens, 100% same functionality.

Claude Code Skill · 786 tests passing · License: MIT


The metaphor

A raisin is a grape with the water removed. Same sweetness, same nutrients, same fruit, half the mass. This project does the same thing to Python: removes the water (docstrings, boilerplate, ceremonial type hints) and keeps the nutrients (logic, behavior, public API).

TL;DR

LLMs are trained to imitate human coding conventions: docstrings for Sphinx, type hints for IDE hover, verbose error handling for readable stack traces. None of these serve the LLM that's writing or reading the code.

When we let LLMs write natively for machine-reading, they produce programs that pass the same test suites in roughly half the tokens. Six independent experiments confirm the pattern:

| Experiment | Saved | Verification |
|---|---|---|
| Click 8.2.1 retrofit | 55.5% | 738/738 tests pass |
| Flask 3.1.3 retrofit | 62.7% | syntax + structure |
| Bottle 0.13.4 retrofit | 37.6% | WSGI pipeline |
| Greenfield TODO CLI (written twice) | 51.8% | 28/28 tests pass |
| Guide → agent → URL shortener (from scratch) | 47.2% | 20/20 tests pass |
| Guide → agent → click/formatting.py (real library) | 44.4% | 738/738 tests pass |
| 🏆 Single-file record: flask/helpers.py | 80.3% | syntax + structure |

Total: 786 tests verified. Zero regressions.


Install the Skill


Claude Code (one line)

/plugin marketplace add Oldrich333/raisin
/plugin install raisin

Claude Code (manual)

git clone https://github.com/Oldrich333/raisin.git /tmp/raisin
mkdir -p ~/.claude/skills
cp -r /tmp/raisin/plugins/raisin/skills/raisin ~/.claude/skills/

Codex / Gemini CLI / other agents

Copy plugins/raisin/skills/raisin/SKILL.md into your agent's skill directory. The skill is a single self-contained file with no dependencies.

Activate

Use a slash command:

/halfcode      # primary command
/dense         # alias
/raisin        # alias (brand)

Or natural language:

write this dense
minimize tokens
compress src/utils.py
no docstrings, llm-native style

The Core Experiment: Greenfield TODO CLI

To prove compression isn't "retrofit cheating," we wrote the same program twice from scratch, in two styles. Both pass the same 28-test spec.

| Style | Tokens | LOC | Tests |
|---|---|---|---|
| Normal Python (docstrings, type hints, verbose errors) | 3,022 | 437 | 28/28 ✓ |
| LLM-native (dense from line 1) | 1,458 | 104 | 28/28 ✓ |
| Ratio | 48.2% | 23.8% | - |

→ Full greenfield experiment
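
To make the contrast concrete, here is a hypothetical before/after in the two styles (illustrative only; this is not code from greenfield/normal/todo.py or greenfield/kolmogorov/todo.py). The dense version keeps the public name, behavior, and error semantics while dropping everything written for human tooling.

```python
# Illustrative only: a hypothetical task helper, not taken from the benchmark.

# Normal, human-imitating style:
from typing import Optional

def add_task(tasks: list[dict], title: str, due: Optional[str] = None) -> dict:
    """Add a task to the task list.

    Args:
        tasks: Existing list of task dicts.
        title: Human-readable task title.
        due: Optional ISO-8601 due date.

    Returns:
        The newly created task dict.

    Raises:
        ValueError: If the title is empty.
    """
    if not title:
        raise ValueError("Task title must not be empty.")
    task = {"id": len(tasks) + 1, "title": title, "due": due, "done": False}
    tasks.append(task)
    return task


# Dense, LLM-native style: same public name, same behavior, no ceremony.
def add_task(tasks, title, due=None):
    if not title: raise ValueError("Task title must not be empty.")
    t = {"id": len(tasks) + 1, "title": title, "due": due, "done": False}
    tasks.append(t); return t
```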


Guide Validation: Does the Methodology Transfer?

Sure, we can compress code. But can someone following the guide reproduce the results? We tested it twice:

Test 1 - URL shortener from scratch (20 tests): we gave an agent only the guide + test suite. Result: 775 tokens, 20/20 pass, 47.2% savings.

Test 2 - Click formatting.py, inside the real library (738 tests): we gave a different agent only the guide + the original Click file. The agent wrote a dense version that passes all 738 of Click's own tests. Result: 1,195 tokens, 738/738 pass, 44.4% savings, within 5% of our hand-tuned reference.

→ Guide validation experiment


Headline File: flask/helpers.py (80.3% savings)

| | Tokens | LOC |
|---|---|---|
| Original | 5,399 | 641 |
| Dense rewrite | 1,064 | 80 |
| Saved | 80.3% | 87.5% |

Flask's helpers.py is mostly small utility functions, each wrapped in 30 lines of docstrings and type overloads. The dense version preserves all public API and behavior in 80 lines.
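
The pattern looks roughly like this (a hypothetical helper, not copied from flask/helpers.py): overload stubs and a long docstring wrapped around a few lines of logic, versus the same behavior on two lines.

```python
from typing import overload

# Hypothetical helper in the human-imitating style (not an actual Flask function):
@overload
def to_bool(value: bool) -> bool: ...
@overload
def to_bool(value: str) -> bool: ...
def to_bool(value):
    """Coerce a configuration value to a boolean.

    Strings such as "1", "true", "yes", and "on" are treated as True;
    anything else is False. Booleans pass through unchanged.
    """
    if isinstance(value, bool):
        return value
    return str(value).strip().lower() in {"1", "true", "yes", "on"}


# Dense rewrite: same public name and behavior, stubs and docstring removed.
def to_bool(value):
    return value if isinstance(value, bool) else str(value).strip().lower() in {"1", "true", "yes", "on"}
```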


Reproducing the results

Prerequisites

python3 -m pip install pytest tiktoken pyyaml

Verify Click (738 tests at every level)

bash tools/run_tests.sh original          # 738 passed
bash tools/run_tests.sh L1_clean          # 738 passed
bash tools/run_tests.sh LK_kolmogorov     # 738 passed
bash tools/run_tests.sh LK2_aggressive    # 738 passed

Verify greenfield TODO (28 tests × 2 implementations)

cd greenfield
TODO_IMPL=normal python3 -m pytest tests/ -q       # 28 passed
TODO_IMPL=kolmogorov python3 -m pytest tests/ -q   # 28 passed

Verify guide validation (20 tests × 2 implementations)

cd guide_validation
URLSHORT_IMPL=normal python3 -m pytest spec/ -q       # 20 passed
URLSHORT_IMPL=kolmogorov python3 -m pytest spec/ -q   # 20 passed

Measure token counts across all levels

python3 tools/measure.py
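
tools/measure.py is not reproduced here; as a minimal sketch, token counts like the ones in this README can be taken with tiktoken. The encoding chosen below is an assumption; the actual script may use a different encoding or methodology.

```python
# Minimal token-counting sketch. Assumes a tiktoken encoding; tools/measure.py
# may differ in encoding choice and in what it measures.
import pathlib
import sys

import tiktoken

ENC = tiktoken.get_encoding("cl100k_base")  # assumed encoding, not confirmed by the repo

def count_tokens(path: str) -> int:
    # Read the file and count BPE tokens in its text.
    return len(ENC.encode(pathlib.Path(path).read_text(encoding="utf-8")))

if __name__ == "__main__":
    for p in sys.argv[1:]:
        print(f"{p}: {count_tokens(p)} tokens")
```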

The methodology

CODE_COMPRESSION_GUIDE.md is an M2M (machine-to-machine) document: no prose, no chatty explanations. It contains:

  • FRAMING - what to optimize, what to preserve, what to ignore
  • WHAT TO REMOVE - always waste: docstrings, comments, blank lines, overload stubs, internal type hints
  • WHAT TO RESTRUCTURE - the real gains: shared error handlers, validation helpers, dict dispatch, bulk attribute assignment (illustrated in the sketch below)
  • NAMING RULES - short internal, clear public
  • FORMATTING RULES - semicolons, one-liners, comprehensions
  • VERIFICATION PROTOCOL - how to catch bugs without reverting
  • CHECKLIST - grep patterns for each optimization opportunity
  • GREENFIELD vs RETROFIT - different process for each

The guide is not a tutorial. It's a specification for LLM agents. The skill in plugins/raisin/skills/raisin/SKILL.md is the same methodology packaged as a Claude Code / Codex skill.
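
Two of the restructuring moves named in the list above, shown as a hypothetical before/after (illustrative code, not taken from the guide or the benchmarks): an if/elif chain becomes dict dispatch, and attribute-by-attribute assignment collapses into a bulk update.

```python
# Hypothetical before/after; not code from CODE_COMPRESSION_GUIDE.md.
def add(p): return ("added", p)
def remove(p): return ("removed", p)
def list_all(p): return ("listing", p)

# Before: if/elif chain and attribute-by-attribute assignment.
def handle(cmd, payload):
    if cmd == "add":
        return add(payload)
    elif cmd == "remove":
        return remove(payload)
    elif cmd == "list":
        return list_all(payload)
    raise ValueError(f"unknown command: {cmd}")

class Config:
    def __init__(self, host, port, debug, timeout):
        self.host = host
        self.port = port
        self.debug = debug
        self.timeout = timeout

# After: dict dispatch and bulk attribute assignment.
_DISPATCH = {"add": add, "remove": remove, "list": list_all}

def handle(cmd, payload):
    try: return _DISPATCH[cmd](payload)
    except KeyError: raise ValueError(f"unknown command: {cmd}")

class Config:
    def __init__(self, **kw): self.__dict__.update(kw)
```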


Repository Structure

raisin/
├── README.md                      - you are here
├── CODE_COMPRESSION_GUIDE.md      - the methodology (agent instructions)
├── RESULTS.md                     - detailed retrofit results
├── TECHNIQUES.md                  - design document
├── READABILITY_ARGUMENT.md        - pre-emptive response to "but it's unreadable"
│
├── .claude-plugin/
│   └── marketplace.json           - Claude Code marketplace definition
│
├── plugins/raisin/                - the installable skill
│   ├── .claude-plugin/plugin.json
│   ├── README.md
│   └── skills/raisin/SKILL.md     - methodology as agent instruction
│
├── assets/
│   ├── raisin-banner.png          - social preview
│   ├── raisin-logo.png            - logo with code braces
│   └── raisin-icon-simple.png     - minimalist icon
│
├── greenfield/                    - write-from-scratch experiment
│   ├── SPEC.md, RESULTS.md
│   ├── tests/test_todo.py         - 28 tests (shared)
│   ├── normal/todo.py             - human-imitating (437 LOC)
│   ├── normal_L1/todo.py          - after automated strip
│   ├── normal_L2/todo.py          - after cosmetic pass
│   └── kolmogorov/todo.py         - LLM-native (104 LOC)
│
├── guide_validation/              - methodology transfer test
│   ├── spec/SPEC.md, spec/test_url_short.py  - 20 tests
│   ├── normal/url_short.py        - reference human-style
│   └── kolmogorov/url_short.py    - agent wrote this using only the guide
│
├── original/                      - Click 8.2.1 source
├── L1_clean/                      - Click after automated strip
├── LK_kolmogorov/                 - Click after LLM-native rewrite
├── LK2_aggressive/                - Click after second rewrite pass
├── LK3_agent_click/               - Click with agent's formatting.py
│
├── flask_benchmark/               - Flask 3.1.3 + compressed versions
├── bottle_benchmark/              - Bottle 0.13.4 + compressed versions
│
├── tests/                         - Click 8.2.1's own 738-test suite
└── tools/                         - measure/strip/run_tests/full_report

Why this matters

Context window economics

A 200K context window loaded with Click + Flask + Bottle (original) consumes 170,767 tokens (85% of the window). The LLM-native versions consume 79,542 tokens (40%), leaving 120K tokens free for actual thinking.

Across a 50-library research mission, this saves ~1.5M tokens and roughly $4.50 per run at Claude's API rates.
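
A back-of-the-envelope check of those numbers; the per-token price below is an assumption (the text above only commits to "Claude's API rates").

```python
# Illustrative arithmetic only; the $3 per million input tokens rate is an assumption.
original, dense, window = 170_767, 79_542, 200_000
print(f"original: {original / window:.0%} of window, dense: {dense / window:.0%}")  # ~85% vs ~40%
saved_per_three_libs = original - dense            # ~91K tokens across the 3 libraries
mission = saved_per_three_libs / 3 * 50            # scale to a 50-library mission
print(f"~{mission / 1e6:.1f}M tokens saved")       # ~1.5M
print(f"~${mission / 1e6 * 3:.2f} per run")        # roughly $4.50 at the assumed rate
```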

LLM coding speed

When an LLM produces docstrings, type hints, and verbose error handling, it spends real wall-clock time generating tokens that nothing will ever read. Remove those constraints and the LLM writes the program faster and cleaner.

Code review

LLM-native code is not unreadable; it's densely readable. A Python programmer finishes reading the 104-line TODO faster than the 437-line version because there's less skipping. LLMs read it trivially.


Related Work

Atlas Coding Engine (ACE Protocol v15): the production methodology this benchmark validates. Atlas (the agentic AI platform that produced this benchmark) has 47,472 LOC in 163 shard files, was built LLM-native from day one, and claims ~60% LOC savings, verified here.


License

  • Repo analysis, methodology, tooling, skill: MIT
  • Click, Flask, Bottle and derivative works: original licenses (BSD-3, BSD-3, MIT)
  • See LICENSE for details

Citation

@misc{raisin-2026,
  author = {Oldrich333},
  title = {raisin: Write Python LLMs can read},
  year = {2026},
  url = {https://github.com/Oldrich333/raisin}
}
