abyss

Your agent wastes 85% of tokens. We fix that.

Code graph + token compression for AI coding agents.
One binary. Zero config. 14 languages.

English · 中文

The Problem

AI coding agents read entire files to change one line. They dump 50K tokens of cargo test output when only 3 lines matter. They edit functions without knowing who calls them.

abyss fixes this with two layers that reinforce each other:

Layer	What it does	Result
Code Graph	Call graph, blast radius, hotspot scoring — from tree-sitter + git history	Agent knows who calls this and what breaks before editing
Proxy Compression	Intercepts agent commands, structurally compresses output	85% fewer tokens, zero information loss

Quick Demo

$ abyss setup
✓ 210 files, 1545 symbols, 12216 refs in 165ms
✓ hooks installed: pre-edit + post-edit + proxy

$ abyss callers batch_resolve_refs
callers of 'batch_resolve_refs' (1 prod):
  1. src/indexer/pipeline.rs:255 → run_structural()  (100%, call)

$ abyss gain
╭────────────────────────────────────────────╮
│  abyss proxy —   96K tokens saved (85%)   │
╰────────────────────────────────────────────╯
  148 commands proxied
  113K raw → 17K delivered

Install

# prebuilt binary (linux / macOS / Windows, x64 / arm64)
curl -fsSL https://raw.githubusercontent.com/telagod/abyss/main/install.sh | bash

# or via package managers
npm install -g @code-abyss/cli
cargo binstall code-abyss       # or: cargo install code-abyss

More install options

# mirror for restricted networks
curl -fsSL https://cdn.jsdelivr.net/gh/telagod/abyss@main/install.sh | bash

# Windows
npm install -g @code-abyss/cli
# or: prebuilt .zip on GitHub Releases

# from source
git clone https://github.com/telagod/abyss && cd abyss
cargo install --path .

# shell completion (bash / zsh / fish / powershell)
abyss completion bash | sudo tee /etc/bash_completion.d/abyss

Setup (60 Seconds)

cd your-project
abyss setup       # index + hooks + proxy — one command, done

That's it. Your agent now has:

Pre-edit safety cards — callers, blast radius, risk score before every edit
Post-edit reindex — incremental, hash-based, milliseconds
Proxy compression — every shell command output compressed automatically

Works with Claude Code, Codex CLI, Gemini CLI, and OpenClaw. No config files.

What Can It Do?

Code Intelligence

abyss callers SetError         # who calls this?
abyss impact  SetError         # what breaks if I change it?
abyss context src/auth.go      # full pre-edit card: callers, deps, risk, coupling
abyss map                      # hotspots + change coupling overview
abyss search  "validate"       # symbol + fulltext fusion search
abyss where   src/auth.go      # architectural coordinates (layer / module / role)
abyss history src/auth.go      # file evolution from git

Proxy Compression

abyss proxy cargo test         # run + compress + track
abyss proxy --explain          # show which handler fired and why
abyss gain                     # token savings dashboard

Integration

abyss attach claude            # install hooks for Claude Code
abyss attach all               # claude + codex + gemini in one shot
abyss mcp                      # MCP server (9 tools, stdio)
abyss daemon start --detach    # background reindex on file save

Every command supports --json for machine consumption.

Real-World Compression

Measured on real coding sessions. Not synthetic benchmarks.

Scenario	Without abyss	With abyss	Compression
"Who calls this function?"	221 KB (read 6 files)	6 KB (caller graph)	36×
"Find all usages of `run_structural`"	328 KB (grep + read)	1.7 KB (callers)	195×
Codebase overview	291 KB (read all .rs)	2.1 KB (map)	138×
`cargo test` (237 tests pass)	8,861 tokens	11 tokens	99.9%
`cat` large file (862 lines)	8,151 tokens	782 tokens	90.4%
`git diff`	4,493 tokens	862 tokens	80.8%

0% on small results is correct — the never-worse guard passes through output that can't be compressed without information loss.

Resolver Precision

Not a compiler. Not a guess. Measured against SCIP (compiler-grade) ground truth:

Corpus	Language	Precision	Recall
gin v1.10	Go	99.4%	83.0%
hono v4.6	TypeScript	98.9%	64.6%
click 8.1	Python	99.3%	94.6%
ripgrep 14.1	Rust	98.5%	75.5%
abyss (dogfood)	Rust	100%	76.0%
cmark 0.31	C	99.1%	74.8%

All corpora ≥ 98.5% gated precision — regressions are release-blockers. Reproduce: cd eval && ./run.sh

How the resolver works

Tiered heuristic resolution, each level tagged with a confidence score:

Tier	Strategy	Confidence
L0	Receiver-type match (`x.M()` where type of `x` is known)	0.95
L0b	Named-import binding (`import { x } from './m'`)	0.95
L1	Same file, bare/self-like calls	1.0
L2	Same package, unique candidate	0.95
L3	Import-qualifier match, unique	0.9
L4	Globally unique symbol	0.8
L5	Same package, multiple candidates	0.6

Agent-facing APIs default to min_confidence=0.7 to filter noise.

Language Support

Full call graph (calls + type refs + imports): Go, Rust, TypeScript/TSX, JavaScript, Python, Java, C, C++

Symbol indexing + search: all of the above + JSON, TOML, YAML, Bash, HTML, CSS

Battle-Tested

We run abyss on real codebases and publish every score — including the gaps.

Project	Language	Files	Index Time	Score
Django 5.1	Python	3,292	6.9s	8 / 10
SQLAlchemy 2.0	Python	687	8.4s	8 / 10
hono v4.6	TypeScript	388	0.8s	8 / 10
helix-editor	Rust	545	1.6s	7.5 / 10
vite v5.4	TS/JS mono	1,793	0.9s	7 / 10
FastAPI 0.115	Python	2,164	1.1s	6.5 / 10

Full reports: docs/DOGFOOD.md

vs. Pure Compressors

	Pure compressor	abyss
Output filtering	✅ Pattern-matching	✅ Structural + semantic
Code understanding	❌	✅ Call graph + impact
Blast-radius annotations	❌	✅ Risk scores in output
Smart file read	Brace-counting	✅ Tree-sitter AST (14 langs)
Pre-edit safety	❌	✅ Callers, coverage gaps
Setup	Separate install	✅ Built into one binary

Architecture

Single Rust binary (~18 MB). SQLite index at .code-abyss/index.db.

CLI (clap)
 ├── Indexer: walker → tree-sitter parse → tiered SQL resolver → git temporal
 ├── Proxy: 28 Rust handlers + TOML rule engine → never-worse guard
 ├── MCP: 9 tools over stdio (rmcp)
 ├── Daemon: pidfile + Unix socket, hash-incremental reindex on file save
 └── Hooks: pre-edit card / post-edit refresh / proxy rewrite (2ms budget)

Build variants

Build	Contents	Size
Default (slim)	Call graph + temporal + fulltext + proxy + MCP	~18 MB
`--features semantic`	+ embedding search (fastembed / ONNX)	~43 MB

Development

cargo build                    # slim build
cargo test                     # all tests
cargo clippy -- -D warnings    # lint
cargo fmt --check              # format check

# smoke test
cargo run -- index && cargo run -- stats && cargo run -- map --json

License

Apache-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 225 Commits
.github/workflows		.github/workflows
docs		docs
eval		eval
npm		npm
scripts		scripts
site		site
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
RELEASE-NOTES.md		RELEASE-NOTES.md
book.toml		book.toml
install.sh		install.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

abyss

The Problem

Quick Demo

Install

Setup (60 Seconds)

What Can It Do?

Code Intelligence

Proxy Compression

Integration

Real-World Compression

Resolver Precision

Language Support

Battle-Tested

vs. Pure Compressors

Architecture

Development

License

About

Uh oh!

Releases 24

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

abyss

The Problem

Quick Demo

Install

Setup (60 Seconds)

What Can It Do?

Code Intelligence

Proxy Compression

Integration

Real-World Compression

Resolver Precision

Language Support

Battle-Tested

vs. Pure Compressors

Architecture

Development

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 24

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages