
Paper: E8-vs-KVarN iso-bit-width head-to-head + frontier-leadership positioning #43

Merged: jagmarques merged 1 commit into main from company/paper-kvarn on Jun 16, 2026

@jagmarques

Owner

Adds a faithful iso-bit-width comparison against KVarN, the nearest calibration-free, rotation-based KV-quantization rival. The Sinkhorn port is verified exact against the authors' reference (0.00 fp32 deviation).

At matched K4V2 on Mistral-Inst-v0.3, NQ reaches +0.267% PPL at 3.125 bpe, versus +0.421% at 3.375 bpe for KVarN: a lower delta at a lower bit budget. KVarN edges the single 4K NIAH cell (5/5 vs 4/5), which is within noise given that FP16 also scores 4/5. The result is framed honestly as comparable, not a win.

Also adds a source-verified positioning claim: NQ K2V2 (~1.1 entropy-bpe) is, to our knowledge, the lowest-bit calibration-free operating point with needle-retrieval validation. Every calibration-free method validated on NIAH/RULER sits at >=2 bits, and the only needle-validated methods below 1.1 bits are calibrated.

Every cell is independently verified against the source JSON, and the paper compiles clean (0 errors, 88 pp.). Includes the PDF and the KVarN result sidecar.
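The "lower delta at a lower bit budget" statement can be checked mechanically. The sketch below is illustrative only: `OperatingPoint` and `dominates` are hypothetical names that do not come from the repository, the figures are copied from the description above, and reading the bpe in excess of the nominal (4 + 2) / 2 = 3 bits as per-element overhead is an assumption rather than something the PR states.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OperatingPoint:
    """One quantization setting (hypothetical helper, not from the repo)."""
    name: str
    ppl_delta_pct: float     # perplexity increase in percent, as reported
    bits_per_element: float  # reported bpe


def dominates(a: OperatingPoint, b: OperatingPoint) -> bool:
    """True if `a` is no worse than `b` on both axes and strictly better on one."""
    no_worse = (a.ppl_delta_pct <= b.ppl_delta_pct
                and a.bits_per_element <= b.bits_per_element)
    strictly_better = (a.ppl_delta_pct < b.ppl_delta_pct
                       or a.bits_per_element < b.bits_per_element)
    return no_worse and strictly_better


# Figures copied from the PR description (K4V2, Mistral-Inst-v0.3).
nq = OperatingPoint("NQ K4V2", ppl_delta_pct=0.267, bits_per_element=3.125)
kvarn = OperatingPoint("KVarN K4V2", ppl_delta_pct=0.421, bits_per_element=3.375)

# Nominal K4V2 budget: 4-bit keys and 2-bit values average to 3 bits.
# Treating the remainder as overhead is an assumption made for illustration.
nominal_bits = (4 + 2) / 2
overhead = {p.name: p.bits_per_element - nominal_bits for p in (nq, kvarn)}

print(dominates(nq, kvarn))  # NQ is better on both PPL delta and bpe
print(overhead)
```

This covers only the two axes (PPL delta and bpe). The 4K NIAH cell, where KVarN scores 5/5 against 4/5, is not modeled, which is why the description calls the overall result comparable rather than a win.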

@jagmarques jagmarques merged commit 0c8dae5 into main Jun 16, 2026

@jagmarques jagmarques deleted the company/paper-kvarn branch June 19, 2026 21:05