doc: Document KV cache materialization during chunked prefill by Connor1996 · Pull Request #114 · skyzh/tiny-llm

Connor1996 · 2026-04-12T23:56:25Z

What changed

This follow-up adds a short explanation for why chunked prefill materializes the KV cache with mx.eval:

add an inline comment in src/tiny_llm_ref/batch.py
add a matching tip in book/src/week2-06-prefill-and-batch.md

Why

The existing code does the right thing, but the reason is easy to miss when reading the implementation or the chapter. This note makes it explicit that mx.eval is there to keep the lazy graph from growing across prefill chunks.

Signed-off-by: Connor1996 <zbk602423539@gmail.com>

docs: explain prefill kv cache materialization

96398a0

Signed-off-by: Connor1996 <zbk602423539@gmail.com>

Connor1996 changed the title ~~[codex] document KV cache materialization during chunked prefill~~ doc: Document KV cache materialization during chunked prefill Apr 13, 2026

Connor1996 marked this pull request as ready for review April 13, 2026 00:00

docs: align prefill materialization wording

31ee12a

Signed-off-by: Connor1996 <zbk602423539@gmail.com>

Connor1996 merged commit 370514e into skyzh:main Apr 13, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

doc: Document KV cache materialization during chunked prefill#114

doc: Document KV cache materialization during chunked prefill#114
Connor1996 merged 2 commits intoskyzh:mainfrom
Connor1996:codex/prefill-materialize-docs

Connor1996 commented Apr 12, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Connor1996 commented Apr 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changed

Why

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Connor1996 commented Apr 12, 2026 •

edited

Loading