Research-production instrument: node -> hypothesis -> code-verified r…#24
Merged
Conversation
…esult -> paper draft Turns the atlas from a map into a research instrument, addressing the 'paper-production pipeline' intent: - labs/research/harness.py: dependency-free (stdlib only, CPU, deterministic) harness that takes a falsifiable Hypothesis tied to a graph node, sweeps one independent variable, records a metric, decides whether the claim survived its own falsifier, and emits a results table + SVG figure + findings note. - labs/research/exp_overestimation_bias.py: code-verifies the lens of insight:q_learning_max_is_optimistically_biased. Result reproduces the textbook effect — single-max bias grows 0.20 -> 0.83 as actions go 2 -> 64, while the double estimator stays ~0 — so the lens claim survives, by code. - tools/scaffold_paper.py: given a node, assembles a paper-draft scaffold from the graph neighbourhood + research lens (research question, positioned related work, load-bearing assumption, gap, method primitives, cross-domain prior, experiment plan with a harness pointer, falsification criteria). Example drafts for UniAD, offline-RL-as-constrained-DP, differentiable-E2E, and conditional-generation planning in research/drafts/. - Research lenses 56 -> 66 (completed RL/decision lens-wave fragment). QC green: 0 dead links, 0/231 lint findings, math test passing, all 66 lens links resolve, seed validator OK. (Lands on the branch after the PR-23 merge.)
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
…esult -> paper draft
Turns the atlas from a map into a research instrument, addressing the 'paper-production pipeline' intent:
QC green: 0 dead links, 0/231 lint findings, math test passing, all 66 lens links resolve, seed validator OK. (Lands on the branch after the PR-23 merge.)
Pull Request
变更摘要 / Summary
(一两句话讲清这个 PR 的意图)
类型 / Type
docs/data/graph.json检查清单 / Checklist
python tools/validate_graph.py通过python tools/check_links.py没有新出现的 404LLM_BACKEND=mock跑过nbconvert --executeconcepts.md/ playbook(如相关)#page=N、YouTube&t=Xs等)contributors: [...],或说明不需要标注备注 / Notes
(截图、性能数字、与
AGENTS.md的对照说明等)