docs(antithesis): Antithesis research scratchbook and bug ledger #1768
Binary Size Analysis (Agent Data Plane)
Baseline: 9d9e29d · Comparison: 1b642b8 · diff
✅ Binary size difference within threshold
Regression Detector (Agent Data Plane)
Run ID:
Optimization Goals: ✅ No significant changes detected
Fine details of change detection per experiment (35)
Bounds Checks: ✅ Passed (5)
Explanation: A change is flagged as a regression when |Δ mean %| > 5.00% in the regressing direction for its optimization goal AND SMP marks the experiment as a regression (
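The flagging rule quoted in the explanation can be sketched as a small predicate. This is only an illustration of the two-part condition as stated, not the detector's actual implementation: the goal names, the `Experiment` type, and the mapping from goal to regressing direction are assumptions, and the SMP verdict is treated as an opaque boolean because the explanation is cut off in this excerpt.

```python
from dataclasses import dataclass

# Hypothetical goal names: which direction counts as "regressing" per goal
# is an assumption, since the excerpt does not list the optimization goals.
HIGHER_IS_WORSE = {"memory", "cpu"}
LOWER_IS_WORSE = {"ingress_throughput"}


@dataclass(frozen=True)
class Experiment:
    name: str
    goal: str                   # optimization goal for this experiment
    delta_mean_pct: float       # signed change in the mean, in percent
    smp_marks_regression: bool  # SMP's own verdict, treated as opaque here


def is_flagged(exp: Experiment, threshold_pct: float = 5.0) -> bool:
    """Return True only when BOTH conditions from the explanation hold."""
    if exp.goal in HIGHER_IS_WORSE:
        in_regressing_direction = exp.delta_mean_pct > threshold_pct
    elif exp.goal in LOWER_IS_WORSE:
        in_regressing_direction = exp.delta_mean_pct < -threshold_pct
    else:
        raise ValueError(f"unknown optimization goal: {exp.goal!r}")
    # A large delta alone is not enough; SMP must also mark it.
    return in_regressing_direction and exp.smp_marks_regression
```

Under these assumptions, a +6% memory change that SMP does not mark stays unflagged, as does a -6% memory change, which moves in the improving direction.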
correctness, lifecycle/config, untrusted-input parsing, concurrency, and **transform & enrichment
correctness** (Category G, added after evaluation: ADP as a *transformer*, not just a transport).

> **Evaluation note (2026-05-28):** a 4-lens portfolio evaluation added 8 properties (G1 events/
- **Dominance:** `rss-bounded-under-cardinality` is the **roll-up** — it observes the aggregate
  outcome (RSS ≤ grant). The other four explain *why* it does or doesn't hold:
  `aggregate-context-limit-enforced` and `interner-full-bounded` are the two designed bounds;
  `interner-full-bounded` (heap-on default) and `memory-limiter-survives-rss-read-failure` are the
  two leaks that make the roll-up fail. If `rss-bounded` passes, the sub-properties likely hold; if
  it fails, the sub-properties localize the cause. Test the roll-up *and* the components.
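One way to read the roll-up/component relationship is as a small triage function over per-property pass/fail results. The sketch below is an interpretation, not part of the scratchbook: the verdict labels and the `triage` helper are hypothetical, and only the component names that appear in the snippet are listed (the snippet mentions four components but names three distinct ones).

```python
ROLL_UP = "rss-bounded-under-cardinality"

# Component properties named in the snippet above.
COMPONENTS = (
    "aggregate-context-limit-enforced",
    "interner-full-bounded",
    "memory-limiter-survives-rss-read-failure",
)


def triage(results: dict[str, bool]) -> tuple[str, list[str]]:
    """Classify a run from pass/fail results for the roll-up and components.

    Returns a (verdict, failing_components) pair. Verdict labels are
    illustrative only:
      "ok"          roll-up and all components pass
      "masked"      roll-up passes but a component fails
      "localized"   roll-up fails and failing components point at the cause
      "unexplained" roll-up fails but every listed component passes
    """
    missing = [p for p in (ROLL_UP, *COMPONENTS) if p not in results]
    if missing:
        raise KeyError(f"no result recorded for: {missing}")

    failing = [p for p in COMPONENTS if not results[p]]
    if results[ROLL_UP]:
        return ("masked" if failing else "ok", failing)
    return ("localized" if failing else "unexplained", failing)
```

The "masked" case is the reason to test both levels: a passing roll-up only makes the sub-properties likely to hold, so a component can fail without the aggregate showing it.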
## Summary

The Antithesis research artifacts for agent-data-plane, under `test/antithesis/scratchbook/`. This is the analysis behind the harness and the bug repros:

- A SUT analysis of the DogStatsD data path and runtime.
- A property catalog (35 properties) with per-property evidence files.
- A deployment topology, a portfolio evaluation, property relationships, and a bug ledger that maps each discovered defect to how it is reproduced.

Docs only, no code. The internal design-partner codename is scrubbed and an internal Antithesis run id is redacted; Confluence links and Jira references are kept.

## Change Type

- [ ] Bug fix
- [ ] New feature
- [x] Non-functional (chore, refactoring, docs)
- [ ] Performance

## How did you test this PR?

Docs only. `check-docs` is unaffected (it builds the `docs/` site; these notes live under `test/`). No code paths change.

## References

- Builds on the harness PR (`test/antithesis/`) in this stack.
- The failing bug repros it catalogs are in the bug-tests PR in this stack.
- Internal context (kept per repo norms) is in the Confluence/Jira links in each artifact's frontmatter.
@@ -0,0 +1,159 @@
# rss-bounded-under-cardinality

@@ -0,0 +1,150 @@
# retry-queue-bounded-under-outage

@@ -0,0 +1,129 @@
# interner-full-bounded

@@ -0,0 +1,121 @@
# aggregate-context-limit-enforced

@@ -0,0 +1,60 @@
# config-runtime-update-not-revalidated
type: Safety (Always) + Reachability
priority: High
status: assertion-missing
sut_commit: 042f41db3bd97118c38981765fd49696fce9d318

type: Safety (Reachability / Unreachable)
priority: Medium
status: assertion-missing
sut_commit: fc4bb29728814ddf9321572b954ec28f58faeb53

type: Liveness
priority: High
status: assertion-missing
sut_commit: 042f41db3bd97118c38981765fd49696fce9d318
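The frontmatter fields above lend themselves to simple tooling, for example listing which properties still lack an assertion. The following is a minimal sketch assuming each artifact's frontmatter is a flat list of `key: value` lines like the ones quoted; `parse_frontmatter` and `needs_assertion` are hypothetical helpers, and real frontmatter with nesting or multi-line values would need a proper YAML parser.

```python
def parse_frontmatter(block: str) -> dict[str, str]:
    """Parse flat `key: value` lines into a dict (naive, not full YAML)."""
    fields: dict[str, str] = {}
    for raw in block.splitlines():
        line = raw.strip()
        if not line or ":" not in line:
            continue  # skip blank lines and anything that is not key: value
        key, value = line.split(":", 1)  # split on the first colon only
        fields[key.strip()] = value.strip()
    return fields


def needs_assertion(fields: dict[str, str]) -> bool:
    """True when the property is catalogued but no assertion exists yet."""
    return fields.get("status") == "assertion-missing"


example = """\
type: Safety (Always) + Reachability
priority: High
status: assertion-missing
sut_commit: 042f41db3bd97118c38981765fd49696fce9d318
"""

fields = parse_frontmatter(example)
print(fields["priority"], needs_assertion(fields))
```

Splitting on the first colon only keeps values intact if they themselves contain a colon.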
tobz left a comment:
Given what we discussed yesterday about how we expect to use this and have it be kept up-to-date, fine with merging it. 👍🏻