docs: FIP draft — testing & acceptance criteria for onboarding new validators by manan19 · Pull Request #910 · farcasterxyz/snapchain

manan19 · 2026-06-09T20:22:14Z

What

Adds a draft FIP / RFC at docs/proposals/fip-validator-onboarding-testing.md defining the testing and acceptance bar that should be met before a new validator is appended to validators.toml.

This is a documentation-only change (no code). It's intended for review here, then posting to farcasterxyz/protocol Discussions.

Why

Two upcoming changes raise the risk of adding a validator:

Geo/datacenter diversity (primary) — a validator further from the existing (largely us-east) cluster stresses the latency-sensitive BFT timing.
Node (consensus-client) diversity (secondary) — interest in a validator built from a different codebase (reimplementation or fork), which can break consensus determinism.

There's currently no documented, staged acceptance procedure. Adding a validator is a one-line effective_at edit, but with equal voting power and a fault budget of 1 on a 6-node shard, a faulty/slow/divergent validator can stall a shard.

What the doc covers

Three cumulative risk tiers — A (stock binary, new geo) → B (fork) → C (independent reimplementation).
Failure modes — consensus/determinism, networking/liveness, operational/security, each tagged by tier.
Layered test taxonomy with gates — L0 determinism vectors → L1 unit/validation parity → L2 multi-node (tests/consensus_test.rs) → L3 production-like full-network testnet (real QUIC gossip from the target DC, prod timing, perf load).
Geo-specific tests — RTT budget, multi-day soak from the real DC, partition drills, NTP.
Staged rollout — read-node → testnet → mainnet one shard at a time, with a pre-staged rollback entry.
Copy-pasteable go/no-go acceptance checklist.

Review asks

A few claims I'd most like a consensus engineer to sanity-check:

Because signatures are computed over encode_to_vec() bytes and headers are BLAKE3-hashed, a non-snapchain client must be byte-for-byte deterministic — hence the Tier C L0 vector suite. Is this framing correct?
test_validator_set_rotation currently exercises the remove path because the add path is flaky in CI; the doc treats hardening the add path as a prerequisite. Agree?
Should a Byzantine/equivocation fault-injection harness be required for Tier C?

🤖 Generated with Claude Code

vercel · 2026-06-09T20:22:21Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
snapchain-docs	Ready	Preview, Comment	Jun 15, 2026 3:53pm

Copilot

Pull request overview

Adds a draft FIP/RFC document that defines a staged testing and acceptance bar for onboarding new Snapchain validators before appending them to validators.toml, with emphasis on cross-geo latency risk and client determinism risk (forks/reimplementations).

Changes:

Introduces a 3-tier risk model (A/B/C) and maps validator failure modes to each tier.
Defines layered acceptance gates (L0 determinism vectors → L1 unit/validation parity → L2 multi-node harness → L3 production-like testnet soak) plus geo/DC-specific tests.
Proposes an operational rollout/rollback procedure and a copy-paste go/no-go checklist.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

topocount · 2026-06-09T20:47:32Z

+
+## Risks & open questions
+
+- **Flaky validator-add test.** `test_validator_set_rotation` currently relies on the *remove* path


seems to be leaning on this pretty heavily

Agreed — reframed in f52ee60. The open question no longer leans on CI flakiness; it now states the underlying point: the validator add path has thinner end-to-end coverage than remove, and strengthening it is a prerequisite.

manan19 · 2026-06-10T21:47:06Z

Filed the test-coverage gaps this FIP surfaces as tracking issues: #924 (umbrella) covering #917–#923. Highest-leverage prerequisites for alt-client onboarding are #917 (golden determinism vectors) and #919 (validator-add path).

Draft RFC defining the requirements and testing criteria for admitting a new validator to the Farcaster consensus set — including alternative client implementations (forks and independent reimplementations) and validators in new geographies/datacenters. Covers the alternative-client determinism contract (R1–R7), the verification layers that prove it (L0 conformance vectors → L3 production-like testnet), deployment requirements (latency budget, soak, reachability), operator requirements (incident-response collaboration, repo maintainership), a staged read-node → testnet → mainnet rollout with rollback, and a go/no-go acceptance checklist. Open testing gaps are tracked in #924. Intended for posting to farcasterxyz/protocol Discussions. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

manan19 · 2026-06-15T15:43:50Z

Posted to farcasterxyz/protocol Discussions (FIP Stage 1: Ideas): farcasterxyz/protocol#272 — links in that copy are absolute so they resolve from the protocol repo. This PR remains the canonical source/review thread.

Copilot AI review requested due to automatic review settings June 9, 2026 20:22

Copilot started reviewing on behalf of manan19 June 9, 2026 20:22 View session

vercel Bot deployed to Preview June 9, 2026 20:22 View deployment

Copilot AI reviewed Jun 9, 2026

View reviewed changes

topocount reviewed Jun 9, 2026

View reviewed changes

vercel Bot deployed to Preview June 9, 2026 21:04 View deployment

vercel Bot deployed to Preview June 9, 2026 22:18 View deployment

topocount previously approved these changes Jun 10, 2026

View reviewed changes

manan19 mentioned this pull request Jun 10, 2026

[tracking] Validator-onboarding testing gaps (FIP) #924

Open

7 tasks

manan19 dismissed topocount’s stale review via 96c5d74 June 10, 2026 21:52

vercel Bot deployed to Preview June 10, 2026 21:52 View deployment

vercel Bot deployed to Preview June 10, 2026 22:06 View deployment

manan19 mentioned this pull request Jun 10, 2026

bug: Incoming validators blocked from consensus communication after validator set cutover #766

Closed

vercel Bot deployed to Preview June 10, 2026 22:13 View deployment

rishavmukherji reviewed Jun 11, 2026

View reviewed changes

Comment thread docs/proposals/fip-validator-onboarding-testing.md

manan19 force-pushed the docs/validator-onboarding-testing-fip branch from 1804705 to e890a9d Compare June 15, 2026 15:35

docs: link FIP to its protocol discussion (Stage 1: Ideas)

f39aa63

vercel Bot deployed to Preview June 15, 2026 15:44 View deployment

rishavmukherji approved these changes Jun 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs: FIP draft — testing & acceptance criteria for onboarding new validators#910

docs: FIP draft — testing & acceptance criteria for onboarding new validators#910
manan19 wants to merge 2 commits into
mainfrom
docs/validator-onboarding-testing-fip

manan19 commented Jun 9, 2026

Uh oh!

vercel Bot commented Jun 9, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

topocount Jun 9, 2026

Uh oh!

manan19 Jun 9, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

manan19 commented Jun 10, 2026

Uh oh!

Uh oh!

manan19 commented Jun 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants


		## Risks & open questions

		- Flaky validator-add test. `test_validator_set_rotation` currently relies on the remove path

Uh oh!

Conversation

manan19 commented Jun 9, 2026

What

Why

What the doc covers

Review asks

Uh oh!

vercel Bot commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

topocount Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

manan19 Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

manan19 commented Jun 10, 2026

Uh oh!

Uh oh!

manan19 commented Jun 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

vercel Bot commented Jun 9, 2026 •

edited

Loading