Skip to content

E7 — Production Readiness (ALO-126)#108

Open
aloewright wants to merge 2 commits into
mainfrom
conductor/alo-126-e7-production-readiness
Open

E7 — Production Readiness (ALO-126)#108
aloewright wants to merge 2 commits into
mainfrom
conductor/alo-126-e7-production-readiness

Conversation

@aloewright
Copy link
Copy Markdown
Owner

Closes ALO-126.

Most of the E7 scope (observability, DO rate limiter, CSRF/CSP, signed R2 URLs, DMCA, moderation queue, lifecycle state machine, Playwright e2e, k6 load, D1 backup runbook, AI Gateway guard) already shipped across earlier PRs. This PR consolidates the remaining operator-facing artifacts and the prod release gate.

Summary

  • on-call runbook (docs/runbooks/on-call.md) — sev ladder, first-five-minutes, rollback steps, playbooks for deploy regressions, DB failures, stuck uploads, rate-limiter overreach, DMCA, cost spikes.
  • cost monitoring runbook (docs/runbooks/cost-monitoring.md) — per-line-item thresholds (Workers, Logs, R2, D1, Stream, AI Gateway), abuse triage queries, cost-discipline rules (no model defaults in code, bounded loops, log retention).
  • production readiness checklist (docs/runbooks/production-readiness.md) — single index mapping every E7 requirement to the artifact that satisfies it.
  • prod-deploy staging gate (.github/workflows/deploy-prod.yml) — new staging-smoke job probes /api/health on staging (URL overridable via vars.STAGING_HEALTH_URL) and blocks the prod job unless status=ok. Realises the "staging mirrors prod" Done-when criterion at the workflow level.

Test plan

  • npm run lint — 0 warnings, AI Gateway guard 0 findings
  • npm test -- --run — 491/491 passing across 44 files
  • Manual: dispatch the prod workflow and confirm the staging-smoke job runs and gates correctly (deferred — needs a working staging deployment, which is owned outside this PR)

🤖 Generated with Claude Code

…LO-126)

Add the operability docs and prod-deploy gate that close out E7 production
readiness. Most of the underlying work (observability, rate limiting, CSRF,
CSP, signed R2 URLs, DMCA, moderation, e2e, k6, D1 backup) already shipped
in earlier PRs — this consolidates the operator-facing artifacts:

* docs/runbooks/on-call.md — sev ladder, first-five-minutes, rollback,
  common playbooks, comms.
* docs/runbooks/cost-monitoring.md — alert thresholds per CF line item,
  abuse triage, cost discipline rules.
* docs/runbooks/production-readiness.md — single checklist mapping every
  E7 requirement to its artifact.
* deploy-prod.yml — staging-smoke job probes /api/health on staging and
  blocks the prod job until it returns status=ok. STAGING_HEALTH_URL is
  overridable via repo variable.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings May 8, 2026 14:53
@aloewright aloewright added the conductor Conductor-managed PR label May 8, 2026
@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai Bot commented May 8, 2026

Warning

Rate limit exceeded

@aloewright has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 42 minutes and 55 seconds before requesting another review.

You’ve run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 9a8cd6a0-2ca4-4f3f-91e9-248dad251733

📥 Commits

Reviewing files that changed from the base of the PR and between 4d3c13f and 0dfc8a8.

📒 Files selected for processing (4)
  • .github/workflows/deploy-prod.yml
  • docs/runbooks/cost-monitoring.md
  • docs/runbooks/on-call.md
  • docs/runbooks/production-readiness.md
✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch conductor/alo-126-e7-production-readiness

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@ecc-tools
Copy link
Copy Markdown
Contributor

ecc-tools Bot commented May 8, 2026

ECC bundle files are already tracked in this repository. Skipping generation of another bundle PR.

Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

…-126)

Two accuracy fixes on the E7 runbooks so they match the actual code:

* on-call.md: the rate-limit DO binding is `RATE_LIMITER`, not
  `RATE_LIMIT_DO` — `wrangler.toml` defines it under `[[durable_objects.bindings]]`
  with `name = "RATE_LIMITER"`.
* production-readiness.md: clarify that worker Sentry is wired via
  `Sentry.withSentry(...)` in `src/workers/index.ts`, so an operator can
  trace the integration end-to-end without grepping.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@ecc-tools
Copy link
Copy Markdown
Contributor

ecc-tools Bot commented May 9, 2026

ECC bundle files are already tracked in this repository. Skipping generation of another bundle PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

conductor Conductor-managed PR

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants