feat: supabase cost tracking by barryroodt · Pull Request #20 · barryroodt/wrily

barryroodt · 2026-05-18T12:10:19Z

Summary

Adds opt-in persistence of per-review token + USD cost to a self-hosted Supabase project. Reviews still work without it enabled — purely additive.

Wired up:

claude CLI now invoked with --output-format=stream-json --verbose; final result event parsed into AgentTokenUsage (was null before).
New src/persist/ module with recordReviewRun (retry-then-fail-soft) + queryCosts. No new runtime deps — plain fetch against PostgREST.
persistUsageStep appended to the review workflow, no-op when env vars absent.
Two SQL migrations: review_runs + review_subagent_runs tables, spend_by_repo_30d + spend_by_model_30d views.
New ./wrily persistence {init,migrate,status} subcommands wrap the official supabase CLI to create a project, write .env, and apply migrations.
New ./wrily costs [--since 30d] [--by repo|model|day] [--repo X] [--json] for spend rollups.
Local CLI runs tagged trigger_source=local_cli; GitHub App runs collapse to github_app.

Spec + plan:

docs/superpowers/specs/2026-05-18-supabase-cost-tracking-design.md
docs/superpowers/plans/2026-05-18-supabase-cost-tracking.md

Test Plan

pnpm test — 243 passing, 1 integration test skipped (opt-in via WRILY_INT_SUPABASE_URL)
pnpm typecheck — clean
pnpm build — clean
./wrily --help shows the new subcommands
./wrily persistence status reports disabled when env unset
Manual: ./wrily persistence init against a throwaway Supabase project — verify tables + views land
Manual: trigger a review with SUPABASE_URL set — verify a row appears in review_runs
Manual: ./wrily costs --since 7d against a project with rows

🤖 Generated with Claude Code

Persistence layer for per-review-run token + USD cost data, written by the review container directly to a self-hosted Supabase project via PostgREST. Read surface: Supabase Studio + a ./wrily costs CLI. New ./wrily persistence init/migrate/status subcommands wrap the official supabase CLI to create the project, write .env, and apply migrations. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Sixteen tasks across six phases: env + cost capture, schema migrations, persistence HTTP client, workflow step, CLI subcommands (costs + persistence init/migrate/status), bash entrypoint wiring, opt-in integration test, and docs. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

CodeQL flagged two clear-text password leaks in `wrily persistence init`: the generated DB password was printed to stdout and also passed via the `--db-password` / `--password` flags (visible via `ps` and included in error messages built from `args.join(' ')`). Drop the console.log entirely and route the password via the SUPABASE_DB_PASSWORD env var, which the supabase CLI reads for both `projects create` and `link`. runSupabase now accepts an `env` option so secrets can be passed to the child process without touching argv. The password is never used by Wrily at runtime (writes go through the service-role key) — operators who need SQL-editor access can reset it from the dashboard. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

`runSupabase` was always piping stdio, which broke `supabase login`: the CLI bails with "Cannot use automatic login flow inside non-TTY environments" because it can't drive the browser handoff prompt. Add an `interactive: true` option that inherits the parent stdio. `ensureLoggedIn` now uses it for the login fallback and surfaces the SUPABASE_ACCESS_TOKEN env-var workaround as the fast path so headless runs don't need a TTY at all. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

The supabase CLI's `projects create` MarkFlagRequired check fires before it consults env vars, so the previous attempt to route the password exclusively via SUPABASE_DB_PASSWORD broke project creation with 'required flag(s) "db-password" not set'. Restore --db-password on the create call but keep error-message exposure contained: runSupabase now accepts a redactFlags option that masks the immediately-following value before interpolating args into its error string. The `link` and `db push` calls still receive the password via SUPABASE_DB_PASSWORD env so only one invocation puts the value on argv (and only for the few seconds project creation runs). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

The switch to --output-format=stream-json made stdout an NDJSON event log instead of the model's reply text. Downstream extractFindings looks for a \`\`\`json fence in the model output and bailed with "No \`\`\`json fence found in model reply" on every run. Walk the NDJSON, concatenate text blocks from every assistant event, and return that as AgentResult.stdout. The cost parser still reads the raw event stream for the final result event so token usage capture is unaffected. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Wrily-on-Wrily review flagged four bugs: 1. \`--since\` was ignored when \`--by repo|model\` routed through the pre-built 30-day views. 2. \`--by day\` returned raw review_runs rows, not a per-day rollup. 3. \`--repo\` was silently dropped when combined with \`--by model\` (the model view has no github_repo column). 4. \`deriveRunStatus\` only emitted success/failed; budget_exceeded / timeout never landed in review_runs.status, defeating the dashboard distinction promised by the schema CHECK and the spec verification plan. queryCosts now hits review_runs directly with an inserted_at filter, client-side aggregating by the requested axis. The 30d views remain in the schema for Studio convenience but the CLI no longer relies on them. \`--repo\` + \`--by model\` is rejected at parse time. A new persist/failure.ts classifies AgentBudgetExceededError / AgentTimeoutError (including one level of err.cause wrapping) and writes a row from main.ts catch blocks. Success path still goes through persistUsageStep at end of the workflow. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

barryroodt

test review via script

barryroodt

test review with comments

barryroodt

test 422 reproduction

Dogfood run on PR #20 failed with 'Variable \$commitOID of type GitObjectID was provided invalid value' from GitHub's REST endpoint. Root cause: review takes minutes (clone + agent + extract + route), and the commit SHA captured at the start of the bash entrypoint can go stale before the post step runs (force-push, follow-up commit, etc.). GitHub then rejects the review POST. Fix: - postToGitHubStep refreshes the head SHA via octokit.rest.pulls.get immediately before constructing the post payload, falling back to the original env-supplied SHA on lookup failure. - postReview's body-only 422 fallback now retries once more without commit_id at all, so a body-only prose post still lands even when the SHA is rejected for a reason the refresh didn't catch. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Previously persistUsageStep ran last in the workflow, so any failure in postToGitHubStep (e.g. stale commit_id 422) prevented the cost row from being written even though the agent had already burned the spend. Move persistUsageStep ahead of postToGitHubStep — cost rows are now written as soon as agent results + findings are available, independent of GitHub response. deriveRunStatus drops the fallbackUsed check (unknown at this point); post-step issues remain tracked in workflow logs without polluting the cost dashboard status enum. A new persist/state.ts module exposes markUsagePersisted / wasUsagePersisted. The persistUsageStep flips the flag after a successful write so main.ts catch blocks skip persistFailureRun (and its duplicate zero-cost row) when the cost row already exists. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

barryroodt

Wrily Review: PR #20

Overall Verdict: With fixes

Summary

0 critical, 2 important. Supabase cost tracking is well-tested and architecturally sound (best-effort persist, redacted argv, success/failure-path dedupe). Two gaps: bash wrapper skips .env sourcing when shell auth is set so persistence creds silently drop, and the new .env written by persistence init lands with default umask perms while holding the service_role_key. 5 minor findings hidden — set sensitivity: minor in .wrily.yml to see.

Confidence rating skipped — declare an application criticality tier in CLAUDE.md or AGENTS.md to enable.

Critical

None.

Important

L137: wrily — .env only sourced when ANTHROPIC_API_KEY/CLAUDE_CODE_OAUTH_TOKEN are both unset, but SUPABASE_URL/SUPABASE_SERVICE_ROLE_KEY live in the same file. A user with auth in shell env gets persistence silently disabled (empty -e SUPABASE_URL= at lines 224-225). Move the .env source above the auth gate, or add a second guard: if { [[ -z "${SUPABASE_URL:-}" ]] || [[ -z "${SUPABASE_SERVICE_ROLE_KEY:-}" ]]; } && [[ -f "${SCRIPT_DIR}/.env" ]]; then source "${SCRIPT_DIR}/.env"; fi.
L32: src/cli/persistence/dotenv.ts — New .env written via writeFileSync/appendFileSync inherits umask (typically 0644 — world-readable). The file holds SUPABASE_SERVICE_ROLE_KEY, which bypasses RLS = full DB admin. On shared/multi-user hosts any local user can exfiltrate it. After writing, chmodSync(path, 0o600) (and when appending, ensure the existing file is already 0600 or tighten it).

Minor

None.

Strengths

Retry-then-fail-soft + markUsagePersisted dedupe between success and failure paths keeps observability from ever blocking a review.
redactFlags and SUPABASE_DB_PASSWORD env-var path keep the generated DB password out of argv and error messages.
Strong unit + integration test coverage (env, retry, aggregateRuns, supabase stub binary, stream-json reassembly).

Suppressions

None.

Two findings from the dogfood Wrily review on PR #20: 1. The bash wrapper only sourced .env when ANTHROPIC_API_KEY / CLAUDE_CODE_OAUTH_TOKEN were both unset. Users with shell-exported auth had SUPABASE_URL / SUPABASE_SERVICE_ROLE_KEY silently dropped, so the container started with empty Supabase env and persistence stayed off without any indication. The .env source now runs whenever any of those keys is missing in the current shell env. set -a is used briefly so KEY=val lines export into the env we pass to docker. 2. appendDotEnv inherited the shell's umask, so .env landed with typical 0644 perms while holding the service_role key (which bypasses RLS = full DB admin). New files are created with mode 0o600; existing files are tightened to 0o600 after every append so a pre-existing world-readable file gets fixed on the next write. No-op on win32 where the chmod semantics differ. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

barryroodt · 2026-05-18T14:49:15Z

barryroodt and others added 17 commits May 18, 2026 11:55

feat(config): add SUPABASE_URL + SUPABASE_SERVICE_ROLE_KEY env vars

5ce80dd

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(agent): parse token usage + cost from claude CLI stream-json

ca7c658

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(db): supabase schema for review_runs + review_subagent_runs + views

0b7fc3b

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(persist): supabase HTTP client with retry-then-fail-soft

e6c7ccf

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(workflow): persistUsageStep records cost data to supabase

1e76f6e

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(cli): .env reader + safe appender (refuses overwrite)

1fcaac7

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(cli): spawn wrappers around the supabase CLI binary

98cff3a

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(cli): wrily persistence status

99146a5

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(cli): wrily persistence migrate

b997601

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(cli): wrily persistence init (create supabase project + migrate)

076d672

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(cli): wrily costs (queries supabase for spend rollups)

79948e2

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(cli): wrily entrypoint dispatches costs + persistence subcommands

8802cb7

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

test(persist): opt-in supabase integration test

40a8bbe

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

docs(env): document SUPABASE_URL + SUPABASE_SERVICE_ROLE_KEY

fd565b1

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

docs: cost tracking section + README hook

52b370c

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

github-advanced-security AI found potential problems May 18, 2026

View reviewed changes

Comment thread src/cli/persistence/init.ts Fixed

Comment thread src/cli/persistence/init.ts Fixed

barryroodt self-assigned this May 18, 2026

barryroodt and others added 5 commits May 18, 2026 14:17

barryroodt commented May 18, 2026

View reviewed changes

Comment thread supabase/migrations/0001_review_runs.sql

barryroodt commented May 18, 2026

View reviewed changes

barryroodt and others added 3 commits May 18, 2026 15:44

Merge fix/refresh-commit-sha-before-posting

5496932

barryroodt commented May 18, 2026

View reviewed changes

Comment thread src/cli/persistence/dotenv.ts Outdated

barryroodt merged commit 1461a72 into main May 18, 2026
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: supabase cost tracking#20

feat: supabase cost tracking#20
barryroodt merged 26 commits into
mainfrom
feat/supabase-cost-tracking

barryroodt commented May 18, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

barryroodt left a comment

Uh oh!

barryroodt left a comment

Uh oh!

Uh oh!

barryroodt left a comment

Uh oh!

barryroodt left a comment

Uh oh!

Uh oh!

barryroodt commented May 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

barryroodt commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test Plan

Uh oh!

Uh oh!

Uh oh!

barryroodt left a comment

Choose a reason for hiding this comment

Uh oh!

barryroodt left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

barryroodt left a comment

Choose a reason for hiding this comment

Uh oh!

barryroodt left a comment

Choose a reason for hiding this comment

Wrily Review: PR #20

Overall Verdict: With fixes

Summary

Critical

Important

Minor

Strengths

Suppressions

Uh oh!

Uh oh!

barryroodt commented May 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

barryroodt commented May 18, 2026 •

edited

Loading