🤖 feat: refresh LiteLLM models and prune stale overrides #2559

Open

ammar-agent wants to merge 5 commits into main from feat/update-models-cycle

Conversation


@ammar-agent ammar-agent commented Feb 23, 2026

Summary

This PR adds a reusable update-models skill, runs a full model-refresh cycle, and prunes stale models-extra overrides that upstream LiteLLM now covers.

Background

models-extra.ts is checked before models.json, so stale local overrides silently shadow corrected upstream metadata. We needed a repeatable workflow that makes pruning safe and routine.
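The precedence problem can be sketched as a small lookup function. This is a hypothetical illustration of the behavior described above, not the actual code in the repo; `modelsExtra` and `upstreamModels` stand in for the real data in models-extra.ts and models.json:

```typescript
// Sketch of the lookup order: local overrides are consulted before
// upstream models.json, so a stale entry silently wins over corrected
// upstream metadata.

interface ModelStats {
  maxInputTokens: number;
  inputCostPerToken: number;
}

// Stand-ins for models-extra.ts and models.json (values are illustrative).
const modelsExtra: Record<string, ModelStats> = {
  "gpt-5.3-codex": { maxInputTokens: 272_000, inputCostPerToken: 1.25e-6 },
};

const upstreamModels: Record<string, ModelStats> = {
  "claude-opus-4": { maxInputTokens: 1_000_000, inputCostPerToken: 5e-6 },
};

function getModelStats(model: string): ModelStats | undefined {
  // models-extra is checked first — which is exactly why stale
  // overrides must be pruned once upstream catches up.
  return modelsExtra[model] ?? upstreamModels[model];
}
```

Because the override wins unconditionally, the only safe steady state is an overrides file that contains nothing upstream already covers.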

Implementation

  • Added .mux/skills/update-models/SKILL.md with an end-to-end workflow for:
    • refreshing models.json
    • diffing models-extra against upstream
    • deciding remove/keep/update
    • running focused validation
  • Executed the workflow and refreshed src/common/utils/tokens/models.json from upstream LiteLLM.
  • Pruned src/common/utils/tokens/models-extra.ts down to only models still missing upstream (gpt-5.3-codex, gpt-5.3-codex-spark).
  • Updated model-related tests to match current data ownership and remove stale assumptions tied to older overrides.
  • Incorporated cycle findings back into the skill (including a comparison script and lessons learned section).
  • Stabilized tests/ui/compaction/compaction.test.ts by pinning compaction-flow test sends to Sonnet, so auto-compaction still has a deterministic higher-context fallback now that Opus metadata has moved to a 1M-token context upstream.
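The remove/keep/update decision in the workflow above can be sketched as a classifier over override entries. This is a hypothetical sketch of the diff step, not the comparison script shipped in SKILL.md; the names and shapes are illustrative:

```typescript
// Classify each models-extra entry against refreshed upstream data.
type Decision = "remove" | "keep" | "update";

function classifyOverride(
  name: string,
  override: Record<string, unknown>,
  upstream: Record<string, Record<string, unknown>>,
): Decision {
  const up = upstream[name];
  // Still missing upstream (e.g. gpt-5.3-codex): the override is the
  // only source of metadata, so it must be kept.
  if (!up) return "keep";
  // Upstream now covers the model: if every overridden field matches,
  // the override is redundant and can be pruned; otherwise it has
  // diverged and needs review.
  const matches = Object.entries(override).every(([k, v]) => up[k] === v);
  return matches ? "remove" : "update";
}
```

Running a classifier like this over every override makes the pruning decision mechanical instead of a judgment call per entry.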

Validation

  • make static-check
  • bun test src/common/constants/knownModels.test.ts src/common/utils/tokens/modelStats.test.ts src/common/utils/ai/modelCapabilities.test.ts
  • TEST_INTEGRATION=1 bun x jest tests/ui/compaction/compaction.test.ts --runInBand

Risks

  • Upstream token/cost metadata changes can alter runtime context/cost behavior.
  • Mitigated by preserving only intentional local overrides still missing upstream and validating known model resolution + compaction UI behavior.

Generated with mux • Model: openai:gpt-5.3-codex • Thinking: xhigh • Cost: $2.12

@ammar-agent

@codex review

Please review the models refresh + models-extra pruning + update-models skill improvements.

@ammar-agent force-pushed the feat/update-models-cycle branch from a8a75e1 to b84b1c1 on February 23, 2026 at 19:24
@ammar-agent

@codex review

Rebased onto latest main, resolved merge conflicts, and re-ran static-check + targeted model tests.

@chatgpt-codex-connector

Codex Review: Didn't find any major issues. You're on a roll.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

@ammar-agent

@codex review

Addressed the failing integration check by making compaction UI tests deterministic with an explicit lower-context model for compaction flows. Re-ran static-check and targeted tests.


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: fab0f4d20f


@ammar-agent

@codex review

Addressed the draft-preservation test concern by keeping sendMessage in-flight while typing the draft, then awaiting completion.

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 8b5999184e


@ammar-agent

@codex review

Adjusted the compaction draft-preservation test to keep the UI send path (app.chat.send) while still making auto-compaction deterministic by configuring the preferred compaction model before render.

@chatgpt-codex-connector
Codex Review: Didn't find any major issues. 🚀

