Add /mobius-evolve skill by AaronGoldsmith · Pull Request #15 · AaronGoldsmith/mobius

AaronGoldsmith · 2026-03-21T16:27:41Z

Summary

Free Opus-powered agent evolution via evaluator-optimizer loop. Analyzes judge feedback and refines underperforming agents' system prompts at zero API cost.

🤖 Generated with Claude Code

Evaluator-optimizer loop that uses Claude Code Opus (free via Pro) to analyze judge feedback and refine underperforming agents' system prompts — same quality as the API evolve command at zero cost. Includes load_underperformers.py helper script to surface agents with low win rates and their loss feedback. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Copilot

Pull request overview

Adds a new Claude skill (mobius-evolve) intended to support “free” agent evolution by loading underperforming agents, surfacing their loss feedback, and guiding a local evaluator/optimizer loop to refine system prompts.

Changes:

Added a load_underperformers.py helper script to list low-win-rate agents and print recent loss feedback.
Added SKILL.md defining the mobius-evolve skill instructions and workflow for refining/registering improved agents.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 5 comments.

File	Description
`.claude/skills/mobius-evolve/scripts/load_underperformers.py`	New script to query the registry/tournament DB for underperformers and print prompt + loss feedback.
`.claude/skills/mobius-evolve/SKILL.md`	New skill definition and step-by-step instructions to run the local evolution loop and register improved agents.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

… matches - Strip raw `git show` output (commit metadata, diff headers, leading +/- chars) from SKILL.md and load_underperformers.py so they parse correctly - Remove unused `json` and `row_to_dict` imports from load_underperformers.py - Filter out unjudged matches (winner_id is None) from loss counting - Update SKILL.md argument-hint to include --min-matches Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 4a11c4f51e

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot AI review requested due to automatic review settings March 21, 2026 16:27

Copilot started reviewing on behalf of AaronGoldsmith March 21, 2026 16:28 View session

Copilot AI reviewed Mar 21, 2026

View reviewed changes

AaronGoldsmith marked this pull request as ready for review March 21, 2026 17:11

chatgpt-codex-connector bot reviewed Mar 21, 2026

View reviewed changes

Comment thread .claude/skills/mobius-evolve/scripts/load_underperformers.py Outdated

Exclude voided matches from underperformer win-rate calculation

5a31c0a

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

AaronGoldsmith merged commit 681f1d4 into main Mar 21, 2026
2 checks passed

AaronGoldsmith deleted the feature/mobius-evolve-skill branch March 21, 2026 17:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add /mobius-evolve skill#15

Add /mobius-evolve skill#15
AaronGoldsmith merged 3 commits intomainfrom
feature/mobius-evolve-skill

AaronGoldsmith commented Mar 21, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AaronGoldsmith commented Mar 21, 2026

Summary

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants