Add /mobius-evolve skill#15

Merged
AaronGoldsmith merged 3 commits into main from feature/mobius-evolve-skill on Mar 21, 2026

@AaronGoldsmith
Owner

Summary

Free Opus-powered agent evolution via an evaluator-optimizer loop. Analyzes judge feedback and refines underperforming agents' system prompts at zero API cost.
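
For readers unfamiliar with the pattern, here is a minimal sketch of a generic evaluator-optimizer loop. It is not the skill's actual implementation: `evolve_prompt`, `evaluate`, `refine`, `target_score`, and `max_rounds` are hypothetical names chosen for illustration, and the evaluator and optimizer are passed in as plain callables rather than real model calls.

```python
from typing import Callable, Tuple


def evolve_prompt(
    prompt: str,
    evaluate: Callable[[str], Tuple[float, str]],  # hypothetical: returns (score, feedback)
    refine: Callable[[str, str], str],             # hypothetical: returns a revised prompt
    target_score: float = 0.8,                     # assumed stopping threshold
    max_rounds: int = 3,                           # assumed iteration cap
) -> Tuple[str, float]:
    """Refine a prompt until it scores well enough or the round limit is hit."""
    best_prompt = prompt
    best_score, feedback = evaluate(prompt)
    for _ in range(max_rounds):
        if best_score >= target_score:
            break
        # Optimizer step: propose a new prompt from the evaluator's feedback.
        candidate = refine(best_prompt, feedback)
        # Evaluator step: score the candidate and collect fresh feedback.
        score, candidate_feedback = evaluate(candidate)
        # Keep the candidate only if it actually improves on the best so far.
        if score > best_score:
            best_prompt, best_score, feedback = candidate, score, candidate_feedback
    return best_prompt, best_score
```

Keeping the previous best when a candidate scores lower means a bad refinement round cannot make the agent worse, which is one reasonable design choice among several.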

🤖 Generated with Claude Code

Evaluator-optimizer loop that uses Claude Code Opus (free via Pro)
to analyze judge feedback and refine underperforming agents' system
prompts — same quality as the API evolve command at zero cost.

Includes load_underperformers.py helper script to surface agents
with low win rates and their loss feedback.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings March 21, 2026 16:27

Copilot AI left a comment

Pull request overview

Adds a new Claude skill (mobius-evolve) intended to support “free” agent evolution by loading underperforming agents, surfacing their loss feedback, and guiding a local evaluator/optimizer loop to refine system prompts.

Changes:

  • Added a load_underperformers.py helper script to list low-win-rate agents and print recent loss feedback.
  • Added SKILL.md defining the mobius-evolve skill instructions and workflow for refining/registering improved agents.
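
As a rough illustration of what "listing low-win-rate agents" can look like, the sketch below filters a table of per-agent results. It is an assumption-laden example, not the contents of load_underperformers.py: the `find_underperformers` function, the `stats` mapping of agent ID to `(wins, total_matches)`, and the default thresholds are all hypothetical. Only the idea of a minimum-match cutoff comes from this PR, which mentions a `--min-matches` argument.

```python
from typing import Dict, List, Tuple


def find_underperformers(
    stats: Dict[str, Tuple[int, int]],  # hypothetical shape: agent_id -> (wins, total_matches)
    max_win_rate: float = 0.4,          # assumed threshold, not taken from the PR
    min_matches: int = 3,               # assumed default for the minimum-match cutoff
) -> List[Tuple[str, float]]:
    """Return (agent_id, win_rate) pairs for agents at or below max_win_rate."""
    underperformers = []
    for agent_id, (wins, total) in stats.items():
        # Too few matches gives an unreliable win rate, so skip the agent.
        if total < min_matches:
            continue
        win_rate = wins / total
        if win_rate <= max_win_rate:
            underperformers.append((agent_id, win_rate))
    # Worst performers first.
    return sorted(underperformers, key=lambda item: item[1])
```

The minimum-match cutoff keeps an agent that lost its only match from being flagged on a single data point.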

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 5 comments.

| File | Description |
| --- | --- |
| .claude/skills/mobius-evolve/scripts/load_underperformers.py | New script to query the registry/tournament DB for underperformers and print prompt + loss feedback. |
| .claude/skills/mobius-evolve/SKILL.md | New skill definition and step-by-step instructions to run the local evolution loop and register improved agents. |

Comment thread .claude/skills/mobius-evolve/scripts/load_underperformers.py Outdated
Comment thread .claude/skills/mobius-evolve/scripts/load_underperformers.py Outdated
Comment thread .claude/skills/mobius-evolve/SKILL.md Outdated
Comment thread .claude/skills/mobius-evolve/scripts/load_underperformers.py Outdated
Comment thread .claude/skills/mobius-evolve/SKILL.md Outdated
… matches

- Strip raw `git show` output (commit metadata, diff headers, leading +/- chars)
  from SKILL.md and load_underperformers.py so they parse correctly
- Remove unused `json` and `row_to_dict` imports from load_underperformers.py
- Filter out unjudged matches (winner_id is None) from loss counting
- Update SKILL.md argument-hint to include --min-matches
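
To make the unjudged-match fix concrete, here is a minimal sketch of loss counting that skips matches without a winner. Only the `winner_id is None` check comes from the commit message above; the `count_losses` function, the dict-shaped match records, and the `agent_ids` field are hypothetical stand-ins for whatever the real script reads from the database.

```python
from collections import Counter
from typing import Dict, Iterable


def count_losses(matches: Iterable[Dict]) -> Counter:
    """Count losses per agent, ignoring matches that have not been judged."""
    losses: Counter = Counter()
    for match in matches:
        winner_id = match.get("winner_id")
        if winner_id is None:
            # Unjudged match: nobody has lost yet, so it must not count.
            continue
        # "agent_ids" is an assumed field listing every participant.
        for agent_id in match["agent_ids"]:
            if agent_id != winner_id:
                losses[agent_id] += 1
    return losses
```

Without the `None` check, every participant in an unjudged match would differ from the (missing) winner and be counted as a loser, inflating loss totals.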

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@AaronGoldsmith AaronGoldsmith marked this pull request as ready for review March 21, 2026 17:11

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 4a11c4f51e

Comment thread .claude/skills/mobius-evolve/scripts/load_underperformers.py Outdated
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@AaronGoldsmith AaronGoldsmith merged commit 681f1d4 into main Mar 21, 2026
2 checks passed
@AaronGoldsmith AaronGoldsmith deleted the feature/mobius-evolve-skill branch March 21, 2026 17:32
2 participants