Description
ERL (erl.rs) already accumulates per-skill heuristics over sessions via Jaccard deduplication. Add a periodic aggregation pass that examines skills where the heuristic count exceeds a configurable threshold (e.g. 5 unique heuristics) and asks the LLM whether any combination of heuristics is substantial enough to:
- Become a standalone new skill (novel capability distinct from parent)
- Be merged into the parent skill body (enriching the existing skill)
This closes the loop between short-term ERL learning and the long-term skill corpus.
Motivation
AutoSkill's lifelong learning model distinguishes itself by making learned behaviors permanent. Zeph's ERL heuristics are valuable but ephemeral — they enhance prompts per-session but never graduate into durable skills. Periodic promotion creates a compounding improvement loop.
Acceptance Criteria
References
Description
ERL (
erl.rs) already accumulates per-skill heuristics over sessions via Jaccard deduplication. Add a periodic aggregation pass that examines skills where the heuristic count exceeds a configurable threshold (e.g. 5 unique heuristics) and asks the LLM whether any combination of heuristics is substantial enough to:This closes the loop between short-term ERL learning and the long-term skill corpus.
Motivation
AutoSkill's lifelong learning model distinguishes itself by making learned behaviors permanent. Zeph's ERL heuristics are valuable but ephemeral — they enhance prompts per-session but never graduate into durable skills. Periodic promotion creates a compounding improvement loop.
Acceptance Criteria
skills.heuristic_promotionconfig section:enabled,threshold(default 5),provider,interval_hoursReferences
specs/061-autoskill-heuristic-promotion/spec.md