
feat: add /techne:research-grounded skill #31

Merged
ajbarea merged 1 commit into main from feat/research-grounded-skill on May 29, 2026

Conversation

Owner

@ajbarea ajbarea commented May 29, 2026

What

A new /techne:research-grounded skill — the queued ROADMAP item that mechanizes the # research(YYYY-MM): provenance convention. It audits IMPL.md / ROADMAP.md for committed design decisions (library / framework / pattern / architecture choices) that lack a research-provenance tag, then web-searches to ground them.
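As a rough illustration of the convention being checked, the tag's shape can be matched with a small regex. This is only a sketch: it assumes the tag is written literally as `# research(YYYY-MM):` with a four-digit year and a two-digit month, and `has_research_tag` is a hypothetical helper, not something taken from the skill itself.

```python
import re

# Assumed tag shape: "# research(2026-05): ..." (four-digit year, month 01-12).
RESEARCH_TAG = re.compile(r"#\s*research\((\d{4})-(0[1-9]|1[0-2])\):")


def has_research_tag(text: str) -> bool:
    """Return True if the text contains a well-formed research-provenance tag."""
    return RESEARCH_TAG.search(text) is not None
```

A decision paragraph for which `has_research_tag` returns False would be a candidate for grounding; whether it actually needs grounding is still a judgment call.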

The motivation, from the plan doc: when an SSML capability was assumed for Kourai's M6 and never verified against the target's docs (ElevenLabs v3 doesn't support SSML break tags), it became 5 PRs of revertable work. A decision stated as fact but never checked is the failure this skill catches — before it hardens into code.

Design: judgment over grep

The decision-language grep (`chose|switched|adopted|instead of|…`) is only a candidate seed. Most `instead of` hits are descriptive implementation prose ("reads X instead of re-reading Y"), not technology choices, so the skill reads around each hit and classifies it individually. Its Ignore section explicitly waves through descriptive prose, hypotheticals ("what if we used…"), already-grounded decisions, and mechanical changes. This is why it's a skill (judgment), not a CI regex.
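The seeding step could look roughly like the sketch below. It is an assumption-laden illustration, not the skill's implementation: the pattern uses only the four terms named above (the real pattern is elided in this description), and `Candidate` and `seed_candidates` are hypothetical names. The point is that each hit is returned with its surrounding lines so that a reader, not the regex, makes the classification.

```python
import re
from dataclasses import dataclass

# Only the four terms named in the description; the full pattern is elided there.
DECISION_SEED = re.compile(r"\b(chose|switched|adopted|instead of)\b", re.IGNORECASE)


@dataclass
class Candidate:
    line_no: int   # 1-based line number of the hit
    match: str     # the decision-language phrase that matched
    context: str   # the hit plus neighbouring lines, for the per-hit judgment


def seed_candidates(text: str, window: int = 1) -> list[Candidate]:
    """Collect decision-language hits with `window` lines of context on each side."""
    lines = text.splitlines()
    candidates = []
    for i, line in enumerate(lines):
        m = DECISION_SEED.search(line)
        if m is None:
            continue
        lo, hi = max(0, i - window), min(len(lines), i + window + 1)
        candidates.append(Candidate(i + 1, m.group(0), "\n".join(lines[lo:hi])))
    return candidates
```

Both "We chose SQLite for storage." and "The loader reads X instead of re-reading Y." would be seeded here; only the first is a technology choice, which is the distinction the regex cannot draw.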

Validation (GREEN test)

Ran the skill, report-only, via a fresh subagent against kourai's ROADMAP+IMPL (established as ground truth):

  • 8 of 10 grep candidates correctly filtered as ignore (descriptive prose, a "what if" hypothetical, mechanical renames, already-grounded decisions).
  • 2 genuine gaps surfaced (an un-tagged Message.metadata extension-key architecture choice; a word-level-timing pattern justified in prose but missing the formal tag).

All 10 candidates were classified correctly on this run: no false positives, and both flagged gaps were real.

Registration

  • plugins/techne/skills/research-grounded/SKILL.md
  • docs/skills/research-grounded.md + zensical nav entry
  • README "What's in the box" row
  • ROADMAP: moved Queued → Shipped

Local checks green: validate_skill_frontmatter.py (all 10 skills ok), manifests jq-valid, zensical build clean. Sibling of /techne:docsync — claims-vs-code there, decisions-vs-evidence here.

New /techne:research-grounded audits IMPL.md / ROADMAP.md for committed design
decisions (library / framework / pattern / architecture choices) that lack a
`# research(YYYY-MM):` provenance tag, then web-searches to ground them -- closing
the loop that, when skipped, turned an SSML capability bet into 5 revertable PRs.

Judgment over grep: the decision-language grep is only a candidate seed; the skill
reads around each hit and filters descriptive prose ("reads X instead of Y") and
hypotheticals ("what if we used..."). Validated on kourai's ROADMAP/IMPL via a fresh
subagent -- 8/10 grep candidates correctly ignored, 2 genuine gaps surfaced.

Wired into README, docs/skills/, and the zensical nav; moved Queued -> Shipped.
Sibling of /techne:docsync (claims vs code; this is decisions vs evidence).
@ajbarea ajbarea enabled auto-merge (squash) May 29, 2026 13:46
@ajbarea ajbarea merged commit e6d8442 into main May 29, 2026
2 checks passed
@ajbarea ajbarea deleted the feat/research-grounded-skill branch May 29, 2026 13:46
ajbarea added a commit that referenced this pull request May 29, 2026
Adding /techne:research-grounded (#31) left "9 skills" claims behind in ROADMAP,
IMPL, and skill-context — the count lives in more places than the slop_ground_truth
note named. Bump all to 10, add research-grounded to the Drift dimension table and
the skill-context name list, and widen slop_ground_truth to name every place a count
claim lives (README, ROADMAP, IMPL, skill-context) so the next addition catches them
all. The historical "eight of nine" spec note is left as-is (archival).
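A check for this kind of count drift might be sketched as follows. It is hypothetical: `find_stale_counts` is an illustrative name, and the pattern assumes count claims are written as digits followed by "skills" (so a spelled-out archival note such as "eight of nine" is deliberately not matched).

```python
import re

# Assumed claim shape: a number in digits followed by "skills", e.g. "9 skills".
COUNT_CLAIM = re.compile(r"\b(\d+) skills\b")


def find_stale_counts(docs: dict[str, str], expected: int) -> list[tuple[str, int, int]]:
    """Return (doc name, line number, claimed count) for every claim != expected."""
    stale = []
    for name, text in docs.items():
        for line_no, line in enumerate(text.splitlines(), start=1):
            for m in COUNT_CLAIM.finditer(line):
                claimed = int(m.group(1))
                if claimed != expected:
                    stale.append((name, line_no, claimed))
    return stale
```

Passing the contents of every file where a count claim lives (README, ROADMAP, IMPL, skill-context) with `expected=10` would list any leftover "9 skills" lines.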