feat: add /techne:research-grounded skill by ajbarea · Pull Request #31 · ajbarea/techne

ajbarea · 2026-05-29T13:46:01Z

What

A new /techne:research-grounded skill — the queued ROADMAP item that mechanizes the # research(YYYY-MM): provenance convention. It audits IMPL.md / ROADMAP.md for committed design decisions (library / framework / pattern / architecture choices) that lack a research-provenance tag, then web-searches to ground them.

The motivation, from the plan doc: when an SSML capability was assumed for Kourai's M6 and never verified against the target's docs (ElevenLabs v3 doesn't support SSML break tags), it became 5 PRs of revertable work. A decision stated as fact but never checked is the failure this skill catches — before it hardens into code.

Design: judgment over grep

The decision-language grep (chose|switched|adopted|instead of|…) is only a candidate seed. Most instead of hits are descriptive implementation prose ("reads X instead of re-reading Y"), not technology choices — so the skill reads around each hit and classifies per-hit. Its Ignore section explicitly waves through descriptive prose, hypotheticals ("what if we used…"), already-grounded decisions, and mechanical changes. This is why it's a skill (judgment), not a CI regex.

Validation (GREEN test)

Ran the skill, report-only, via a fresh subagent against kourai's ROADMAP+IMPL (established as ground truth):

8 of 10 grep candidates correctly filtered as ignore (descriptive prose, a "what if" hypothetical, mechanical renames, already-grounded decisions).
2 genuine gaps surfaced (an un-tagged Message.metadata extension-key architecture choice; a word-level-timing pattern justified in prose but missing the formal tag).

Low false-positive rate, real signal.

Registration

plugins/techne/skills/research-grounded/SKILL.md
docs/skills/research-grounded.md + zensical nav entry
README "What's in the box" row
ROADMAP: moved Queued → Shipped

Local checks green: validate_skill_frontmatter.py (all 10 skills ok), manifests jq-valid, zensical build clean. Sibling of /techne:docsync — claims-vs-code there, decisions-vs-evidence here.

New /techne:research-grounded audits IMPL.md / ROADMAP.md for committed design decisions (library / framework / pattern / architecture choices) that lack a `# research(YYYY-MM):` provenance tag, then web-searches to ground them -- closing the loop that, when skipped, turned an SSML capability bet into 5 revertable PRs. Judgment over grep: the decision-language grep is only a candidate seed; the skill reads around each hit and filters descriptive prose ("reads X instead of Y") and hypotheticals ("what if we used..."). Validated on kourai's ROADMAP/IMPL via a fresh subagent -- 8/10 grep candidates correctly ignored, 2 genuine gaps surfaced. Wired into README, docs/skills/, and the zensical nav; moved Queued -> Shipped. Sibling of /techne:docsync (claims vs code; this is decisions vs evidence).

Adding /techne:research-grounded (#31) left "9 skills" claims behind in ROADMAP, IMPL, and skill-context — the count lives in more places than the slop_ground_truth note named. Bump all to 10, add research-grounded to the Drift dimension table and the skill-context name list, and widen slop_ground_truth to name every place a count claim lives (README, ROADMAP, IMPL, skill-context) so the next addition catches them all. The historical "eight of nine" spec note is left as-is (archival).

ajbarea enabled auto-merge (squash) May 29, 2026 13:46

ajbarea merged commit e6d8442 into main May 29, 2026
2 checks passed

ajbarea deleted the feat/research-grounded-skill branch May 29, 2026 13:46

ajbarea mentioned this pull request May 29, 2026

docs: reconcile skill count to 10 after research-grounded #32

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add /techne:research-grounded skill#31

feat: add /techne:research-grounded skill#31
ajbarea merged 1 commit into
mainfrom
feat/research-grounded-skill

ajbarea commented May 29, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ajbarea commented May 29, 2026

What

Design: judgment over grep

Validation (GREEN test)

Registration

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant