feat(pipeline): candidate-accuracy gates wiring + feedback loop (skills/workflow)#216
Merged
Merged
Conversation
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
…s injection (prompts/schemas) Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
…e repair round Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
…uration; author/validator lessons + lint notes Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
…ves applyCap, display.category is UI-only Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
…lidator path, decisions-preserve guard, unique temp slots, deferred-list completeness, spec amendments Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
AndroDR half of the candidate-accuracy spec (
docs/superpowers/specs/2026-06-10-pipeline-candidate-accuracy-design.md). Sigma-repo half merged as rules#32 + rules#33.Phase 1 — deterministic gates
categoryper runtimeapplyCapsemantics, lone-exploited-CVE, retired-ID registry) + reviewer-driven hardening (category enum check closing a case-sensitivity bypass, empty-CVE-list rejection, clean non-dict display error). 40/40 sigma tests incl. a regression sweep over every shipped rule.skip_noteescape hatch; second failure is final.Phase 2 — feedback loop
pipeline-runs/YYYY-MM-DD-<mode>.yml) recording every HitL verdict + reason + failure class; ledger failures never block rule commits.validation/authoring-lessons.yml(seeded with the posture-cap, no-TROJAN, and lone-CVE lessons) injected into author, validator, reviewer, and repair prompts in both the dispatcher path and the e2e workflow.Review
Two-reviewer cycle ran mid-branch; all findings fixed: dispatcher-path lessons gap, decisions-manifest wipe on repair (
[]truthiness), temp-file slot uniqueness for duplicate rule_ids, empty-parallel guard, validate-skill Gate-5 wording contradiction, deferred-list completeness, two spec amendments. One finding intentionally deferred: a pytest job in the sigma repo's CI (GHA budget policy — suites run locally per project convention).Verification: workflow body parses under the harness wrapper; discover suite 19/19; sigma validation suite 40/40. No Android app code touched.
🤖 Generated with Claude Code