Skip to main content

A Claude Code plugin that turns multi-stage agentic work into provable steps.

Multi-stage agentic work as a sequence of provable steps. Iron Rule: real-system evidence on disk, refuses verdicts without proof. 36 skills, 11 agents, 22 commands, 7 hooks.

Shannon is a Claude Code plugin for high-stakes multi-stage agentic work — research, planning, implementation, validation, release — where each step must produce evidence a skeptical reviewer can open. Its Iron Rule refuses to emit a verdict without real-system proof on disk: no mocks, no stubs, no test files.

The surface is 36 skills, 11 agents, 22 slash commands, and 7 enforcement hooks. Sub-agents carry their skills inline (embedded at build time), so a spawn can never silently fail to load the skill it needs. A meta-judge generates a weighted rubric YAML before any judge runs, with hidden thresholds, and escalates to multi-round debate on disagreement.

Shannon is self-instrumented. `scripts/doctor.py` is a mechanical contract validator (10/10 checks: manifest validity, surface counts, hook registration, the required-hooks dependency graph, and skill/agent body-reference resolution). `scripts/harness/load_check.py` is a real Agent SDK load probe — it inherits the Claude Code CLI's own auth, queries the running plugin, and asserts every command is addressable.

Iron Rule validation

Real-system evidence on disk for every completion claim. Enforcement hooks refuse to write .test.* / mock / stub files. Compilation success is explicitly not validation.

Embedded sub-agent skills

Each agent's SKILL.md content is inlined into its AGENT.md at build time, so a dispatched sub-agent carries its skill and cannot fail to load it mid-run.

Meta-judge consensus

A rubric YAML with weighted dimensions and hidden thresholds is generated before any judge scores. Non-unanimous verdicts escalate to multi-round debate, never to silent averaging.

Refusal-driven autopilot

/shannon:autopilot wraps /shannon:cook in a retry loop. On REFUSED it parses cited blockers and remediates; it never force-completes. Refusal is a feature, not a bug.

  1. 01/plugin marketplace add krzemienski/shannon
  2. 02/plugin install shannon@shannon
  3. 03/shannon:enforce on — activate Shannon's hooks for the current project
  4. 04/shannon:doctor — confirm the 10/10 mechanical contract passes
  5. 05/shannon:plan "<your feature>" --mode deep — author a tournament+converge plan
  6. 06/shannon:cook plans/<date>-<slug>/ — execute end-to-end with Iron-Rule validation
Iron Rule — no mocks / stubs / test files
Enforced by hooks at write time
Embedded sub-agent skills
Inlined by build/embed-skills.py
Four planning modes (linear/converge/tournament/deep)
/shannon:plan
Refusal-driven autopilot
No force-complete; cites blockers
Meta-judge rubric with hidden thresholds
Generated before any judge runs
Self-instrumented contract check
scripts/doctor.py — 10/10
Real Agent SDK load probe
scripts/harness/load_check.py — inherits CLI auth
Dynamic Workflows escalation
Unbounded fan-out → Workflow pipeline()
Documentation
Read the Shannon docs
github.com/krzemienski/shannon#readme
  1. Second stable release. 36 skills, 11 agents, 22 commands, 7 hooks. DeepPlan + wave-execution (dependency-ordered parallel waves with between-wave synthesis gates); /shannon:plan-deep landed as a true alias of plan --mode deep; the evidence-gated forge pipeline (codebase-analysis → docs-research → planning → oracle plan-review → execute → validation → evidence-indexing → 3-reviewer consensus → 3-oracle quorum → completion-gate); skills10x activation harness with transitive registration. doctor 10/10 PASS.
  2. First stable release. 33 skills, 9 agents, 20 commands, 7 hooks. doctor 10/10 PASS; real SDK load probe PASS (all 20 commands addressable). doctor.py now reads version live from the manifest; docs reconciled to disk counts; Dynamic Workflows awareness added to the orchestration cluster.