
Functional validation for agents: evidence over mocks.

Ship functional evidence, not passing tests. Every completion claim produces screenshots, API responses, and build logs that a skeptical reviewer can open.


ValidationForge is a Claude Code plugin that replaces unit-test theater with functional validation against the real running system.

ValidationForge ships as 52 validation skills, 19 slash commands, 7 enforcement hooks, 7 agents, and 9 rules files. The block-test-files hook refuses to create test files. The validation-not-compilation hook rejects completion claims that cite only a successful build. The evidence-gate-reminder hook injects an evidence checklist before any task is marked done.

No Mocks, Ever

Enforcement hooks block the creation of .test.* files, mock libraries, and in-memory test doubles. Fix the real system instead.
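As a rough illustration of how a hook like block-test-files could work: Claude Code hands a PreToolUse hook a JSON event on stdin, and an exit code of 2 blocks the tool call and feeds stderr back to the agent. The sketch below assumes only the .test.* pattern named above. The names `TEST_FILE`, `decide`, and `main` and the message text are hypothetical, not ValidationForge's actual source.

```python
import json
import re
import sys
from typing import TextIO

# Assumption: only the ".test.*" pattern is checked; the plugin's real rule set is not shown here.
TEST_FILE = re.compile(r"\.test\.[^/\\]+$")


def decide(event: dict) -> tuple[int, str]:
    """Return (exit_code, message) for one PreToolUse event."""
    path = str(event.get("tool_input", {}).get("file_path", ""))
    if TEST_FILE.search(path):
        # Exit code 2 tells Claude Code to block the tool call.
        return 2, f"Blocked: {path} looks like a test file. Validate the running system instead."
    return 0, ""


def main(stdin: TextIO = sys.stdin, stderr: TextIO = sys.stderr) -> int:
    """Read the hook event from stdin and report the decision on stderr."""
    code, message = decide(json.load(stdin))
    if message:
        print(message, file=stderr)
    return code

# When saved as a hook script, the file would end with: raise SystemExit(main())
```

Keeping the decision in a pure function makes the rule easy to check in isolation, for example `decide({"tool_input": {"file_path": "src/app.test.ts"}})` returns exit code 2 while `src/app.ts` passes through with 0.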

Evidence Chain of Custody

Every validation step captures screenshots, API bodies, and log output to disk. Missing files = FAIL.
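The "missing files = FAIL" rule can be sketched as a small gate over the evidence directory. This is a minimal illustration, not the plugin's implementation: the function name `evidence_verdict`, the file names in the usage note, and the choice to treat empty files as missing are all assumptions.

```python
from pathlib import Path


def evidence_verdict(evidence_dir: Path, required: list[str]) -> tuple[str, list[str]]:
    """Return ("PASS" or "FAIL", missing_files) for one validation step."""
    missing = []
    for name in required:
        artifact = evidence_dir / name
        # Assumption: an empty file counts as missing evidence.
        if not artifact.is_file() or artifact.stat().st_size == 0:
            missing.append(name)
    return ("FAIL" if missing else "PASS", missing)
```

A caller might run `evidence_verdict(Path("evidence/login"), ["screenshot.png", "response.json", "build.log"])` and refuse to mark the step done unless the verdict is PASS; the returned list names exactly which artifacts a reviewer would be unable to open.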

Three Engines

VALIDATE (beta) runs single-pass validation. CONSENSUS (V1.5) puts contested verdicts to a multi-validator vote. FORGE (V2.0) runs autonomous fix loops with rollback.
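The page does not say how CONSENSUS tallies votes. Purely as an assumption, a strict-majority rule that fails closed on ties might look like the sketch below; the function name `consensus` and the tie-breaking policy are hypothetical.

```python
def consensus(verdicts: list[str]) -> str:
    """Combine per-validator verdicts ("PASS" or "FAIL") into one verdict."""
    # Assumption: PASS needs a strict majority; ties and empty input fail closed.
    passes = sum(1 for v in verdicts if v == "PASS")
    return "PASS" if passes * 2 > len(verdicts) else "FAIL"
```

Failing closed keeps the vote consistent with the evidence-first stance: a contested verdict only passes when most validators independently agree.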

Dependency-Aware Waves

DB → API → Web/iOS. Downstream validators never run against a failing upstream.
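Wave ordering of this kind can be sketched with Python's standard `graphlib` module (3.9+). This is an illustration under assumptions, not ValidationForge's scheduler: the `run_waves` name, the `validate` callback, and the "SKIPPED" label for validators whose upstream did not pass are all hypothetical.

```python
from graphlib import TopologicalSorter
from typing import Callable


def run_waves(deps: dict[str, set[str]], validate: Callable[[str], bool]) -> dict[str, str]:
    """Run validators wave by wave; deps maps each layer to its upstream layers."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()
    verdicts: dict[str, str] = {}
    while sorter.is_active():
        # Everything returned here has all of its upstream layers finished.
        for name in sorter.get_ready():
            upstream = deps.get(name, set())
            if any(verdicts[u] != "PASS" for u in upstream):
                # Never run a downstream validator against a failing upstream.
                verdicts[name] = "SKIPPED"
            else:
                verdicts[name] = "PASS" if validate(name) else "FAIL"
            sorter.done(name)
    return verdicts
```

With `deps = {"api": {"db"}, "web": {"api"}, "ios": {"api"}}` and a `validate` that fails only for `"api"`, the database layer passes, the API layer fails, and both the web and iOS layers are skipped rather than validated against a broken API.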

  1. claude plugin marketplace add krzemienski/validationforge
  2. claude plugin install validationforge@validationforge
  3. Run /vf-setup to initialize configuration + preflight checks
  4. Run /validate-plan <journey-name> to define PASS criteria + evidence requirements
  5. Run /validate-sweep to execute against the real system and capture evidence
  6. Run /validate-dashboard to render the final evidence-backed verdict
VALIDATE engine: Beta, 7-phase pipeline
CONSENSUS engine: V1.5, multi-validator vote
FORGE engine: V2.0, autonomous fix loops
Evidence chain of custody: Screenshots + API bodies + logs
Block-test-files hook: PreToolUse, exit 2
Dependency-aware waves: DB → API → Web/iOS
Documentation: read the VF docs at vf.withagents.dev
  1. Stable VALIDATE engine, 52 skills, 19 commands, 7 hooks shipped.
  2. Dependency-aware wave scheduling for multi-platform projects.
  3. Evidence dashboard + per-journey verdict synthesis.