K=3 adversarial verification cut false positives 27.8% to 0.0% (95% CI [11.1, 50.0] to [0, 0]; recall 100% to 77.8%) on a 36-snippet labeled set including prompt-injection traps. Held-out real target: 3 of 3 genuine bugs found, 0 surviving false positives.
Cost-aware model routing (DeepSeek, Haiku, Sonnet, Opus) plus adversarial multi-agent verification plus full tracing, with a trace UI that looks like a product. Fans out finders per file, K skeptics per finding, concurrency cap 8. Cost-routing claim is operator-gated on an Anthropic key. Present as harness committed, live multi-tier number gated, not a fabricated routing number. make eval-dry reproduces offline.