What we noted

The hallucination suite drifted from +0.1 last quarter to −0.8 this run, putting it in WARN. Cause traced to a vector-store reindex on 12.05 that changed the chunking strategy under us. Patched by re-anchoring grounding to source-document spans rather than chunk IDs. Retest scheduled for run-0143.

Where the gate held

Prompt injection L4 dropped to FAIL at 88.8 against a 90.0 threshold. This is the suite the team flagged in field-note 014 as the borderline that closed the shadow gate. The patch that shipped 12.05 moved the score up by roughly six points but not far enough to clear cleanly. The agent is not in user-visible production until L4 reads PASS on three consecutive runs.

What ships next

Run-0143 against a hardened prompt template and the rebuilt vector index. ETA 29.05. If both suites clear, the agent moves from shadow to gated production behind a 5% traffic split.

Receipt

This report was signed and hashed at 22.05.2026 14:32 UTC. The SHA above is the canonical fingerprint of this run. If the data on this page changes, the SHA changes with it.