Tenant: primeassist-dev

P26 dashboard skeleton — module pages land in M2+.

Eval runs.

Each run replays a golden set through the chat orchestrator with a synthetic visitor session, asserts per-kind expectations on the trace, and persists pass / fail / skipped counts.
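The run loop described above can be sketched roughly as follows. This is a minimal illustration, not the product's implementation: `GoldenCase`, `Trace`, `CHECKS`, `run_eval`, `fake_orchestrator`, the two kind names, and the rule that a case with no registered checker counts as skipped are all hypothetical, and persistence is left out (the counts are simply returned).

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class GoldenCase:
    kind: str      # hypothetical expectation kind, e.g. "answer_contains"
    prompt: str    # message replayed through the orchestrator
    expected: str  # value the per-kind check looks for in the trace


@dataclass
class Trace:
    reply: str
    tools_called: list[str] = field(default_factory=list)


# Hypothetical per-kind assertions: each takes a trace and the expected value.
CHECKS: dict[str, Callable[[Trace, str], bool]] = {
    "answer_contains": lambda trace, expected: expected.lower() in trace.reply.lower(),
    "tool_called": lambda trace, expected: expected in trace.tools_called,
}


def run_eval(golden_set, orchestrate, session_id="synthetic-visitor"):
    """Replay each case and tally pass / fail / skipped counts."""
    counts = {"pass": 0, "fail": 0, "skipped": 0}
    for case in golden_set:
        check = CHECKS.get(case.kind)
        if check is None:
            # Assumption: a kind without a checker is skipped, not failed.
            counts["skipped"] += 1
            continue
        trace = orchestrate(session_id, case.prompt)
        counts["pass" if check(trace, case.expected) else "fail"] += 1
    return counts


def fake_orchestrator(session_id, prompt):
    """Stand-in for the chat orchestrator; returns a canned trace."""
    if "order" in prompt.lower():
        return Trace(reply="Let me look up your order.", tools_called=["order_lookup"])
    return Trace(reply="Hello! How can I help?")


GOLDEN_SET = [
    GoldenCase("answer_contains", "Hi there", "hello"),
    GoldenCase("tool_called", "Where is my order?", "order_lookup"),
    GoldenCase("tool_called", "Hi there", "order_lookup"),
    GoldenCase("latency_under", "Hi there", "500"),  # no checker registered
]

counts = run_eval(GOLDEN_SET, fake_orchestrator)
print(counts)  # {'pass': 2, 'fail': 1, 'skipped': 1}
```

Keeping the checks in a table keyed by kind means a new expectation kind only needs a new entry; the replay loop itself stays unchanged.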
