Evals
Each eval teaches an AI judge to score content the way you would.
Name  Status      Items  Scale  Judge Model  Created
test  Setting up  0      1-5    -            4/30/2026