Evals

Each eval teaches an AI judge to score content the way you would.

Name | Status | Items | Scale | Judge Model | Created
test | Setting up | 0 | 1-5 | | 4/30/2026