DialogueHelpfulnessJudge inspects the latest assistant reply in the context of preceding turns. It rewards responses that acknowledge the user’s request, use the available context, and offer actionable guidance.
Integrate this judge into regression suites to catch regressions after prompt changes or upgrades to your assistant model.