Summarization Consistency Judge
SummarizationConsistencyJudge compares a generated summary with the original document (or transcript) and scores how faithfully key facts were preserved. It follows the GEval method: expanding your instructions into a chain-of-thought rubric, then grading on a 0.0–1.0 scale (derived from a raw 0–10 judgement) with detailed explanations.
Use it when you automatically summarise support tickets, research reports, or call transcripts and want to catch hallucinations before they reach end users.
Checking summary faithfulness
Inputs
Configuration
The evaluator emits an integer between 0 and 10 that Opik normalises to 0–1; the reason field captures the rubric notes explaining the judgement.