- SelfCheckGPT for LLM Evaluation
Detecting hallucinations in language models is challenging. There are three general approaches. The problem with many LLM-as-a-Judge techniques is that…
- Perplexity for LLM Evaluation
Perplexity is, historically, one of the “standard” evaluation metrics for language models. And while recent years have seen a…
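Under the standard definition, perplexity is the exponential of the average negative log-likelihood per token; a minimal sketch (the function name and example values are illustrative):

```python
import math

def perplexity(token_logprobs):
    """Perplexity of a sequence from its per-token log-probabilities
    (natural log): exp of the mean negative log-likelihood."""
    nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(nll)

# A model that assigns every token probability 1/4 has perplexity 4:
# it is as uncertain as a uniform choice among four tokens.
print(perplexity([math.log(0.25)] * 8))  # ≈ 4.0
```

Lower perplexity means the model assigns higher probability to the observed text; a perfect model (probability 1 on every token) scores 1.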