SelfCheckGPT for LLM Evaluation
Detecting hallucinations in language models is challenging. There are three general approaches: Measuring token-level probability distributions for indications that a…
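As a minimal sketch of the token-probability approach: given per-token probabilities for a generated sentence, two simple sentence-level signals are the average and the maximum negative log-probability (a single very unlikely token, or a generally low-confidence sentence, can both hint at hallucination). The function below is illustrative, not the exact scoring used by any particular system, and assumes you already have access to the model's per-token probabilities.

```python
import math

def token_level_scores(token_probs):
    """Sentence-level uncertainty signals from per-token probabilities.

    - avg_neg_log_prob: high when the model is unsure across the sentence.
    - max_neg_log_prob: high when any single token was very unlikely.
    Both are heuristic hallucination indicators, not definitive labels.
    """
    neg_log_probs = [-math.log(p) for p in token_probs]
    return {
        "avg_neg_log_prob": sum(neg_log_probs) / len(neg_log_probs),
        "max_neg_log_prob": max(neg_log_probs),
    }

# One surprising token (p = 0.05) dominates the max-based signal:
scores = token_level_scores([0.9, 0.8, 0.05, 0.95])
```

The max-based score is sensitive to a single outlier token, while the average smooths over the whole sentence; which is more useful depends on whether hallucinations tend to be localized (a wrong name or date) or diffuse.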
Perplexity is, historically speaking, one of the "standard" evaluation metrics for language models. And while recent years have seen a…
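Concretely, perplexity is the exponential of the mean negative log-likelihood per token: a model with perplexity $k$ is, on average, as uncertain as if it were choosing uniformly among $k$ tokens at each step. A small sketch, assuming you already have per-token log-probabilities from the model:

```python
import math

def perplexity(token_log_probs):
    """Perplexity = exp of the mean negative log-likelihood per token."""
    n = len(token_log_probs)
    return math.exp(-sum(token_log_probs) / n)

# If every token had probability 0.5, perplexity is exactly 2:
ppl = perplexity([math.log(0.5)] * 10)  # -> 2.0
```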