Tag: LLM Evaluation

Sharon Campbell-Crow

January 28, 2026

LLMOps

What is LLM Observability? The Ultimate Guide for AI Developers

If your LLM application or agent sends your user a hallucinated answer, do you know when and why it happened?…
Vincent Koc

March 27, 2025

Academic Research, Comet Community Hub

LLM Evaluation Complexities for Non-Latin Languages

Large language models (LLMs) have revolutionized natural language processing, yet most development and evaluation efforts have historically centered around Latin-script…
Abby Morgan

March 26, 2025

Comet Community Hub, LLMOps, Tutorials

SelfCheckGPT for LLM Evaluation

Detecting hallucinations in language models is challenging. There are three general approaches: The problem with many LLM-as-a-Judge techniques is that…
Abby Morgan

January 28, 2025

Comet Community Hub, LLMOps, Machine Learning, Product, Tutorials

G-Eval for LLM Evaluation

LLM-as-a-judge evaluators have gained widespread adoption due to their flexibility, scalability, and close alignment with human judgment. They excel at…
Abby Morgan

December 19, 2024

Comet Community Hub, LLMOps, Tutorials

BERTScore For LLM Evaluation

BERTScore represents a pivotal shift in LLM evaluation, moving beyond traditional heuristic-based metrics like BLEU and ROUGE to a…
