-
Structured Generation for LLM-as-a-Judge Evaluations
For the past few months, I’ve been working on LLM-based evaluations (”LLM-as-a-Judge” metrics) for language models. The results have so…
Run open source LLM evaluations with Opik!
StarFor the past few months, I’ve been working on LLM-based evaluations (”LLM-as-a-Judge” metrics) for language models. The results have so…
You don’t need a credit card to sign up, and your Comet account comes with a generous free tier you can actually use—for as long as you like.