- Release Highlights: Discover Opik Agent Optimizer, Guardrails, & New Integrations
  As LLMs power more complex, multi-step agentic systems, the need for precise optimization and control is growing. In case you…
- Announcing Opik’s Guardrails Beta: Moderate LLM Applications in Real-Time
  We’ve spent the past year building tools that make LLM applications more transparent, measurable, and accountable. Since launching Opik, our…
- Major Releases: MCP Server & Google Agent Dev Kit Support
  We’ve just rolled out two major updates in Opik, Comet’s open-source LLM evaluation platform, that make it easier than ever…
- How Contributing to Open Source Projects Helped Me Build My Dream Career in AI
  Six years ago, I decided to open-source my Python code for a personal project I was working on, which led…
- SelfCheckGPT for LLM Evaluation
  Detecting hallucinations in language models is challenging. There are three general approaches. The problem with many LLM-as-a-Judge techniques is that…
- LLM Hallucination Detection in App Development
  Even ChatGPT knows it’s not always right. When prompted, “Are large language models (LLMs) always accurate?” ChatGPT says no and…
- Major Releases: TypeScript for LLM Evals, Total Fidelity ML Metrics, & More
  Spring is in the air, and we’re excited to bring you four fresh releases in the Comet platform to make…
- LLM Evaluation Frameworks: Head-to-Head Comparison
  As teams work on complex AI agents and expand what LLM-powered applications can achieve, a variety of LLM evaluation frameworks…
- LLM Juries for Evaluation
  Evaluating the correctness of generated responses is an inherently challenging task. LLM-as-a-Judge evaluators have gained popularity for their ability to…
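The "LLM jury" pattern asks several independent judge models to grade the same response and aggregates their verdicts instead of trusting a single LLM-as-a-Judge. The snippet below is a minimal sketch of that aggregation step under assumed names; the judge callables are hypothetical wrappers, not Opik's API.

```python
# Minimal LLM-jury aggregation (sketch, assumed names).
from collections import Counter
from typing import Callable, List

# A judge maps (question, answer) to a "pass" / "fail" verdict.
Judge = Callable[[str, str], str]

def jury_verdict(question: str, answer: str, judges: List[Judge]) -> str:
    """Majority vote across judges; ties count as 'fail' to stay conservative."""
    votes = Counter(judge(question, answer) for judge in judges)
    passes, fails = votes.get("pass", 0), votes.get("fail", 0)
    return "pass" if passes > fails else "fail"

# Usage sketch: each judge wraps a different model behind the same interface,
# e.g. judges = [gpt4_judge, claude_judge, gemini_judge]  # hypothetical wrappers
# print(jury_verdict("What year did X happen?", "1994", judges))
```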