-
Major Releases: Auto-Optimize Multi-Step Agents, Annotate & Score Entire Chatbot Convos
When multiple steps in an agentic system are contextually related, logging and evaluating individual LLM calls doesn’t tell the whole…
-
Release Highlights: Discover Opik Agent Optimizer, Guardrails, & New Integrations
As LLMs power more complex, multi-step agentic systems, the need for precise optimization and control is growing. In case you…
-
Announcing Opik’s Guardrails Beta: Moderate LLM Applications in Real-Time
We’ve spent the past year building tools that make LLM applications more transparent, measurable, and accountable. Since launching Opik, our…
-
From Observability to Optimization: Announcing the Opik Agent Optimizer Public Beta
At Comet, we’re driven by a commitment to advance innovation in AI, particularly in the realm of LLM observability. Our…
-
Major Releases: MCP Server & Google Agent Dev Kit Support
We’ve just rolled out two major updates in Opik, Comet’s open-source LLM evaluation platform, that make it easier than ever…
-
Major Releases: TypeScript for LLM Evals, Total Fidelity ML Metrics, & More
Spring is in the air, and we’re excited to bring you four fresh releases in the Comet platform to make…
-
Building Opik: A Scalable Open-Source LLM Observability Platform
Opik is an open-source platform for evaluating, testing, and monitoring LLM applications, created by Comet. When teams integrate language models…
-
G-Eval for LLM Evaluation
LLM-as-a-judge evaluators have gained widespread adoption due to their flexibility, scalability, and close alignment with human judgment. They excel at…
-
Comet Product Releases January 2025
As 2025 picks up steam, we’re thrilled to bring you some exciting product updates from Comet! This month, we’ve added…
-
Comet Product Releases December 2024
Each layer of visibility into your training and debugging workflows builds confidence that your models will work reliably in production.…