Release Highlights: Discover Opik Agent Optimizer, Guardrails, & New Integrations
As LLMs power more complex, multi-step agentic systems, the need for precise optimization and control is growing. In case you…
We’ve spent the past year building tools that make LLM applications more transparent, measurable, and accountable. Since launching Opik, our…
At Comet, we're driven by a commitment to advance innovation in AI, particularly in the realm of LLM observability. Our…
We’ve just rolled out two major updates in Opik, Comet’s open-source LLM evaluation platform, that make it easier than ever…
Six years ago, I decided to open-source my Python code for a personal project I was working on, which led…
Large language models (LLMs) have revolutionized natural language processing, yet most development and evaluation efforts have historically centered around Latin-script…
Detecting hallucinations in language models is challenging. There are three general approaches: Measuring token-level probability distributions for indications that a…