- Human-in-the-Loop Review Workflows for LLM Applications & Agents
  You’ve been testing a new AI assistant. It sounds confident, reasons step-by-step, cites sources, and handles 90% of real user…
- Best LLM Observability Tools of 2025: Top Platforms & Features
  LLM applications are everywhere now, and they’re fundamentally different from traditional software. They’re non-deterministic. They hallucinate. They can fail in…
- Context Engineering: The Discipline Behind Reliable LLM Applications & Agents
  Teams cannot ship dependable LLM systems with prompt templates alone. Model outputs depend on the full set of instructions, facts,…
- LLM Tracing: The Foundation of Reliable AI Applications
  Your RAG pipeline works perfectly in testing. You’ve validated the retrieval logic, tuned the prompts, and confirmed the model returns…
- LLM Monitoring: From Models to Agentic Systems
  As software teams entrust a growing number of tasks to large language models (LLMs), LLM monitoring has become a vital…
- Opik Release Highlights: GEPA Agent Optimization, MCP Tool-Calling, and Automated Trace Analysis
  As AI agents and LLM applications grow more powerful and complex, this month’s Opik updates integrate leading-edge technologies to help…
- Thread-Level Human-in-the-Loop Feedback for Agent Validation
  Imagine you are a developer building an agentic AI application or chatbot. You are probably not just coding a single…
- Introduction to LLM-as-a-Judge For Evals
  In recent years, LLMs (large language models) have emerged as the most significant development in the AI space. They are…
- LLM Evaluation: The Ultimate Guide to Metrics, Methods & Best Practices
  The meteoric rise of large language models (LLMs) and their widespread use across more applications and user experiences raise an…
- How We Used Opik to Build AI-Powered Trace Analysis
  Within the GenAI development cycle, Opik does the often-overlooked yet essential work of logging, testing, comparing, and optimizing steps…