- **LLM Tracing: The Foundation of Reliable AI Applications**: Your RAG pipeline works perfectly in testing. You’ve validated the retrieval logic, tuned the prompts, and confirmed the model returns…
- **LLM Monitoring: From Models to Agentic Systems**: As software teams entrust a growing number of tasks to large language models (LLMs), LLM monitoring has become a vital…
- **Opik Release Highlights: GEPA Agent Optimization, MCP Tool-Calling, and Automated Trace Analysis**: As AI agents and LLM applications grow more powerful and complex, this month’s Opik updates integrate leading-edge technologies to help…
- **Thread-Level Human-in-the-Loop Feedback for Agent Validation**: Imagine you are a developer building an agentic AI application or chatbot. You are probably not just coding a single…
- **Introduction to LLM-as-a-Judge For Evals**: In recent years, large language models (LLMs) have emerged as the most significant development in the AI space. They are…
- **The Ultimate Guide to LLM Evaluation: Metrics, Methods & Best Practices**: The meteoric rise of large language models (LLMs) and their widespread use across more applications and user experiences raises an…
- **How We Used Opik to Build AI-Powered Trace Analysis**: Within the GenAI development cycle, Opik does the often-overlooked yet essential work of logging, testing, comparing, and optimizing steps…
- **Release Highlights: Extensive Model Support, New Quick-Start Options & Simplified Insight Detection**: Building and scaling GenAI applications involves numerous moving parts, from logging your first LLM trace to managing experiments across complex…
- **AI Agent Design Patterns: How to Build Reliable AI Agent Architecture for Production**: LLMs are powerful, but turning them into reliable, adaptable AI agents is a whole different game. After designing the architecture…
- **Pretraining: Breaking Down the Modern LLM Training Pipeline**: LLM training shapes everything from what a model knows to how it reasons and responds. So, understanding how models are…