Run open source LLM evaluations with Opik!

Star
Comet logo
  • Comet logo
  • Opik Platform
  • Products
    • Opik GenAI Platform
    • MLOps Platform
  • Docs
    • Opik GenAI Platform
    • MLOps Platform
  • Pricing
  • Customers
  • Learn
    • Blog
    • Deep Learning Weekly
  • Company
    • About Us
    • News and Events
      • Events
      • Press Releases
    • Partners
    • Careers
    • Contact Us
    • Leadership
  • Login
Get Demo
Try Comet Free
Contact Us
Try Opik Free
  1. Home
  2. Blog
  3. Page 38

Comet Blog

  • Academic Research
  • Comet Community Hub
  • Industry
  • Integrations
  • LLMOps
  • Machine Learning
  • Office Hours
  • Partners & Integrations
  • Product
  • Thought Leadership
  • Tutorials
  • Uncategorized
  • From Observability to Optimization: Announcing the Opik Agent Optimizer Public Beta

    Vincent Koc

    May 21, 2025
    LLMOps, Product

    At Comet, we’re driven by a commitment to advance innovation in AI, particularly in the realm of LLM observability. Our…

    Read

    From Observability to Optimization: Announcing the Opik Agent Optimizer Public Beta
  • Caroline Borders

    November 12, 2025
    Product

    Opik Release Highlights: Real-Time Alerts, No-Code Evaluation, & Generative Prompt Tools

    As LLM applications and agents scale, visibility and iteration speed matter more than ever. This month’s Opik updates close the…

    Read

    Opik Release Highlights: Real-Time Alerts, No-Code Evaluation, & Generative Prompt Tools
  • Dr. Cayla Eagon

    November 11, 2025
    LLMOps

    Human-in-the-Loop Review Workflows for LLM Applications & Agents

    You’ve been testing a new AI assistant. It sounds confident, reasons step-by-step, cites sources, and handles 90% of real user…

    Read

    Human-in-the-Loop Review Workflows for LLM Applications & Agents
  • Kelsey Kinzer

    November 11, 2025
    LLMOps

    Best LLM Observability Tools of 2025: Top Platforms & Features

    LLM applications are everywhere now, and they’re fundamentally different from traditional software. They’re non-deterministic. They hallucinate. They can fail in…

    Read

    Best LLM Observability Tools of 2025: Top Platforms & Features
  • Matt M. Casey

    November 5, 2025
    LLMOps

    Context Engineering: The Discipline Behind Reliable LLM Applications & Agents

    Teams cannot ship dependable LLM systems with prompt templates alone. Model outputs depend on the full set of instructions, facts,…

    Read

    Context Engineering: The Discipline Behind Reliable LLM Applications & Agents
  • Sharon Campbell-Crow

    October 28, 2025
    LLMOps

    LLM Tracing: The Foundation of Reliable AI Applications

    Your RAG pipeline works perfectly in testing. You’ve validated the retrieval logic, tuned the prompts, and confirmed the model returns…

    Read

    LLM Tracing: The Foundation of Reliable AI Applications
  • Matt M. Casey

    October 28, 2025
    LLMOps

    LLM Monitoring: From Models to Agentic Systems

    As software teams entrust a growing number of tasks to large language models (LLMs), LLM monitoring has become a vital…

    Read

    LLM Monitoring: From Models to Agentic Systems
  • Caroline Borders

    October 14, 2025
    Product

    Opik Release Highlights: GEPA Agent Optimization, MCP Tool-Calling, and Automated Trace Analysis

    As AI agents and LLM applications grow more powerful and complex, this month’s Opik updates integrate leading-edge technologies to help…

    Read

    Opik Release Highlights: GEPA Agent Optimization, MCP Tool-Calling, and Automated Trace Analysis
  • Claire Longo

    October 10, 2025
    LLMOps

    Thread-Level Human-in-the-Loop Feedback for Agent Validation

    Imagine you are a developer building an agentic AI application or chatbot. You are probably not just coding a single…

    Read

    Thread-Level Human-in-the-Loop Feedback for Agent Validation
  • Gourav Bais

    September 22, 2025
    LLMOps

    Introduction to LLM-as-a-Judge For Evals

    In recent years, LLMs (large language models) have emerged as the most significant development in the AI space. They are…

    Read

    Introduction to LLM-as-a-Judge For Evals
  • Kelsey Kinzer

    September 11, 2025
    LLMOps

    The Ultimate Guide to LLM Evaluation: Metrics, Methods & Best Practices

    The meteoric rise of large language models (LLMs) and their widespread use across more applications and user experiences raises an…

    Read

    The Ultimate Guide to LLM Evaluation: Metrics, Methods & Best Practices
1 2 3 … 47
→

Get started today for free.

You don’t need a credit card to sign up, and your Comet account comes with a generous free tier you can actually use—for as long as you like.

Create Free Account
Contact Sales
Comet logo
  • LinkedIn
  • X
  • YouTube
  • Facebook

Subscribe to Comet

Thank you for subscribing to Comet’s newsletter!

Products

  • Opik LLM Evaluation
  • ML Experiment Management
  • ML Artifacts
  • ML Model Registry
  • ML Model Production Monitoring

Learn

  • Documentation
  • Opik University
  • Comet Blog
  • Deep Learning Weekly

Company

  • About Us
  • News and Events
  • Partners
  • Careers
  • Contact Us

Pricing

  • Pricing
  • Create a Free Account
  • Contact Sales
Capterra badge
AICPA badge

©2025 Comet ML, Inc. – All Rights Reserved

Terms of Service

Privacy Policy

CCPA Privacy Notice

Cookie Settings

We use cookies to collect statistical usage information about our website and its visitors and ensure we give you the best experience on our website. Please refer to our Privacy Policy to learn more.