Run open source LLM evaluations with Opik!

Star
Comet logo
  • Opik LLM Evals
  • Products
    • Opik – LLM Evaluation
    • ML Experiment Management
    • ML Artifacts
    • ML Model Registry
    • ML Model Production Monitoring
  • Docs
    • Opik – LLM Evaluation
    • ML Experiment Management
  • Pricing
  • Customers
  • Learn
    • Blog
    • Deep Learning Weekly
    • LLM Course
  • Company
    • About Us
    • News and Events
      • Events
      • Press Releases
    • Careers
    • Contact Us
    • Leadership
  • Login
Get Demo
Try Comet Free
  1. Home
  2. Blog

Comet Blog

  • Academic Research
  • Comet Community Hub
  • Industry
  • Integrations
  • LLMOps
  • Machine Learning
  • Office Hours
  • Partners & Integrations
  • Product
  • Thought Leadership
  • Tutorials
  • Uncategorized
  • From Observability to Optimization: Announcing the Opik Agent Optimizer Public Beta

    Vincent Koc

    May 21, 2025
    LLMOps, Product

    At Comet, we’re driven by a commitment to advance innovation in AI, particularly in the realm of LLM observability. Our…

    Read

    From Observability to Optimization: Announcing the Opik Agent Optimizer Public Beta
  • Caroline Borders

    December 18, 2024
    Product

    Comet Product Releases December 2024

    Each layer of visibility into your training and debugging workflows builds confidence that your models will work reliably in production.…

    Read

    Comet Product Releases December 2024
  • Claire Longo

    December 9, 2024
    Comet Community Hub, LLMOps, Tutorials

    Building ClaireBot, an AI Personal Stylist Chatbot

    Follow the evolution of my personal AI project and discover how to integrate image analysis, LLM models, and LLM-as-a-judge evaluation…

    Read

    Building ClaireBot, an AI Personal Stylist Chatbot
  • Caleb Kaiser

    November 27, 2024
    LLMOps, Uncategorized

    Structured Generation for LLM-as-a-Judge Evaluations

    For the past few months, I’ve been working on LLM-based evaluations (”LLM-as-a-Judge” metrics) for language models. The results have so…

    Read

    Structured Generation for LLM-as-a-Judge Evaluations
  • Abby Morgan

    November 21, 2024
    Comet Community Hub, LLMOps, Tutorials

    Perplexity for LLM Evaluation

    Perplexity is, historically speaking, one of the “standard” evaluation metrics for language models. And while recent years have seen a…

    Read

    Perplexity for LLM Evaluation
  • Siddharth Mehta

    October 8, 2024
    LLMOps, Product, Tutorials

    OpenAI Evals: Log Datasets & Evaluate LLM Performance with Opik

        OpenAI’s Python API is quickly becoming one of the most-downloaded Python packages. With an easy-to-use SDK and access…

    Read

    OpenAI Evals: Log Datasets & Evaluate LLM Performance with Opik
  • Gideon Mendels

    |

    Jacques Verre

    September 16, 2024
    Comet Community Hub, LLMOps, Product

    Meet Opik: Your New Tool to Evaluate, Test, and Monitor LLM Applications

    Today, we’re thrilled to introduce Opik – an open-source, end-to-end LLM development platform that provides the observability tools you need…

    Read

    Meet Opik: Your New Tool to Evaluate, Test, and Monitor LLM Applications
  • Fabrício Ceolin

    August 30, 2024
    Comet Community Hub, LLMOps, Machine Learning, Tutorials

    Building a Low-Cost Local LLM Server to Run 70 Billion Parameter Models

    A guest post from Fabrício Ceolin, DevOps Engineer at Comet. Inspired by the growing demand for large-scale language models, Fabrício…

    Read

    Building a Low-Cost Local LLM Server to Run 70 Billion Parameter Models
  • Paul Iusztin

    |

    Decoding ML

    July 31, 2024
    Comet Community Hub, LLMOps, Tutorials

    The Ultimate Prompt Monitoring Pipeline

    Welcome to Lesson 10 of 12 in our free course series, LLM Twin: Building Your Production-Ready AI Replica. You’ll learn how…

    Read

    The Ultimate Prompt Monitoring Pipeline
  • Nikolas Laskaris

    |

    Thomas Fan

    July 29, 2024
    Comet Community Hub, Industry, Integrations, Machine Learning, Product, Tutorials

    How to Use Comet’s New Integration with Union & Flyte

    In the machine learning (ML) and artificial intelligence (AI) domain, managing, tracking, and visualizing model training processes, especially at scale,…

    Read

    How to Use Comet’s New Integration with Union & Flyte
  • Paul Iusztin

    |

    Decoding ML

    July 23, 2024
    LLMOps, Machine Learning, Tutorials

    Beyond Proof of Concept: Building RAG Systems That Scale

    Welcome to Lesson 9 of 12 in our free course series, LLM Twin: Building Your Production-Ready AI Replica. You’ll learn how to use…

    Read

    Beyond Proof of Concept: Building RAG Systems That Scale
←
1 2 3 4 5 … 45
→

Get started today for free.

Trusted by Thousands of Data Scientists

Create Free Account
Contact Sales
Comet logo
  • LinkedIn
  • X
  • YouTube
  • Facebook

Subscribe to Comet

Thank you for subscribing to Comet’s newsletter!

Products

  • Opik
  • Experiment Management
  • Artifacts
  • Model Registry
  • Model Production Monitoring

Learn

  • Documentation
  • Resources
  • Comet Blog
  • Deep Learning Weekly
  • Heartbeat
  • LLM Course

Company

  • About Us
  • News and Events
  • Careers
  • Contact Us

Pricing

  • Pricing
  • Create a Free Account
  • Contact Sales
Capterra badge
AICPA badge

©2025 Comet ML, Inc. – All Rights Reserved

Terms of Service

Privacy Policy

CCPA Privacy Notice

Cookie Settings

We use cookies to collect statistical usage information about our website and its visitors and ensure we give you the best experience on our website. Please refer to our Privacy Policy to learn more.OkPrivacy policy