Run open source LLM evaluations with Opik!

Star
Comet logo
  • Opik LLM Evals
  • Products
    • Opik – LLM Evaluation
    • ML Experiment Management
    • ML Artifacts
    • ML Model Registry
    • ML Model Production Monitoring
  • Docs
    • Opik – LLM Evaluation
    • ML Experiment Management
  • Pricing
  • Customers
  • Learn
    • Blog
    • Deep Learning Weekly
    • LLM Course
  • Company
    • About Us
    • News and Events
      • Events
      • Press Releases
    • Careers
    • Contact Us
    • Leadership
  • Login
Get Demo
Try Comet Free
  1. Home
  2. Blog
  3. Page 3

Comet Blog

  • Academic Research
  • Comet Community Hub
  • Industry
  • Integrations
  • LLMOps
  • Machine Learning
  • Office Hours
  • Partners & Integrations
  • Product
  • Thought Leadership
  • Tutorials
  • Uncategorized
  • From Observability to Optimization: Announcing the Opik Agent Optimizer Public Beta

    Vincent Koc

    May 21, 2025
    LLMOps, Product

    At Comet, we’re driven by a commitment to advance innovation in AI, particularly in the realm of LLM observability. Our…

    Read

    From Observability to Optimization: Announcing the Opik Agent Optimizer Public Beta
  • Claire Longo

    February 19, 2025
    LLMOps, Machine Learning, Tutorials

    A Simple Recipe for LLM Observability

    So, you’re building an AI application on top of an LLM, and you’re planning on setting it live in production.…

    Read

    A Simple Recipe for LLM Observability
  • Stéphan André

    February 5, 2025
    Comet Community Hub, LLMOps

    LLM Monitoring & Maintenance in Production Applications

    Generative AI has become a transformative force, revolutionizing how businesses engage with users through chatbots, content creation, and personalized recommendations.…

    Read

    LLM Monitoring & Maintenance in Production Applications
  • Andrés Cruz

    January 29, 2025
    LLMOps, Product

    Building Opik: A Scalable Open-Source LLM Observability Platform

    Opik is an open-source platform for evaluating, testing, and monitoring LLM applications, created by Comet. When teams integrate language models…

    Read

    Building Opik: A Scalable Open-Source LLM Observability Platform
  • Abby Morgan

    January 28, 2025
    Comet Community Hub, LLMOps, Machine Learning, Product, Tutorials

    G-Eval for LLM Evaluation

    LLM-as-a-judge evaluators have gained widespread adoption due to their flexibility, scalability, and close alignment with human judgment. They excel at…

    Read

    G-Eval for LLM Evaluation
  • Caroline Borders

    January 27, 2025
    LLMOps, Product

    Comet Product Releases January 2025

    As 2025 picks up steam, we’re thrilled to bring you some exciting product updates from Comet! This month, we’ve added…

    Read

    Comet Product Releases January 2025
  • Paul Iusztin

    |

    Decoding ML

    January 13, 2025
    LLMOps, Tutorials

    Build Multi-Index Advanced RAG Apps

    Welcome to Lesson 12 of 12 in our free course series, LLM Twin: Building Your Production-Ready AI Replica. You’ll learn…

    Read

    Build Multi-Index Advanced RAG Apps
  • Paul Iusztin

    |

    Decoding ML

    January 13, 2025
    LLMOps, Tutorials

    Build a scalable RAG ingestion pipeline using 74.3% less code

    Welcome to Lesson 11 of 12 in our free course series, LLM Twin: Building Your Production-Ready AI Replica. You’ll learn…

    Read

    Build a scalable RAG ingestion pipeline using 74.3% less code
  • Siddharth Mehta

    January 3, 2025
    LLMOps

    LLM Evaluation Metrics Every Developer Should Know

    When you build an app or system on top of an LLM, you need a way to understand the quality…

    Read

    LLM Evaluation Metrics Every Developer Should Know
  • Gourav Bais

    December 19, 2024
    LLMOps

    Intro to LLM Observability: What to Monitor & How to Get Started

    While LLM usage is soaring, productionizing an LLM-powered application or software product presents new and different challenges compared to traditional…

    Read

    Intro to LLM Observability: What to Monitor & How to Get Started
  • Abby Morgan

    December 19, 2024
    Comet Community Hub, LLMOps, Tutorials

    BERTScore For LLM Evaluation

    Introduction BERTScore represents a pivotal shift in LLM evaluation, moving beyond traditional heuristic-based metrics like BLEU and ROUGE to a…

    Read

    BERTScore For LLM Evaluation
←
1 2 3 4 … 45
→

Get started today for free.

Trusted by Thousands of Data Scientists

Create Free Account
Contact Sales
Comet logo
  • LinkedIn
  • X
  • YouTube
  • Facebook

Subscribe to Comet

Thank you for subscribing to Comet’s newsletter!

Products

  • Opik
  • Experiment Management
  • Artifacts
  • Model Registry
  • Model Production Monitoring

Learn

  • Documentation
  • Resources
  • Comet Blog
  • Deep Learning Weekly
  • Heartbeat
  • LLM Course

Company

  • About Us
  • News and Events
  • Careers
  • Contact Us

Pricing

  • Pricing
  • Create a Free Account
  • Contact Sales
Capterra badge
AICPA badge

©2025 Comet ML, Inc. – All Rights Reserved

Terms of Service

Privacy Policy

CCPA Privacy Notice

Cookie Settings

We use cookies to collect statistical usage information about our website and its visitors and ensure we give you the best experience on our website. Please refer to our Privacy Policy to learn more.OkPrivacy policy