Comet Blog

  • Vincent Koc

    May 21, 2025
    LLMOps, Product

    From Observability to Optimization: Announcing the Opik Agent Optimizer Public Beta

    At Comet, we’re driven by a commitment to advance innovation in AI, particularly in the realm of LLM observability. Our…

  • Caroline Borders

    June 6, 2025
    LLMOps, Product

    Release Highlights: Discover Opik Agent Optimizer, Guardrails, & New Integrations

    As LLMs power more complex, multi-step agentic systems, the need for precise optimization and control is growing. In case you…

  • Team Comet

    May 23, 2025
    LLMOps, Product

    Announcing Opik’s Guardrails Beta: Moderate LLM Applications in Real-Time

    We’ve spent the past year building tools that make LLM applications more transparent, measurable, and accountable. Since launching Opik, our…

  • Caroline Borders

    April 24, 2025
    Integrations, LLMOps, Product

    Major Releases: MCP Server & Google Agent Dev Kit Support

    We’ve just rolled out two major updates in Opik, Comet’s open-source LLM evaluation platform, that make it easier than ever…

  • Claire Longo

    April 6, 2025
    Machine Learning, Thought Leadership

    How Contributing to Open Source Projects Helped Me Build My Dream Career in AI

    Six years ago, I decided to open-source my Python code for a personal project I was working on, which led…

  • Vincent Koc

    March 27, 2025
    Academic Research, Comet Community Hub

    LLM Evaluation Complexities for Non-Latin Languages

    Large language models (LLMs) have revolutionized natural language processing, yet most development and evaluation efforts have historically centered around Latin-script…

  • Abby Morgan

    March 26, 2025
    Comet Community Hub, LLMOps, Tutorials

    SelfCheckGPT for LLM Evaluation

    Detecting hallucinations in language models is challenging. There are three general approaches: The problem with many LLM-as-a-Judge techniques is that…

  • Kelsey Kinzer

    March 26, 2025
    LLMOps

    LLM Hallucination Detection in App Development

    Even ChatGPT knows it’s not always right. When prompted, “Are large language models (LLMs) always accurate?” ChatGPT says no and…

  • Caroline Borders

    March 10, 2025
    Product

    Major Releases: TypeScript for LLM Evals, Total Fidelity ML Metrics, & More

    Spring is in the air, and we’re excited to bring you four fresh releases in the Comet platform to make…

  • Leonardo Gonzalez

    March 3, 2025
    LLMOps

    LLM Evaluation Frameworks: Head-to-Head Comparison

    As teams work on complex AI agents and expand what LLM-powered applications can achieve, a variety of LLM evaluation frameworks…

  • Abby Morgan

    February 24, 2025
    Comet Community Hub, LLMOps, Tutorials

    LLM Juries for Evaluation

    Evaluating the correctness of generated responses is an inherently challenging task. LLM-as-a-Judge evaluators have gained popularity for their ability to…

©2025 Comet ML, Inc. – All Rights Reserved