Opik by Comet

The Opik platform allows you to log, view, and evaluate your LLM traces during both development and production. Using the platform and our LLM as a Judge evaluators, you can identify and fix issues in your LLM application.

Overview

Development

During development, you can use the platform to log, view and debug your LLM traces:

  1. Log traces using:

    a. One of our integrations.

    b. The @track decorator for Python. Learn more in the Logging Traces guide.

  2. Annotate and label traces through the SDK or the UI.
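As a minimal sketch of option (b), the snippet below decorates two functions with `@track` from the Opik Python SDK. The function names and their bodies are hypothetical placeholders, not part of Opik, and the `ImportError` fallback is an assumption added only so the sketch still runs where the SDK is not installed. With the SDK installed and configured, each decorated call should be logged as a trace; the Logging Traces guide is the authoritative reference for the decorator's options.

```python
try:
    from opik import track  # decorator provided by the Opik Python SDK
except ImportError:
    # Assumption for this sketch only: fall back to a no-op decorator
    # so the example can run without the SDK installed.
    def track(func):
        return func


@track
def retrieve_context(question: str) -> list[str]:
    # Hypothetical retrieval step; a real application would query a
    # search index or vector store here.
    return [f"Placeholder document about: {question}"]


@track
def answer_question(question: str) -> str:
    # Hypothetical entry point; it calls another tracked function so
    # that both steps are candidates for logging.
    context = retrieve_context(question)
    return f"Answer based on {len(context)} document(s)."


if __name__ == "__main__":
    print(answer_question("What is Opik?"))
```

The decorator does not change what the functions return, so it can usually be added to existing code without altering its behavior.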

Evaluation and Testing

Evaluating the output of your LLM calls is critical to ensuring that your application works as expected, but it can be challenging. Using the Opik platform, you can:

  1. Use one of our LLM as a Judge evaluators or Heuristic evaluators to score your traces and LLM calls.
  2. Store evaluation datasets in the platform and run evaluations.
  3. Use our pytest integration to track unit test results and compare results between runs.
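To illustrate what a heuristic evaluator does conceptually, here is a plain-Python sketch that scores outputs against a small dataset. It deliberately does not use the Opik SDK: `Score`, `contains_score`, `evaluate`, `toy_task`, and the dataset are all hypothetical names introduced for this example, and the platform's own evaluators and dataset storage may work differently.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Score:
    name: str
    value: float


def contains_score(output: str, reference: str) -> Score:
    # Heuristic check: 1.0 if the reference string appears in the
    # output (case-insensitive), otherwise 0.0.
    found = reference.lower() in output.lower()
    return Score(name="contains", value=1.0 if found else 0.0)


def evaluate(
    dataset: list[dict],
    task: Callable[[str], str],
    metric: Callable[[str, str], Score],
) -> float:
    # Run the task on every dataset item and average the metric values.
    scores = [metric(task(item["input"]), item["expected"]).value for item in dataset]
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical evaluation dataset and a stand-in for an LLM call.
dataset = [
    {"input": "capital of France", "expected": "Paris"},
    {"input": "capital of Japan", "expected": "Tokyo"},
]


def toy_task(question: str) -> str:
    return "The capital is Paris."


if __name__ == "__main__":
    print(evaluate(dataset, toy_task, contains_score))
```

A heuristic metric like this is deterministic and cheap to run, whereas an LLM as a Judge evaluator delegates the scoring decision to a model.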

Getting Started

Comet provides a managed Cloud offering for Opik. To get started, simply create an account.

You can also run Opik locally using our local installer. If you are looking for a more production-ready deployment, you can use our Kubernetes deployment option.