Pricing

Flexible Plans for All Teams

Open Source

Download, install, & run Opik your way

Free

GitHub

Full AI observability & agent testing feature set
True OSS: same codebase as the hosted versions

Includes:

Agent tracing & analysis
Test Suites & assertions
Agent Playground

Free Cloud

Perfect for individuals

Free plan

Get Started

Up to 10 team members
25k spans per month
60-day data retention

Includes:

Agent tracing & analysis
Test Suites & assertions
Agent Playground
Ollie coding harness trial

Pro Cloud

Popular

Expanded usage for teams

$19

Per month

Start Free Trial

Up to 50 team members
100k spans per month
60-day data retention

Includes everything in the Free plan plus:

Customizable monthly span limits
Customizable data retention periods

Enterprise

Security, compliance & flexible deployments

Custom

Unlimited team members
Custom usage plans

Includes everything in the Pro plan plus:

Flexible deployments
Service accounts and view-only users
Single sign-on
Dedicated support and SLAs
SOC 2, ISO 27001, ISO 9001, HIPAA, and GDPR compliance

Compare Plans

Open Source

Free Cloud

Pro Cloud

Enterprise

Observability

AI Application Tracing Trace every step of your application’s execution path, from context retrieval to model responses to tool calls
Agent Execution Graphs Visualize your agent’s execution flow
Sessions Track the full session for multi-turn conversations, complex workflows, and conversational agents
Token & Cost Tracking
Multi-Media Logging Log Images, videos, audio files, and more with your traces for future viewing in the UI
Error Surfacing View errors at the project level and use quick-access shortcuts to drill into debugging and error tracking views
User Feedback Tracking Log user feedback with flexible scores and freeform reason explanations using either the UI or the SDK
Integrations with 40+ AI Frameworks, Model Providers, & AI Gateways Opik integrates with dozens of common AI Frameworks and model providers, including LangChain, OpenAI, Google ADK, LangGraph, CrewAI, and many more
OpenTelemetry Integration Opik provides native support for OpenTelemetry (OTel), allowing you to track AI applications in Ruby, Java, and other languages
AI-Powered Debugging with OpikAssist Ask natural language questions to help root-cause issues, identify performance bottlenecks, and get debugging recommendations for your traces	—

Expert Annotation UI

Dedicated Annotation UI Opik provides a streamlined, distraction-free interface designed for efficient review by subject matter experts
Custom Feedback Schemas Create custom labels to systematically collect and analyze structured feedback
Annotation Queues Enable subject matter experts to review and annotate agent outputs with easy queues, invitations, and multi-user annotation
Annotation for Traces or Entire Conversations Annotate individual LLM responses or the entire conversation holistically, with purpose-built interfaces for both trace-level and thread-level feedback

Test Suites (Automated Evaluation)

T est Suites for Unit & Regression Testing Create simple pass/fail tests for your agent — Opik manages datasets and LLM-as-a-judge evaluation metrics for you under the hood
Test Items Input data for your agent (e.g., questions with context, user scenarios)
Assertions Write pass/fail assertions to test individual scenarios.
Item-Level Assertions Systematically test your AI application over an entire dataset, using the metrics that matter most to your use case. Use Experiments for benchmarking, A/B testing, and regression testing
Suite-Level Assertions Write global pass/fail assertions that every test case must pass.
Execution Policies Define how many times a test should run and how many runs must pass to qualify as successful.

Evaluation with Datasets & Metrics

Agent Evaluation Evaluate the performance of complex AI applications and agent systems – both individual steps and overall task completion
Conversation Evaluation Evaluate full multi-turn conversation as well as individual responses, with built-in and custom metrics
Evaluation Datasets Create and maintain collections of example inputs and expected responses using production traces or synthetic examples for systematic testing and evaluation at scale
Experiments Systematically test your AI application over an entire dataset, using the metrics that matter most to your use case. Use Experiments for benchmarking, A/B testing, and regression testing
30+ Built-in Evaluation Metrics Easy to use, configurable LLM-as-judge and heuristic metrics
Custom Metrics Create custom LLM-as-Judge, criteria-based, and python code based metrics or use metrics from external libraries
Automated Dataset Expansion Automatically expand your dataset for more robust evaluation. Use AI to generate additional synthetic examples to add to your existing dataset
Experiments Dashboard View and comment on experiment results, compare prompts and configurations, and select your best performing version for deployment
Dataset & Evaluation Integrations Ragas, Hugging Face Datasets, Gretel

Ollie Assistant & Coding Harness

Ollie Chat Interface Ollie is much more powerful than a simple chatbot — ask it to analyze traces, propose fixes, and even edit your codebase using the capabilities listed below.
Read & Analyze Traces Ollie reads full span trees including inputs, outputs, latencies, token counts, and feedback scores. It can drill into individual spans, compare traces side by side, and search across your project for patterns.
Workspace Search Traces, threads, datasets, experiments, and prompts are all queryable. Ollie can aggregate data, find outliers, and surface trends you’d otherwise need to query manually.
Test Suite Management Ollie can add traces as test cases to test suites, define assertions, trigger evaluation runs, and summarize pass/fail results.
Opik Connect Opik Connect is a powerful coding harness optimized for developing and testing your agents. Configure it to let Ollie read your agent codebase and write fixes directly to it.	—	Free trial then purchase tokens	Free trial then purchase tokens	Custom
Direct Code Editing When you connect your repository with opik connect, Ollie gains secure, read-only access to your source files. It can propose edits that you review and approve before anything changes on disk.	—	Free trial then purchase tokens	Free trial then purchase tokens	Custom
Run Your Agent With opik connect active, Ollie can rerun your agent using inputs from a failing trace to verify a fix in real time. New traces stream back into Opik automatically.	—	Free trial then purchase tokens	Free trial then purchase tokens	Custom

Agent Sandbox

Agent Playground The Agent Playground lets you run agents on your local machine while connected to Opik. Every agent execution is fully traced — you get LLM calls, latencies, token usage, and the complete execution graph, all visible in the Opik UI.
Agent Configuration Manage your prompts, model settings, and tool definitions outside your codebase. Version them, update them without redeploying, and keep a full history of every change.

Agent Optimization

Prompt Optimization Automate prompt engineering and improve your AI application’s performance with Opik’s industry-leading Agent Optimizer toolkit
Tool Optimization Optimize prompts that use external tools and the Model Context Protocol (MCP)
Native Support for 8+ Optimization Algorithms Opik has native support for Evolutionary, Few-Shot Bayesian, MetaPrompt, Hierarchical Reflective Optimizer, MIPRO, and GEPA, with more to come
Optimization Dashboard View your optimization results and performance improvements in Opik’s UI

Prompt Development

Prompt Library Collaborate on prompt development with both UI-based prompt management and seamless integration with prompts saved in code or files
Prompt Versioning Track and version every change made to a prompt, including edits, timestamps, and authors
Prompt Evaluation Test and compare prompts via either the UI or the SDK by running structured experiments over a dataset
Prompt Playground Test new prompts quickly in Opik’s Prompt playground
Prompt Building Build prompts in the UI using composable user, assistant, and system components. Add variables to easily run evaluations and side-by-side comparisons over a dataset

Reliability in Production

Production-Scale Monitoring Opik has been designed from the ground up to support high volumes of traces making it the ideal tool for monitoring your production LLM applications
Online Evaluation Score your production traces and identify any issues with your production LLM application
AI Guardrails Protect your application by detecting and blocking content from the inputs and outputs of your LLM calls. Opik supports PII, Topic, and Custom guardrails. Available for self-hosted deployment	—	—	—
Alerts Configure automated webhook notifications for important events such as trace errors, new feedback scores, or prompt changes

Platform (API, Collaboration, & Data Access)

Public API Opik’s REST API and complete Python client can be used with both the Open-Source platform and Opik Cloud
API Rate Limits	Unlimited	Unlimited	Unlimited	Unlimited
SDKs (Python, TypeScript)
MCP Server Integrate your AI-powered IDE with Opik to use natural language to manage prompts, query traces, access metrics, and more
Query and Export Opik Data Opik’s SDKs and API enable data export and support advanced queries, filtering, and batch operations
UI Data Export Export traces, datasets, experiment data, and annotations to CSV or JSON through the Opik UI
In-Platform Collaboration Collaborate with your team directly in the platform with deep-links to share-able views and commenting on traces, experiments, and prompts
Flexible Deployments Choose from multiple deployment options, including cloud-based, on-premises, and fully managed solutions to fit your infrastructure needs	—	—	—
Optional Add-On: MLOps Platform: Experiment Management, Model Registry, Dataset Management	—

Security & Compliance

Data Region	—	US	US	Custom
RBAC (Project and Organization Level)	—	—	—
View-Only Users	—	—	—
Enterprise SSO (OAuth 2.0, SAML, and LDAP protocols)	—	—	—
SSO Enforcement	—	—	—
SOC 2, ISO 27001, ISO 9001, HIPAA, and GDPR compliance	—	—	—

Support

Community (Slack; GitHub)
Self-Service (Ask AI; Opik MCP Serve)
Email Support	—	—
Dedicated Support Team & SLAs	—	—	—
Private Slack Channel	—	—	—
Service Accounts	—	—	—
Response Time SLA	—	—	—	2 hours

Monthly Usage

Spans Included A span represents a single tracked operation within your LLM pipeline, such as a model request, a function call or an agent action	Unlimited	25k	100k	Unlimited
Additional Spans	Unlimited	—	$5/100k spans	Unlimited
Span Data Retention	Unlimited	60 days	60 days	Custom
Additional Span Retention Increase span data retention from 60 days to 400 days	Unlimited	—	$29/100k spans	Custom

Questions & Answers

What is the relationship between Comet, Opik, and MLOps / Experiment Management?

Comet is our company name, and we offer two flagship product families: Opik and MLOps. Features, use cases, and pricing models differ between the two products, but they run on the same underlying platform and each comes with access to the free version of the other.

Our MLOps platform is designed for teams building and training machine learning models, with tools for experiment tracking, dataset management, model versioning, and model production monitoring.

Opik is the name of our GenAI observability and evaluation platform, and it’s available in a free open-source version along with the cloud-hosted options you see above. If you are building an application or agent that makes calls to an LLM, use Opik to log, evaluate, iterate, and improve your application’s performance.

Is there a free plan for academics?

Yes, Comet offers a free Pro plan for academic users. Researchers, students, and educators can access the full features of our Pro plan at no cost. To apply for the academic Pro plan, please sign up here and follow the instructions on the Academics page to verify your academic status.

What is the best way to get started with Opik?

The full Opik featureset is free to use for as long as you like, and it’s easy to get started. You can download and self-host the open-source version, or let us handle the infrastructure with the free cloud version. For most individuals and teams in the application testing and development phases, these free plans come with everything you need. You might want to step up to a paid plan as your application or agent usage scales up in production, and/or if you need user management and compliance-related features, but the free versions really do come with everything you need to get up and running.

If you already have an application or agent that makes LLM calls, just follow our Quickstart Guide to integrate Opik with whichever model provider or agent framework you’re already using. And if you’re not ready to log traces yet, we recommend trying out the prompt playground to experiment with user prompts and system prompts and compare models side by side, with no setup steps required.

How can I self-host Opik?

Self-hosting the open-source version of Opik is easy! Just download the code from our GitHub repo and follow the README to get started.

What are the differences between Opik Open Source, Opik Free, Opik Pro, and Opik Enterprise?

The main differences are in usage limits and data retention. All plans include unlimited team members and the full suite of LLM observability and evaluation features. Enterprise plans offer custom hosting and deployment options, advanced identity and authentication management, personalized support plans, increased regulatory compliance, and more.

What is a “span”?

A span is a unit of evaluation or tracing data in Opik, representing a structured input-output pair. This could be an LLM call, a tool call, or any tracked function. A single span can contain many tokens.

Are you an academic?

Get free access to paid tiers

Experiment tracking Systematically record, compare, and analyze your machine learning training runs to accelerate development and improve model performance.
Python visualisations Create tailored visualizations using Python to customize your Comet dashboards, enhancing data insights for your specific needs.
Model registry Model Registry centralizes and organizes machine learning models, enabling versioning, tracking, and seamless deployment across your team’s workflow.
Dataset management and versioning Track and version training datasets, linking them to experiments for complete model lineage and reproducibility.
Hyperparameter search Optimize model performance with automated hyperparameter tuning, tracking all results within the Comet platform.
Monthly limits	Fair usage policy	1500 training hours included $1/training hours	Unlimited
Data usage	100GB	500GB included $3/100GB/month	Unlimited
Team members	1	Up to 10 users	Unlimited

Tracing Monitor and analyze the performance of your language models during development and in production, capturing inputs, outputs, and metadata for each inference.
Datasets Manage and version collections of prompts and examples used for evaluating LLM application performance, ensuring consistent and reproducible assessments.
LLM-as-a-judge metrics
Production monitoring Track your LLM applications in production with our production ready LLM Evaluation platform.
Team members	Unlimited	Unlimited	Unlimited

Flexible Plans for All Teams

Open Source

Free Cloud

Pro Cloud

Enterprise

Compare Plans

Observability

Expert Annotation UI

Test Suites (Automated Evaluation)

Evaluation with Datasets & Metrics

Ollie Assistant & Coding Harness

Agent Sandbox

Agent Optimization

Prompt Development

Reliability in Production

Platform (API, Collaboration, & Data Access)

Security & Compliance

Support

Monthly Usage

Questions & Answers

Free

Pro

Enterprise

Full platform self-hosted

Compare Plans

Experiment Management

LLM Evaluation

Model Production Monitoring

Platform

Questions & Answers

Are you an academic?

Data drift detection Automatically identify data drift for your Machine Learning models in production, helping maintain model performance over time.	—	—
Feature distribution analysis Visualize and monitor input feature distributions to detect shifts and anomalies in your data.	—	—
Custom metrics Create and track tailored performance metrics using SQL queries on your logged data.	—	—
Alerts Set up custom notifications for critical events via Slack, email, and other channels.	—	—

Flexible deployments Choose from multiple deployment options, including cloud-based, on-premises, and fully managed solutions to fit your infrastructure needs.	—	—
Single sign-on Streamline authentication with enterprise-grade SSO, supporting OAuth 2.0, SAML, and LDAP protocols.	—	—
Service accounts Create dedicated machine-to-machine accounts for secure, automated data logging and API access.	—	—
View-only users Grant read-only access to team members for data exploration without the ability to modify or log new data.	—	—
Support Access tiered support options including email, Slack, phone, and dedicated team assistance based on your plan.	Community Slack	Email	Dedicated team