💡 Product Enhancements
- Ability to upload CSV datasets directly through the user interface
- Add experiment cost tracking to the Experiments table
- Add hinters and helpers for onboarding new users across the platform
- Added “LLM calls count” to the traces table
- Pretty formatting for complex agentic threads
- Preview support for MP3 files in the frontend
🛠 SDKs and API Enhancements
- Good news for JS developers! We’ve released experiments support for the JS SDK (official docs coming very soon)
- New Experiments Bulk API: a new API has been introduced for logging Experiments in bulk.
- Rate Limiting improvements both in the API and the SDK
🔌 Integrations
- Support for OpenAI o3-mini and Groq models added to the Playground
- OpenAI Agents: context awareness implemented and robustness improved. Improve thread handling
- Google ADK: added support for multi-agent integration
- LiteLLM: token and cost tracking added for SDK calls. Integration now compatible with opik.configure(…)
👉 See full commit log on GitHub
Releases: 1.7.27
, 1.7.28
, 1.7.29
, 1.7.30
, 1.7.31
✨ New Features
-
Opik Agent Optimizer: A comprehensive toolkit designed to enhance the performance and efficiency of your Large Language Model (LLM) applications. Read more
-
Opik Guardrails: Guardrails help you protect your application from risks inherent in LLMs. Use them to check the inputs and outputs of your LLM calls, and detect issues like off-topic answers or leaking sensitive information. Read more
💡 Product Enhancements
- New Prompt Selector in Playground — Choose existing prompts from your Prompt Library to streamline your testing workflows.
- Improved “Pretty Format” for Agents — Enhanced readability for complex threads in the UI.
🔌 Integrations
- Vertex AI (Gemini) — Offline and online evaluation support integrated directly into Opik. Also available now in the Playground.
- OpenAI Integration in the JS/TS SDK
- AWS Strands Agents
- Agno Framework
- Google ADK Multi-agent support
🛠 SDKs and API Enhancements
- OpenAI LLM advanced configurations — Support for custom headers and base URLs.
- Span Timing Precision — Time resolution improved to microseconds for accurate monitoring.
- Better Error Messaging — More descriptive errors for SDK validation and runtime failures.
- Stream-based Tracing and Enhanced Streaming support
👉 See full commit log on GitHub
Releases: 1.7.19
, 1.7.20
, 1.7.21
, 1.7.22
, 1.7.23
, 1.7.24
, 1.7.25
, 1.7.26
Opik Dashboard:
Python and JS / TS SDK:
- Added support for streaming in ADK integration
- Add cost tracking for the ADK integration
- Add support for OpenAI
responses.parse
- Reduce the memory and CPU overhead of the Python SDK through various performance optimizations
Deployments:
- Updated port mapping when using
opik.sh
- Fixed persistence when using Docker compose deployments
Release: 1.7.15
, 1.7.16
, 1.7.17
, 1.7.18
Opik Dashboard:
- Updated the experiment page charts to better handle nulls, all metric values are now displayed.
- Added lazy loading for traces and span sidebar to better handle very large traces.
- Added support for trace and span attachments, you can now log pdf, video and audio files to your traces.

- Improved performance of some Experiment endpoints
Python and JS / TS SDK:
- Updated DSPy integration following latest DSPy release
- New Autogen integration based on Opik’s OpenTelemetry endpoints
- Added compression to request payload
Release: 1.7.12
, 1.7.13
, 1.7.14
Opik Dashboard:
- Released Python code metrics for online evaluations for both Opik Cloud and self-hosted deployments. This allows you to define python functions to evaluate your traces in production.

Python and JS / TS SDK:
- Fixed LLM as a judge metrics so they return an error rather than a score of 0.5 if the LLM returns a score that wasn’t in the range 0 to 1.
Deployments:
- Updated Dockerfiles to ensure all containers run as non root users.
Release: 1.7.11
Opik Dashboard:
- Updated the feedback scores UI in the experiment page to make it easier to annotate experiment results.
- Fixed an issue with base64 encoded images in the experiment sidebar.
- Improved the loading speeds of the traces table and traces sidebar for traces that have very large payloads (25MB+).
Python and JS / TS SDK:
- Improved the robustness of LLM as a Judge metrics with better parsing.
- Fix usage tracking for Anthropic models hosted on VertexAI.
- When using LiteLLM, we fallback to using the LiteLLM cost if no model provider or model is specified.
- Added support for
thread_id
in the LangGraph integration.
Releases: 1.7.4
, 1.7.5
, 1.7.6
. 1.7.7
and 1.7.8
.
Opik Dashboard:
- Added search to codeblocks in the input and output fields.
- Added sorting on feedback scores in the traces and spans tables:
- Added sorting on feedback scores in the experiments table.
Python and JS / TS SDK:
- Released a new integration with Google ADK framework.
- Cleanup up usage information by removing it from metadata field if it’s already
part of the
Usage
field. - Added support for
Rouge
metric - Thanks @rohithmsr ! - Updated the LangChain callback
OpikTracer()
to log the data in a structured way rather than as raw text. This is expecially useful when using LangGraph. - Updated the LangChainJS integration with additional examples and small fixes.
- Updated the OpenAI integration to support the Responses API.
- Introduced a new AggregatedMetric metric that can be used to compute aggregations of metrics in experiments.
- Added logging for LLamaIndex streaming methods.
- Added a new
text
property on the Opik.Prompt object.
Releases: 1.6.14
, 1.7.0
, 1.7.1
, 1.7.2
Opik Dashboard:
- Render markdown in experiment output sidebar
- The preference between pretty / JSON and YAML views are now saved
- We now hide image base64 strings in the traces sidebar to make it easier to read
Python and JS / TS SDK:
- Released a new integration with Flowise AI
- LangChain JS integration
- Added support for jinja2 prompts
General
- Introduced a new
.opik.sh
installation script
Opik Dashboard:
- You can now view the number of spans for each trace in the traces table
- Add the option to search spans from the traces sidebar
- Improved performance of the traces table
Python and JS / TS SDK:
- Fixed issue related to log_probs in Geval metric
- Unknown fields are no longer excluded when using the OpenTelemetry integration
Opik Dashboard:
-
We have revamped the traces table, the header row is now sticky at the top of the page when scrolling
-
As part of this revamp, we also made rows clickable to make it easier to open the traces sidebar
-
Added visualizations in the experiment comparison page to help you analyze your experiments
-
You can now filter traces by empty feedback scores in the traces table
-
Added support for Gemini options in the playground
-
Updated the experiment creation code
-
Many performance improvements
Python and JS / TS SDK:
- Add support for Anthropic cost tracking when using the LangChain integration
- Add support for images in google.genai calls
- LangFlow integration has now been merged