Opik Dashboard:
- Added a new
Feedback modalin the UI so you can easily provide feedback on any parts of the platform.
SDK:
- Released new evaluation metric: GEval - This LLM as a Judge metric is task agnostic and can be used to evaluate any LLM call based on your own custom evaluation criteria.
- Allow users to specify the path to the Opik configuration file using the
OPIK_CONFIG_PATHenvironment variable, read more about it in the Python SDK Configuration guide. - You can now configure the
project_nameas part of theevaluatemethod so that traces are logged to a specific project instead of the default one. - Added a new
Opik.search_tracesmethod to search for traces, this includes support for a search string to return only specific traces. - Enforce structured outputs for LLM as a Judge metrics so that they are more reliable (they will no longer fail when decoding the LLM response).