OpenAI Evals: Log Datasets & Evaluate LLM Performance with Opik
OpenAI’s Python API is quickly becoming one of the most-downloaded Python packages. With an easy-to-use SDK and access…
OpenAI’s Python API is quickly becoming one of the most-downloaded Python packages. With an easy-to-use SDK and access…
Today, we’re thrilled to introduce Opik – an open-source, end-to-end LLM development platform that provides the observability tools you need…
In the machine learning (ML) and artificial intelligence (AI) domain, managing, tracking, and visualizing model training processes, especially at scale,…
Introduction Prompt Engineering is arguably the most critical aspect in harnessing the power of Large Language Models (LLMs) like ChatGPT. Whether…
Introduction We often rely on scalar metrics and static plots to describe and evaluate machine learning models, but these methods…
In this blog post we will leverage Comet’s Model Production Monitoring tool to monitor one of the most popular types…
Machine learning is experimental in nature. It’s more like research in a lab than it is like building traditional software.…