-
G-Eval for LLM Evaluation
LLM-as-a-judge evaluators have gained widespread adoption due to their flexibility, scalability, and close alignment with human judgment. They excel at…
-
I Replaced 1000 Lines of Polling Code with 50 Lines of CDC Magic
Welcome to Lesson 3 of 12 in our free course series, LLM Twin: Building Your Production-Ready AI Replica. You’ll learn…
-
Organize Your Prompt Engineering with CometLLM
Introduction Prompt Engineering is arguably the most critical aspect in harnessing the power of Large Language Models (LLMs) like ChatGPT. Whether…
-
Image Inpainting for SDXL 1.0 Base Model + Refiner
In this article, we’ll compare the results of SDXL 1.0 with its predecessor, Stable Diffusion 2.0. We’ll also take a…
-
Optimized Deep Learning Pipelines
A Deep Dive into TFRecords and Protobufs Learn how to optimize your deep learning pipelines using TFRecords and Google’s…
-
Explainable AI: Visualizing Attention in Transformers
In this article we explore one of the most popular tools for visualizing the core distinguishing feature of transformer architectures:…
-
SAM + Stable Diffusion for Text-to-Image Inpainting
In this article, we’ll leverage the power of SAM, the first foundational model for computer vision, along with Stable Diffusion,…
-
Debugging Image Classifiers With Confusion Matrices
Introduction We often rely on scalar metrics and static plots to describe and evaluate machine learning models, but these methods…
-
Compare Object Detection Models From TorchVision
Introduction Object detection is one of the most popular applications of machine learning for computer vision. A detection model predicts…