- **Multimodal LLM Evaluation: A Developer’s Guide to Multimodal Language Models**
  Production teams processing billions of product listings, such as Shopify, report that multimodal LLMs analyzing product images alongside metadata can…
- **AI Agent Evaluation: Building Reliable Systems Beyond Simple Testing**
  Your customer service agent routes 2,000 queries daily. During testing, it resolved 85 percent of requests correctly. Three weeks after…
- **LLM Parameter Optimization: Stop Leaving Agent Performance on the Table**
  If you search for “LLM parameter optimization,” you’ll find guides on tuning learning rates, batch sizes, and layer configurations. But…
- **Prompt Learning: Using Natural Language to Optimize LLM Systems**
  Your customers expect better and more consistent results than your AI agent can deliver. You manually tweak a prompt, test…
- **Chain-of-Thought Prompting: A Guide for LLM Applications and Agents**
  When Google researchers asked GPT-3 to solve grade-school math problems, the model answered 17.9 percent of the problems correctly. When…
- **Prompt Tuning: Parameter-Efficient Optimization for Agentic AI Systems**
  You’ve built an agentic system that coordinates retrieval, reasoning, and response generation across multiple specialized tasks. Now you need to…
- **MIPRO: The Optimizer That Brought Science to Prompt Engineering**
  You know the routine: Write your first prompt, and then spend hours manually tweaking prompts, testing variations, and documenting what…
- **GEPA: Why Reflection-Based Optimization Is Replacing Reinforcement Learning for AI Agents**
  Your multi-hop reasoning agent fails 55 percent of the time. You spend three days tweaking prompts by adjusting the phrasing,…