Use custom metrics when built-in metrics are not enough (domain-specific scoring, precise safety checks, unique multimodal checks). Start with the core Opik evaluation docs so you know what already exists:
reason on ScoreResult for better dashboards.MultiMetricObjective when you need trade-offs./evaluation/metrics instead of reinventing classic scores. You can compose heuristics with custom logic as shown above.