Playbooks

Production AI Failure Modes

Every production AI system is unique in detail: different stacks, different corpora, different traffic dsitributions, different cost constraints.

The failure modes it experiences aren't.

A small set of patterns recurs across teams, products, and verticals, each one with the same symptom, the same mechanism underneath, the same range of fixes. This catalog gives them names — written by the team behind the world's best embedding and reranker models.

Looking for atomic definitions instead? See /concepts/ — the vocabulary every pattern is built from.

The catalog 10 patterns

About the catalog

Each entry names a class of production failure we've seen often enough to give it a name. The point is shared vocabulary: "we've got eval drift," "that's threshold-by-feel," "single-LLM overspend" — the kind of thing a senior engineer should be able to say in a meeting and have the team know exactly what's meant. If a pattern you keep hitting isn't in the catalog yet, tell us .

The best AI teams build with ZeroEntropy models

Book Demo View docs