Statistical monitoring, baseline comparisons, and clear anomaly signals—so silent degradation in prompts, tools, or upstream data never becomes a silent outage.
Track KPIs and custom metrics continuously with distribution-aware methods—not brittle static thresholds alone.
Compare traffic to historical or rolling baselines per workflow and agent. Know whether a spike is noise or a real departure from last week.
Separate drift from one-off spikes with composite rules and severity. Route the right signal to the right team.
Pick the statistic that fits each metric—mean shift, tail risk, or relative change—and document it for auditors and engineers alike.

Model simple spikes, sustained shifts, and multi-signal conditions without bespoke monitoring jobs for every agent.

Drift and anomalies only help if the right people see them. Route policy-sensitive signals to compliance channels and engineering to Slack.
