
Chart groups
Monitor organizes charts by the question they answer:

| Group | Questions it helps answer |
|---|---|
| Traffic & volume | Are logs arriving? Did volume spike? Did traces become more complex? |
| Performance & latency | Which requests are slow? Is tail latency moving? |
| Quality | Are evaluator scores or pass rates changing on live traffic? |
| Cost & spend | Are token usage, total cost, or per-request cost increasing? |
| Model breakdown | Which model is driving cost, latency, token usage, or efficiency? |
| Environment breakdown | Is the issue isolated to production, staging, or another environment? |
| Performance breakdown | Which prompts or workflows are slowest? |
| Tool usage | Which tools or functions are being called, and how often? |

Read percentiles and averages
For latency, cost, tokens, spans per trace, and similar metrics, Monitor can show average and percentile values.

| Value | Use it when |
|---|---|
| Avg | You want the overall direction of the system. |
| P50 | You want the typical request. |
| P95 | You care about the slower or more expensive edge that a meaningful minority of users sees. |
| P99 | You are investigating rare but high-impact outliers. |
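
To make the difference between these values concrete, here is a minimal sketch that computes them for a made-up set of latencies. The sample data and the `summarize` helper are assumptions for illustration only; Monitor's own bucketing and interpolation may differ from Python's `statistics.quantiles`.

```python
import statistics


def summarize(values):
    """Return the average, P50, P95, and P99 of a list of numeric samples."""
    # With n=100, quantiles() returns 99 cut points; index k-1 is the k-th percentile.
    cuts = statistics.quantiles(values, n=100, method="inclusive")
    return {
        "avg": statistics.fmean(values),
        "p50": cuts[49],
        "p95": cuts[94],
        "p99": cuts[98],
    }


# Hypothetical latencies in milliseconds: mostly fast, a slower edge, two outliers.
latencies_ms = [120] * 90 + [400] * 8 + [3000] * 2

summary = summarize(latencies_ms)
print(summary)
```

In this sample the average (200 ms) sits well above the typical request (P50 of 120 ms) because two outliers pull it up, while P95 and P99 expose the slower edge and the outliers directly. That is why an average alone can hide or exaggerate what most users see.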

Compare models, environments, and tools



Drill into evidence
Use chart actions to move from trend to trace:

- Open the chart for the metric that moved.
- Hover data points to confirm the exact bucket and value.
- Use View traces when available, or open Traces with the same time range.
- Filter by the relevant prompt, model, environment, tool, evaluator, status, cost, latency, or token signal.
- Inspect representative traces and spans before changing the system.
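
If you work with exported trace records outside the UI, the same drill-down can be sketched in a few lines. Everything here is hypothetical: the record fields (`id`, `timestamp`, `model`, `environment`, `latency_ms`) and the helper names are assumptions for illustration, not Monitor's export schema or API.

```python
from datetime import datetime, timedelta


def select_traces(traces, start, end, **filters):
    """Keep traces inside [start, end) whose fields match every filter."""
    selected = []
    for trace in traces:
        if not (start <= trace["timestamp"] < end):
            continue
        if all(trace.get(field) == value for field, value in filters.items()):
            selected.append(trace)
    return selected


def slowest(traces, count=3):
    """Return the `count` traces with the highest latency, slowest first."""
    return sorted(traces, key=lambda t: t["latency_ms"], reverse=True)[:count]


# Hypothetical chart bucket: one hour starting at 12:00.
bucket_start = datetime(2024, 1, 1, 12, 0)
bucket_end = bucket_start + timedelta(hours=1)

# Hypothetical exported records.
traces = [
    {"id": "t1", "timestamp": datetime(2024, 1, 1, 12, 5), "model": "model-a",
     "environment": "production", "latency_ms": 900},
    {"id": "t2", "timestamp": datetime(2024, 1, 1, 12, 20), "model": "model-a",
     "environment": "production", "latency_ms": 4200},
    {"id": "t3", "timestamp": datetime(2024, 1, 1, 12, 30), "model": "model-b",
     "environment": "production", "latency_ms": 5000},
    {"id": "t4", "timestamp": datetime(2024, 1, 1, 13, 10), "model": "model-a",
     "environment": "production", "latency_ms": 8000},
    {"id": "t5", "timestamp": datetime(2024, 1, 1, 12, 40), "model": "model-a",
     "environment": "staging", "latency_ms": 7000},
]

# Same time range as the bucket, narrowed to the signal that moved.
matches = select_traces(
    traces, bucket_start, bucket_end, model="model-a", environment="production"
)
representative = slowest(matches, count=1)
print([t["id"] for t in matches], representative[0]["id"])
```

The point of the sketch is the order of operations: fix the time range first, narrow by the dimension that moved, and only then pick representative traces to inspect.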

Common reads

| Pattern | Likely next step |
|---|---|
| Eval score drops while latency and cost are stable | Inspect evaluated spans, then update evaluators, datasets, or Improve context. |
| Cost rises with input tokens | Review prompt length, retrieval payloads, conversation context, or tool output size. |
| P95 latency rises for one model | Compare model performance and check provider/runtime behavior before editing prompts. |
| Tool call counts shift | Inspect tool spans and Behavior patterns before assuming the final prompt is wrong. |
| A chart points to a single prompt | Open the trace evidence and decide whether to build coverage or run Improve. |

- Analyze log traces: Open the traces behind a chart movement.
- Inspect a trace: Read tree view, waterfall view, and span details.
- Setup continuous evaluations: Add quality scores to production traffic.
- Use logs to improve prompts: Turn a chart signal into a reviewed improvement loop.