When to add a span
Add a span to a dataset when:- The model answered incorrectly.
- The model handled a difficult case well and you want a golden example.
- A user request should become a regression case.
- A tool call, retrieval result, or missing context changed the answer.
- An evaluator failure needs a durable example.
- A Behavior investigation needs representative cases.
- An Improve cycle should preserve the evidence after review.
Dataset compatibility
Adaline can add selected model spans to datasets when the dataset can accept the span variables. A dataset is valid for selected spans when:- It belongs to the same project.
- It is empty, with no rows and no columns, so Adaline can bootstrap columns.
- Or it already contains columns for all variables present in the selected model span.
What gets copied
For model spans, Adaline can copy:- Prompt variable values.
- The model response, when a response column exists or the dataset is being bootstrapped.
- Text values.
- Image and PDF references when represented as URLs, hosted paths, or data payloads.
- Complex values serialized as JSON when needed.
Add spans to a dataset
Find the span
Use Traces, filters, or Deep Search to find representative production evidence.
Open the trace
Inspect the trace and select the model span that contains the useful input and response.
Choose the dataset action
Add the selected span to an existing compatible dataset, or use an empty dataset that Adaline can bootstrap.
Clean up after copying
After adding production spans:- Remove secrets, private identifiers, or unnecessary customer data.
- Add labels that explain why the row matters.
- Add expected output or pass criteria.
- Link the row to a Behavior or incident when helpful.
- Move noisy exploratory rows out of release-gate datasets.
Use rows in the improvement loop
Trace-derived rows make Improve reviews safer:- A production issue appears in Monitor, Traces, or Behaviors.
- Representative spans become dataset rows.
- Evaluators define the expected behavior.
- Improve proposes candidate prompt changes.
- Reviewers approve only when the candidate fixes the issue without regressing the dataset.
Avoid common mistakes
| Mistake | Better approach |
|---|---|
| Adding a full noisy trace when one span matters | Add the representative model span and keep context concise. |
| Adding rows without expected behavior | Add labels, expected output, or evaluators before relying on the row. |
| Mixing every incident in one dataset | Split by behavior, workflow, or release risk. |
| Keeping sensitive production data | Sanitize the row or do not store it. |
| Treating copied rows as automatically correct | Review and curate each row before using it as a gate. |