View
Access and analyze your evaluation reports to understand prompt performance.
After completing evaluations, view detailed reports to see how your prompts performed. Access results by clicking the See results button from your evaluation run.
Inspect a Run
Once the evaluation is complete, you can view the completed set of evaluations and even inspect them.
To inspect a run,
Click on the Evaluation focus mode
- The grid will expand to show all the results for that evaluation.
- The grid will show you which rows in the dataset failed and passed.
Click on the result to start inspecting
- A side window opens up. This will allow you to filter, view completions, variables, and more.
Opening a selected row in the playground
Let’s say you have inspected the entire row and want to open a particular row in the playground. In order to do that,
Click on the particular row
Click on “Open in playground”. This will open the prompt in the Playground
Filtering the Evaluation
You can also filter the evaluation, and it is easy.
Narrow down the results
Use the Filter dropdown to narrow results by:
- Status (passed/failed)
- Reason text
- Response content
- Variables used
- Tokens, Cost, or Latency metrics
Type search query
Type in the search field to find specific responses or patterns across your evaluation data.