Prerequisites for Running
Before you can execute evaluations, ensure you have:
- At least one evaluator: Add any evaluator (LLM-as-a-Judge, Cost, Latency, etc.) from the available options
- Connected dataset: Link a dataset that contains your test cases and variables
- Matching columns: Dataset columns must match your prompt variable names exactly
- Minimum 1 row: The dataset needs at least one row of data to evaluate against
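
The checklist above can be sketched as a local pre-flight check. This is only an illustration and not part of the product: the function name `check_prerequisites`, the dataset represented as a list of dicts, and the prompt variables passed in as a set of names are all assumptions made for the example. It also assumes that "match exactly" means a case-sensitive comparison.

```python
def check_prerequisites(evaluators, dataset_rows, prompt_variables):
    """Return a list of problems; an empty list means the run can start.

    Hypothetical helper (not a product API):
      evaluators       -- list of evaluator names, e.g. ["Latency"]
      dataset_rows     -- list of dicts, one dict per dataset row
      prompt_variables -- set of variable names used in the prompt
    """
    problems = []

    # Prerequisite: at least one evaluator
    if not evaluators:
        problems.append("no evaluator added")

    # Prerequisite: at least one row of data
    if not dataset_rows:
        problems.append("dataset has no rows")
        return problems  # columns cannot be checked without a row

    # Prerequisite: every prompt variable has a column with the same name.
    # Assumption: the comparison is case-sensitive.
    columns = set(dataset_rows[0].keys())
    missing = sorted(set(prompt_variables) - columns)
    if missing:
        problems.append(
            "prompt variables without a matching column: " + ", ".join(missing)
        )

    return problems


rows = [{"question": "What is 2 + 2?", "expected": "4"}]

# "Expected" differs from the column "expected" only by case, so it is reported.
print(check_prerequisites(["Latency"], rows, {"question", "Expected"}))
```

Returning a list of problems rather than raising on the first one lets all unmet prerequisites be reported in a single pass.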
Running Your Evaluation
Once your evaluator and dataset are ready, click the green Evaluate button to start your evaluation run.

Background Processing
Evaluations run in the background automatically. For larger datasets, you can safely leave the page and return later to check the results.