The name of the agent configuration used.
The ID of the benchmark run.
Number of scenarios that completed successfully.
Number of scenarios that failed.
Number of scenarios that timed out.
Detailed outcomes for each scenario in this benchmark run.
Optional `average_`: Average score across all completed scenarios (0.0 to 1.0).
Optional `duration_`: Total duration of the benchmark run in milliseconds.
Optional `model_`: The model name used by the agent.
Outcome data for a single benchmark run within a benchmark job, representing results for one agent configuration.
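As a rough illustration of how the fields described here might be consumed, the sketch below models one run outcome and formats a one-line summary. Every identifier in it (`RunOutcomeSketch`, `summarizeRun`, and all field names) is hypothetical and chosen only for readability; the real property names are not shown in this section, and the per-scenario outcomes list is omitted because its element type is not described here.

```typescript
// Hypothetical shape: field names are illustrative assumptions,
// not the actual property names of the documented type.
interface RunOutcomeSketch {
  agentName: string;       // name of the agent configuration used
  benchmarkRunId: string;  // ID of the benchmark run
  completed: number;       // scenarios that completed successfully
  failed: number;          // scenarios that failed
  timedOut: number;        // scenarios that timed out
  averageScore?: number;   // optional, 0.0 to 1.0
  durationMs?: number;     // optional, milliseconds
  modelName?: string;      // optional model name used by the agent
}

// Build a short human-readable summary, tolerating missing optional fields.
function summarizeRun(run: RunOutcomeSketch): string {
  const total = run.completed + run.failed + run.timedOut;
  const score =
    run.averageScore !== undefined ? run.averageScore.toFixed(2) : "n/a";
  const duration =
    run.durationMs !== undefined
      ? (run.durationMs / 1000).toFixed(1) + "s"
      : "n/a";
  const model = run.modelName ?? "unknown model";
  return `${run.agentName} (${model}): ${run.completed}/${total} completed, avg score ${score}, duration ${duration}`;
}
```

Because the last three fields are optional, the helper checks each for `undefined` rather than assuming it is present; a run with no completed scenarios may reasonably carry no average score at all.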