Compare Evaluation Runs
GET/api/v1/evaluations/runs/:run_id/compare/:baseline_run_id
Compares two evaluation runs, showing run-level metric deltas and per-query metric breakdowns. Both runs must belong to the same evaluation and have a completed or locked status.
Request
Responses
- 200
- 400
- 404
OK
Bad Request
Not Found