Test Runs & Results
Test runs track the execution of tests against endpoints. Each run produces test results containing the endpoint’s response, evaluation metrics, and review status.
TestRun
A TestRun represents a batch execution of tests. When you execute a test set against an endpoint, a test run is created to track the execution.
Properties
| Property | Type | Description |
|---|---|---|
| id | str | Unique identifier |
| name | str | Display name |
| test_configuration_id | str | Associated test configuration |
| status_id | str | Current execution status |
| user_id | str | User who initiated the run |
| organization_id | str | Organization ID |
| owner_id | str | Owner of the test run |
| assignee_id | str | Assigned reviewer |
| attributes | dict | Custom attributes |
Fetching Test Runs
Getting Test Results
Retrieve all results for a test run:
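The SDK call for this step is not shown in this section. As a minimal sketch, assuming the results have already been loaded as plain dictionaries keyed by the TestResult property names, selecting the results of one run comes down to matching on `test_run_id`. The helper `results_for_run` and the sample IDs are hypothetical and are not part of the SDK.

```python
from typing import Any


def results_for_run(
    results: list[dict[str, Any]], test_run_id: str
) -> list[dict[str, Any]]:
    """Return the result records whose test_run_id matches the given run."""
    return [r for r in results if r.get("test_run_id") == test_run_id]


# Hypothetical sample records; only the keys come from the TestResult table.
sample_results = [
    {"id": "res-1", "test_run_id": "run-1", "test_id": "t-1"},
    {"id": "res-2", "test_run_id": "run-2", "test_id": "t-2"},
    {"id": "res-3", "test_run_id": "run-1", "test_id": "t-3"},
]

run_results = results_for_run(sample_results, "run-1")
print([r["id"] for r in run_results])
```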
TestResult
A TestResult contains the output and evaluation for a single test execution.
Properties
| Property | Type | Description |
|---|---|---|
| id | str | Unique identifier |
| test_run_id | str | Parent test run |
| test_id | str | Executed test |
| prompt_id | str | Test prompt |
| status_id | str | Result status |
| status | Status | Status object with name/description |
| test_output | dict | Endpoint response data |
| test_metrics | dict | Evaluation metric scores |
| test_reviews | dict | Human review data |
| test_configuration_id | str | Test configuration used |
Fetching Test Results
Working with Results
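One common way to work with a set of results is to tally them by status and note which ones already carry human review data. The sketch below assumes each result is available as a plain dictionary using the property names from the TestResult table, with `status` represented as a dictionary holding `name` and `description`. The function `summarize_results`, the status names, and the sample values are hypothetical, and the internal layout of `test_metrics` and `test_reviews` is not assumed beyond being a dict.

```python
from collections import Counter
from typing import Any


def summarize_results(results: list[dict[str, Any]]) -> dict[str, Any]:
    """Count results per status name and list the IDs that have review data."""
    counts: Counter = Counter()
    for result in results:
        status = result.get("status") or {}
        counts[status.get("name", "unknown")] += 1
    # A result counts as reviewed when its test_reviews dict is non-empty.
    reviewed_ids = [r["id"] for r in results if r.get("test_reviews")]
    return {"by_status": dict(counts), "reviewed_ids": reviewed_ids}


# Hypothetical sample data for illustration only.
sample_results = [
    {"id": "res-1", "status": {"name": "Pass"}, "test_reviews": {}},
    {"id": "res-2", "status": {"name": "Fail"}, "test_reviews": {"note": "checked"}},
    {"id": "res-3", "status": {"name": "Pass"}, "test_reviews": None},
    {"id": "res-4", "status": None, "test_reviews": {}},
]

summary = summarize_results(sample_results)
print(summary)
```

Results with a missing status are grouped under `"unknown"` rather than dropped, so the counts always add up to the number of results.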
TestConfiguration
A TestConfiguration defines the settings for test execution, linking test sets to endpoints with specific parameters.
Properties
| Property | Type | Description |
|---|---|---|
| id | str | Unique identifier |
| endpoint_id | str | Target endpoint (required) |
| test_set_id | str | Test set to execute |
| category_id | str | Filter by category |
| topic_id | str | Filter by topic |
| prompt_id | str | Specific prompt |
| use_case_id | str | Associated use case |
| status_id | str | Configuration status |
| attributes | dict | Custom settings |
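
The table marks `endpoint_id` as the only required property. A minimal sketch of that shape, using a stand-in dataclass rather than the SDK's own TestConfiguration class, could look like the following. The class name `ConfigSketch`, the validation in `__post_init__`, and the sample IDs are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class ConfigSketch:
    """Stand-in mirroring the TestConfiguration properties listed above."""

    endpoint_id: str  # the only property marked as required
    id: Optional[str] = None
    test_set_id: Optional[str] = None
    category_id: Optional[str] = None
    topic_id: Optional[str] = None
    prompt_id: Optional[str] = None
    use_case_id: Optional[str] = None
    status_id: Optional[str] = None
    attributes: dict[str, Any] = field(default_factory=dict)

    def __post_init__(self) -> None:
        # Reject an empty endpoint_id, since the configuration needs a target.
        if not self.endpoint_id:
            raise ValueError("endpoint_id is required")


# Hypothetical IDs: link a test set to an endpoint.
config = ConfigSketch(endpoint_id="endpoint-123", test_set_id="set-456")
print(config.endpoint_id, config.test_set_id)
```

Every other field defaults to `None` (or an empty dict for `attributes`), so a configuration can be narrowed by category, topic, or prompt only when needed.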