Tests
Individual test cases that evaluate specific aspects of your AI system’s performance.
Understanding Tests
Tests are individual test cases that check specific inputs and expected outputs for your AI system. Each test includes an input prompt, expected response criteria, assigned metrics for evaluation, and execution history.
Test Types
Single-Turn Tests
Single-turn tests evaluate individual interactions:
- Single prompt and response
- Immediate evaluation
- Quick validation of specific behaviors
Multi-Turn Tests
Multi-turn tests simulate conversations:
- Multiple turns of interaction
- Conversational context maintained
- Evaluation of complete workflows
- Goal achievement tracking
Creating Tests
Manual Creation
Create tests manually for specific scenarios:
- Navigate to the Tests page
- Click “Create Test”
- Select test type (single-turn or multi-turn)
- Enter test details:
- Name: Descriptive test name
- Input: Initial prompt or conversation starter
- Expected Behavior: What should happen
- Metrics: Select evaluation metrics
- Save the test
Automated Generation
Generate tests automatically based on your requirements:
- Go to the Generation page
- Select generation method:
- From behaviors
- From knowledge sources
- From scenarios
- Configure generation parameters
- Review and refine generated tests
- Add to test sets
See Generation for detailed generation options.
Running Tests
Execute individual tests:
- Open a test from the Tests page
- Click “Run Test”
- Select the endpoint to test against
- Monitor execution progress
- Review results
For batch execution, add tests to Test Sets.
Viewing Test Results
After execution, review:
- Actual response from your AI system
- Metric scores and evaluation details
- Pass/fail status
- Execution time and metadata
- Historical trends
Managing Tests
Editing Tests
Modify existing tests:
- Open the test
- Click “Edit”
- Update fields as needed
- Save changes
Organizing Tests
Keep tests organized:
- Add tests to test sets
- Tag tests by feature or category
- Link tests to knowledge sources
- Associate with behaviors
Deleting Tests
Remove obsolete tests:
- Select test(s) to delete
- Click delete action
- Confirm deletion
Note: Tests in test sets will be removed from those sets.