Core Concepts
Rhesis evaluates whether your AI application meets the quality bar you care about. This page has two parts: the workflow (what you do and in what order) and the platform structure (what each object is and how they connect).
The workflow
A normal workflow in Rhesis looks like this:
- Connect your endpoint — link Rhesis to the system under test. See Connect your application.
- Define behaviors — decide what good looks like for your application.
- Set up metrics — assign judges to each behavior so responses can be scored automatically.
- Create tests — write, import, or generate inputs; tag each test with a behavior.
- Run a test set — execute a batch of tests against your endpoint.
- Review results — inspect individual failures, then track trends across runs.
New to Rhesis? The Getting Started guide walks through environment setup and your first run.
Platform structure
Setup
Configure before you run
What to test
Your application
System under test
Endpoint↗
How Rhesis reaches your app. Sends input, receives output for evaluation.
Learn more →Solid flow: test set → endpoint → test run. Metrics score each test result after the response returns.
Ready to start? Create a project, connect an endpoint, and generate tests to run your first evaluation.