Skip to Content
DocsArchitectOverview

Rhesis Architect

Architect (also known as Telemachus) is a conversational agent that designs, builds, and runs test suites for your AI endpoints. Describe what you want to test — it explores your endpoint, proposes a plan, waits for your approval, then creates everything on the platform.

Rhesis AI Architect chat

What you can do

  • Explore an endpoint to understand its domain, capabilities, and refusal behavior
  • Design a complete test suite with behaviors, metrics, and test sets — reusing what already exists on the platform
  • Generate AI-written tests or import your own verbatim
  • Ground tests in your docs by telling Architect to use a specific knowledge source
  • Execute tests against any endpoint and get a structured pass/fail summary
  • Compare two test runs to spot regressions and improvements
  • Analyze any existing test run with behavior and metric breakdowns
  • Do direct operations — update a metric, link a behavior, list test sets — without going through the full planning flow

Architect refers to all entities by name. You never need to supply an ID.

Prerequisites

  • At least one endpoint configured in your organization
  • An active Rhesis account with organization access

Get started

Open Testing → Architect in the sidebar. Type what you want to test, or pick one of the suggested prompts on the welcome screen.

"Test my travel chatbot — focus on safety and accuracy."

Architect responds in real time. You’ll see the current phase, streaming tool activity, and a plan panel that tracks progress as work happens.

Pages in this section

PageWhat it covers
WorkflowThe five phases and what you see in each one
Endpoint ExplorationQuick vs Comprehensive modes and what each reveals
Planning Test SuitesPlan structure, reuse logic, knowledge sources
Running and AnalyzingTest execution, result analysis, run comparison
Chat FeaturesFile attachments, auto-approve, plan panel, mode chip
ScenariosRequest cookbook for common workflows

For developers

The Architect system spans four layers, each documented in the contributing section: