Playground
An interactive, real-time interface for testing conversational endpoints and converting live conversations directly into tests.
Overview
The Playground is a built-in testing environment that lets you interact with your configured endpoints in real time. Unlike running a full test set, the Playground is designed for exploration, debugging, and rapid test creation from real conversations.
Key Features
Real-Time Testing: Send messages to your endpoint and see responses immediately via WebSocket communication. There is no need to configure a separate testing environment.
Split-View Interface: The Playground uses a split-view layout—your conversation on one side and the response with trace details on the other—so you can see exactly what your application returned and how it got there.
Trace Linking: Every Playground session generates a trace that you can inspect directly. This makes the Playground a powerful debugging tool: send a message, see the response, and immediately drill into the execution trace.
Test Creation from Conversations: Convert any Playground conversation into a formal test with a single click. Rhesis uses an LLM to pre-fill the test fields (expected behavior, category, tags) from the conversation content, reducing manual entry.
Single-Turn and Multi-Turn Support: Create both single-turn tests (one message/response pair) and multi-turn tests (full conversation sequences) from Playground sessions.
How to Use the Playground
- Navigate to your endpoint in the platform
- Open the Playground tab
- Select the endpoint you want to test
- Type a message and press Send
- Review the response and the trace
- Optionally, click "Create Test" to save the conversation as a test
WebSocket Communication
The Playground uses WebSocket connections for real-time communication, enabling low-latency responses from your endpoint. The connection includes an automatic retry mechanism to handle intermittent network issues gracefully.
Best Practices
- Use the Playground before creating a formal test set to understand how your endpoint behaves in practice
- Create tests from Playground conversations to capture real examples of both correct and incorrect responses
- Review the linked execution trace for every Playground session to diagnose unexpected outputs
- Switch between multiple endpoints in the Playground to compare behavior across environments or model versions