Categorical Scoring
Back to GlossaryTesting
A metric scoring type that classifies responses into predefined categories such as excellent, good, fair, or poor.
Also known as: categorical score
Overview
Categorical scoring classifies responses into predefined categories, making evaluation results easy to interpret and act upon.
Common Category Sets
Quality Levels:
Safety Classifications:
Accuracy Tiers:
Using Categories
Categories should be clear, mutually exclusive, and cover all possible outcomes.
Benefits
Categorical scoring provides interpretability through clear, meaningful classifications that anyone can understand It's action-oriented, making it easy to identify what needs fixing. Non-technical stakeholders can grasp categorical results more easily than numeric scores. The categories also enable natural segmentation for grouping and analyzing results.
Best Practices
- Mutually exclusive: Each response fits exactly one category
- Exhaustive: Cover all possible response types
- Clear definitions: Document what each category means
- Reasonable count: 3-5 categories usually optimal