Use this file to discover all available pages before exploring further.
LLM-as-a-Judge is the most versatile evaluator in Adaline. It uses an LLM to assess your prompt outputs against a custom rubric, excelling at qualitative assessment where nuanced judgment matters more than simple metrics.
Add the LLM-as-a-Judge evaluator from the evaluator menu.
2
Link a dataset
Give a name to the evaluator and link a dataset containing your test cases.
3
Define your rubric
Write the rubric that defines your evaluation criteria. A well-crafted rubric is the key to high-quality evaluations — it should be specific, actionable, and aligned with your success metrics.
4
Run the evaluation
Click Evaluate to execute the evaluation and see the results.
Evaluate this customer support response using the following criteria:Scoring Scale (1-4):4 - Excellent: Completely resolves the issue, professional tone, anticipates follow-up needs3 - Good: Addresses the main concern clearly and professionally2 - Fair: Partially helpful but missing key information or context1 - Poor: Fails to address the issue or uses inappropriate toneEvaluation Factors:- Problem resolution completeness- Professional communication standards- Information accuracy- User experience qualityProvide a score and brief justification for your assessment.
Rate this content piece on effectiveness for our target audience (1-5):5 - Outstanding: Highly engaging, actionable insights, clear value proposition4 - Strong: Good engagement with solid practical value3 - Adequate: Informative but limited engagement or actionability2 - Weak: Basic information with minimal practical value1 - Poor: Lacks clarity, value, or relevance to target audienceConsider these dimensions:- Audience alignment and relevance- Practical value and actionability- Engagement potential- Brand positioning effectiveness
Assess this feature documentation quality (1-4):4 - Comprehensive: Clear explanation, complete coverage, excellent user guidance3 - Good: Well-explained with adequate detail and guidance2 - Acceptable: Basic explanation but missing important details or clarity1 - Inadequate: Confusing, incomplete, or lacks necessary user guidanceEvaluation Areas:- Technical accuracy and completeness- User comprehension and clarity- Implementation guidance quality- Overall user experience
Evaluate brand voice alignment (1-3 scale):3 - Excellent Alignment: Perfect adherence to brand guidelines, authentic voice2 - Good Alignment: Generally consistent with minor deviations1 - Poor Alignment: Inconsistent with established brand voiceAssessment Criteria:- Tone consistency with brand guidelines- Language and terminology alignment- Audience appropriateness- Brand personality expression