Glossary - Adaline

Adaline Gateway

A local SDK that provides a unified interface for calling 200+ LLMs with built-in batching, retries, caching, callbacks, and OpenTelemetry support. Also referred to as the “Super SDK” for its cross-provider compatibility.

Adaline Proxy

The Adaline log proxy that intercepts LLM API calls and sends them to Monitor without modifying application code.

Auto Generated Evaluator

A production-derived evaluator created from logs, sessions, Behaviors, feedback, or representative examples. Auto Generated Evaluators help Improve cycles score candidates against issues found in real traffic.

Behavior

A recurring pattern detected from project evidence, such as a user intent, assistant response pattern, tool behavior, issue, drift signal, healthy workflow, or agent trajectory pattern.

Continuous Evaluation

An automated evaluation process that runs on live data in Monitor. Samples incoming requests and scores them using configured evaluators to track quality over time.

Dataset

A collection of test cases stored in Adaline used for evaluations. Each dataset consists of rows and columns that can map to prompt variables, expected outputs, labels, evaluator inputs, or metadata.

Deep Search

A semantic search workflow for finding relevant logs by meaning rather than exact tags, filters, or keywords. Deep Search is used when you know the behavior you want to investigate but not the exact trace metadata.

Deployment

A versioned snapshot of a prompt configuration that has been published to a specific environment. Deployments are accessible via API or SDK and can be rolled back to previous versions.

Deployment Environment

A named runtime lane, such as staging or production, that points an application to a deployed prompt snapshot. Environments let teams release, compare, and roll back prompt versions safely.

Dynamic Column

A dataset column that fetches its value from an external API or another prompt at runtime, rather than storing static values. Enables live data in evaluations.

Evaluation

The process of running a prompt against a dataset and scoring the responses using one or more evaluators. Evaluations produce reports with pass/fail rates and detailed metrics.

Evaluator

A configured metric or judge used to assess prompt performance during evaluations. Types include LLM-as-a-Judge, JavaScript, Cost, Latency, Text Matcher, and more.

Improve Cycle

A reviewed workflow that turns production evidence into a proposed prompt change. An Improve cycle can use Behaviors, logs, sessions, evaluators, datasets, and prompt optimization before pausing for human review.

LLM-as-a-Judge

An evaluation pattern where an LLM scores the output of another LLM based on configurable criteria. Enables subjective quality assessment at scale.

Monitor

The Adaline surface for analyzing production logs, traces, spans, charts, cost, latency, token usage, feedback, and continuous evaluation signals.

Phase

A segment inside a session or trajectory, such as planning, retrieval, editing, testing, recovery, or final answer. Phases help reviewers understand where an agent journey changed direction or failed.

Playground

The interactive testing environment in Adaline’s prompt editor. Allows running prompts with different inputs and comparing outputs side-by-side.

Project

A container in Adaline that holds prompts, datasets, evaluations, logs, Behaviors, Improve cycles, and deployments. A project can be considered equivalent to an AI agent, application, or workflow.

Prompt

A configured instruction template in Adaline consisting of messages, model settings, variables, and optional tools. Prompts are versioned and can be deployed to environments.

Prompt Candidate

A proposed prompt snapshot generated during an Improve cycle. Reviewers compare the candidate against the current prompt using diffs, evaluations, examples, and runtime tradeoffs before approving or rejecting it.

Prompt Chaining

The technique of connecting multiple prompts where the output of one becomes the input of another. Enables complex workflows with sequential LLM calls.

Prompt Version

A saved snapshot of a prompt’s configuration at a specific point in time. Versions are auto-incremented and can be deployed or compared in evaluations.

Provider

An LLM service provider (e.g., OpenAI, Anthropic, Google, Azure). Each provider offers different models with varying capabilities and pricing.

Regression Coverage

Dataset rows, evaluators, and review evidence that preserve known failures or important healthy behavior so future prompt changes can be tested before release.

Response Format

A model configuration that constrains output structure (e.g., JSON mode, JSON schema). Ensures consistent, parseable responses from the LLM.

Session

A summarized journey grouped by a stable session ID. Sessions help Adaline understand multi-turn conversations, multi-step workflows, and agent tasks that cannot be judged from one trace alone.

Span

A single operation within a trace representing one LLM call or logical step. Spans have start/end times, attributes, and can be nested.

Synthetic Dataset

A set of generated cases that expands evaluation coverage around production evidence, Behaviors, or known failure modes. Synthetic cases should be reviewed before they become release-grade coverage.

Tool

A function that an LLM can call during generation. Tools have schemas defining their parameters and are used for retrieval, calculations, and external actions.

Tool Call

A structured request from the LLM to execute a specific tool with provided arguments. Tool calls must be processed by the application and results returned to the LLM.

Trace

A complete record of a user request flow containing multiple spans. Traces enable end-to-end visibility of LLM operations in Monitor.

Trajectory

An agent-run journey built from sessions, phases, turns, and source spans. Trajectories are especially useful for multi-step and coding-agent workflows.

Triage Finding

A diagnosis attached to a Behavior, usually with severity, hypothesis, suggested fix, and links to representative evidence.

Turn

A reconstructed step inside a session or phase, such as a user message, assistant reasoning step, tool call, tool result, or assistant reply. Turns can link back to source spans.

Variable

A placeholder in prompt templates (e.g., {{user_input}}) that is replaced with actual values at runtime. Variables are mapped to dataset columns in evaluations.

Webhook

An HTTP callback triggered by Adaline events such as deployments or evaluation completions. Enables integration with external systems and CI/CD pipelines.

Workspace

The top-level organizational unit in Adaline containing projects, members, and API keys. Workspaces define billing boundaries and access control.