Author

seed1

203 approved definitions. Showing 61–80 of 203.

degraded mode

A reduced-function mode used when a dependency is unavailable but the workflow can still provide partial value safely — show prior summaries, disable write actions.
In degraded mode, the assistant showed prior summaries but disabled write actions.

fail open

A design where the system continues when a check fails. Sometimes acceptable for low-stakes actions; dangerous for regulated or write-capable workflows.
The team rejected fail open for regulated approvals.

PII handling

The way a deployment detects, protects, minimizes, logs, or avoids personally identifiable information — must be approved before real customer data enters any eval set.
PII handling had to be approved before real customer tickets entered the eval set.

policy check

A validation step that confirms output, action, or data access complies with customer policy or product rules before the workflow continues.
The policy check blocked responses that lacked required disclosures.

risk review

A review of deployment risks — data exposure, bad actions, reliability, compliance, user misuse, and operational failure modes — often what changes a full write-back launch into an approval-only one.
The risk review changed the launch from full write-back to approval-only.

red team scenario

A deliberately adversarial or high-risk test case used to probe model, agent, security, or workflow weaknesses before production.
The red team scenario checked whether the agent would reveal another account's data.

guardrail bypass

A failure mode where a user, prompt, tool, or configuration avoids a control that was supposed to limit behavior. Treated as a launch-blocking bug.
The FDE treated the guardrail bypass as a launch-blocking bug.

escape hatch

A deliberate manual override or alternate path that lets operators recover when automation, data, or integrations fail — should be designed in, not bolted on after launch.
The escape hatch let supervisors take over the case from the agent.

fallback path

The designed path when automation cannot proceed safely — routing to a human, using a simpler workflow, or reverting to manual processing.
The fallback path sent ambiguous cases to the existing triage queue.

confidence threshold

A threshold used to decide when model or agent output is good enough to proceed, requires human review, or should fall back to a simpler path.
The FDE raised the confidence threshold for regulated cases.

approval gate

A required approval step before a workflow can proceed or a system action can be taken.
The approval gate stopped the agent from sending customer emails directly.

human checkpoint

A specific point in a workflow where a human must review, approve, resolve ambiguity, or accept responsibility before the workflow continues.
The FDE added a human checkpoint before refunds over $500.

MCP connector

A connector that exposes a specific system, API, or data source through MCP so an AI application can use it safely with standard tool calling.
The FDE wrote an MCP connector for the internal knowledge base.

MCP layer

The controlled integration layer that exposes customer tools, data, or actions to AI applications through MCP servers — enforcing permissions without baking them into every agent.
The MCP layer gave the agent access to account data without bypassing permissions.

MCP server

A server implementing the Model Context Protocol so AI systems can access tools, resources, or prompts through a standard interface — the preferred integration layer for FDEs building agent connections to customer systems.
The FDE built an MCP server for the customer's ticketing system.

tool-use trace

A record of which tools an agent called, with inputs, outputs, timing, and errors — the primary debugging artifact for agentic workflows.
The tool-use trace explained why the agent chose the wrong escalation path.

tool sandbox

A safe environment where agent tool calls can be tested without touching production data or committing real actions.
The tool sandbox let the team test refunds without issuing refunds.

tool registry

A governed inventory of tools available to agents or applications — ownership, scopes, descriptions, and approval status tracked so nothing undocumented gets used in production.
The FDE removed an unsafe write tool from the tool registry.

tool contract

The agreed inputs, outputs, permissions, errors, and side effects of a tool an agent or workflow can call. Makes clear what the agent can do, not just what it can call.
The tool contract made it clear the agent could create drafts but not send them.

override signal

A signal generated when users correct, reject, or bypass system output — one of the most valuable inputs for improving evals and workflow design.
The override signal revealed that the model misunderstood one policy clause.