Skip to content
Agentic Craft

Trust & Control Plane

Autonomy governance, operational configuration, transparency, and guardrail patterns that establish and maintain user trust in agentic experiences.

Settings Templates

Reusable control groups for product teams that need durable agent boundaries across sessions, not one-off prompt instructions.

Reusable Settings Groups

Set the maximum independence level, escalation threshold, and allowed action classes.

Choose when background runs, blockers, approvals, and completions notify users.

Require confirmation for external communication, spending, publishing, and destructive edits.

Control durable memory review, workspace scope, expiry, and removal behavior.

Effective policy

Autonomy

Maximum Level 2: Human-in-Command

Applies to the current workspace until changed.

ReviewReview

External actions

Approval required before sending, publishing, or modifying shared records.

Risky actions should show a locked consequence preview before execution.

RequiredRequired

Notifications

Notify on blockers, approvals, completions, and budget warnings.
AllowedAllowed

Memory

Durable memory is blocked until review settings are enabled.

Memory should not expand scope silently.

BlockedBlocked

Settings templates should use confirmation or undo for risky changes such as disabling approval gates, lowering escalation thresholds, or expanding memory scope.

Autonomy Level

Select how much independence the agent has. Higher levels increase speed but reduce oversight. Uses the 6-level autonomy scale from foundations.

1
2
3
4
5
6
Level 2·Human-in-Command

AI drafts outputs and proposes actions; human approves every one before execution.

Capabilities

Draft project findings for review
Propose requirement-to-test-case mappings
Suggest source requests to developer

Restrictions

Cannot send emails without approval
Cannot modify project records
Cannot create or close findings

Surface: Approval modal

PropertyValueNotes
Scale6 levels (1–6)From Human-Augmented to Human-Out-of-the-Loop
DefaultLevel 2Start conservative, unlock higher levels over time
IndicatorStepped barDiscrete steps, not a continuous slider
Autonomy levels should be progressive — start at Level 2 for new workflows and unlock higher levels only after the agent has demonstrated reliability. Never default to full autonomy for review tasks that affect approval outcomes.

Mode Toggles

Switch the agent's operational mode to focus on different aspects of the review workflow. Each mode changes available tools and priorities.

Requirements mode

Active

Focus

Ensuring all source material meets operating playbook requirements and policy alignment claims.

Available tools

Source completeness checker
Requirement coverage matrix generator
Policy alignment validator
Lifecycle document scanner
ModeFocusTools
RequirementsSource material and policy alignment4 requirements-specific tools
ResearchTechnical investigation4 research-specific tools
ReviewDeliverable review and audit prep4 review-specific tools
Mode switching should be instant — no confirmation dialog needed since it only changes tool availability and focus, not data access. Reviewers typically switch modes multiple times during a single review session.

Context Scope

Define which documents and project artifacts the agent can access. Narrower scopes reduce noise; wider scopes enable cross-referencing.

Portal Only

ACME Customer Portal v3.1

Accessible documents

Project brief v3Full document
QA Notes 2026-003Product-specific checks
2 documents in scope
ScopeDocumentsUse case
Portal Only2 documentsFocused work on a single product
Portal + Policy4 documentsEvaluating policy alignment claims
Global6 documentsCross-referencing across full project workspace
Context scope directly affects response quality and cost. A narrower scope produces faster, cheaper answers but may miss cross-document dependencies. For OR preparation, always use Global scope to ensure nothing is overlooked.

Confidence Display

How the agent communicates its certainty level through language, visual cues, and actionable follow-ups.

High confidence
Export workflow requires CSV and JSON export encryption for all data-at-rest operations. The product implements this through the approved export service referenced by the implementation notes.
Source: Project brief v3, §5.1.1
LevelIndicatorLanguage pattern
HighGreen dotDirect assertions with citations
MediumAmber dotHedged language: "appears to," "based on"
LowRed dotExplicit uncertainty + verify action
Confidence indicators should never be hidden from the user. Even when the agent is highly confident, showing the source builds trust over time. Low-confidence responses must always offer a path to human verification.

Kill Switch

Always-available mechanism to immediately halt agent execution. The stop control adapts its prominence to the agent's current state.

Agent idle

Waiting for instructions

StateButton styleBehavior
IdleSubtle borderPresent but low prominence
RunningRed-tinted backgroundProminent, immediately accessible
StoppedResume + Discard optionsPartial results preserved, user chooses next step
The stop button must always be reachable within one click. During long-running operations like batch requirement analysis, it should be the most prominent UI element. Stopped agents must preserve partial work — never discard without explicit user consent.

Cost Transparency

Showing users the computational cost of agent operations. Compact mode provides a glanceable summary; detailed mode breaks down token usage and pricing.

13,134 tokens
·$0.21·
4.2s
ModeShowsUse case
CompactTotal tokens, cost, elapsed timeInline display after each response
DetailedInput/output split, model, pricingBudget review and cost optimization
Cost transparency builds trust with procurement teams and lab managers who need to justify AI spending. The compact format should be unobtrusive; detailed mode is for when users actively want to understand costs.

Data Provenance

Tracing information back to its origin. Sources mode shows where data came from; chain mode shows the full reasoning path from source to conclusion.

Project brief v3Primary

§5.1 — Requirement Definitions

x
94%

Operating playbookGuidance

§12.4 — Reviewer Actions

x
87%

Previous launch review (2025-08)Reference

§6 — Findings Summary

x
71%
ModeShowsWhen to use
SourcesDocument, section, confidence, typeQuick verification of data origin
ChainSource → fact → inference → conclusionAuditing the full reasoning path
In complex review workflows, every claim must be traceable to source material. Data provenance mirrors the reviewer's own methodology — showing the chain from source document through extracted requirement to analytical conclusion. This makes agent outputs auditable by review teams.

Audit Trail

Immutable log of all agent actions for requirements and review. Summary mode shows a compact timeline; detailed mode expands each entry with full context and source links.

14:02:11
Opened Project brief v3
14:02:14
Cross-referenced Export workflow QA notes
14:02:18
Flagged Export workflow for reviewer approval
14:02:22
Reviewer approved finding classification
ModeShowsAudience
SummaryTimestamp + action descriptionQuick scan during review
DetailedUser, outcome, source linksFormal audit and requirements review
Audit trails are a regulatory requirement in compliance and release-governance contexts. Every agent action must be logged with enough detail for an review team sessioner to reconstruct exactly what happened. The summary view keeps daily work manageable while the detailed view satisfies formal review needs.