Agentic UX

A framework of patterns for designing AI agents that act on behalf of users.

38 patterns, 7 territories, v1.0

Assistants answered questions. Agents take actions. The design problems are categorically different: the cost of a wrong answer is embarrassment; the cost of a wrong action is a deleted file, an unwanted charge, or a misrouted customer email.

Territories

Reversibility & rollback

When the agent is wrong, how do we undo it, and how fast?

The defining property of an agent is that it changes the world. Reversibility is the fundamental safety net. Design the undo before you design the action.

Reversibility marking

Every proposed action is labeled by how easily it can be undone: fully reversible (draft), partially (DB row with a backup), or not (sent email, charged card). Users see this before approving.
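
A minimal sketch of how such a label could travel with a proposed action, assuming three levels that mirror the examples above. The names `Reversibility`, `ProposedAction`, and `approval_label` are hypothetical and not part of the framework.

```python
from dataclasses import dataclass
from enum import Enum


class Reversibility(Enum):
    # Hypothetical levels, mirroring the three cases named in the text.
    FULL = "fully reversible"
    PARTIAL = "partially reversible"
    NONE = "not reversible"


@dataclass(frozen=True)
class ProposedAction:
    description: str
    reversibility: Reversibility


def approval_label(action: ProposedAction) -> str:
    """Text shown to the user before they approve the action."""
    return f"{action.description} [{action.reversibility.value}]"


draft = ProposedAction("Save reply as draft", Reversibility.FULL)
send = ProposedAction("Send email to customer", Reversibility.NONE)
print(approval_label(draft))
print(approval_label(send))
```

Attaching the label to the action object, rather than computing it in the view, makes it harder for an approval surface to omit it.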

Action trail

A chronological, human-readable log of everything the agent did, with the ability to inspect inputs, outputs, and side effects for each step. The trail is the primary debugging surface.

Blast-radius visualization

A compact preview of how many records, files, or users a proposed action would affect. The user sees "will rename 412 files across 37 folders" before approving.

Use it for bulk actions, schema changes, permission changes, or anything fan-out shaped.
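
One way to compute the preview line for a file operation, as a sketch. The function name `blast_radius_summary` and the sample paths are illustrative assumptions; a real surface would also handle singular forms and other record types.

```python
from pathlib import PurePosixPath


def blast_radius_summary(paths: list[str], verb: str = "rename") -> str:
    """Summarize how many files and distinct parent folders an action touches."""
    folders = {str(PurePosixPath(p).parent) for p in paths}
    # Pluralization is deliberately naive in this sketch.
    return f"will {verb} {len(paths)} files across {len(folders)} folders"


targets = ["docs/a.md", "docs/b.md", "src/main.py"]
print(blast_radius_summary(targets))
```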

Checkpoints and restore

Named snapshots before significant actions, with a visible restore button. Works whether the state lives in the app, a codebase, or an external API with a supported undo window.
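
For state that lives in the app, named snapshots can be as simple as deep copies keyed by name. This sketch assumes in-memory state only; `CheckpointStore` is a hypothetical name, and codebases or external APIs would need their own snapshot mechanism.

```python
import copy


class CheckpointStore:
    """Keeps named deep copies of application state for later restore."""

    def __init__(self) -> None:
        self._snapshots: dict[str, dict] = {}

    def save(self, name: str, state: dict) -> None:
        # Deep copy so later mutations of `state` cannot alter the snapshot.
        self._snapshots[name] = copy.deepcopy(state)

    def restore(self, name: str) -> dict:
        # Return a copy so the stored snapshot stays reusable.
        return copy.deepcopy(self._snapshots[name])


store = CheckpointStore()
state = {"title": "Q3 plan", "rows": [1, 2, 3]}
store.save("before bulk edit", state)
state["rows"].clear()  # the agent's significant action
state = store.restore("before bulk edit")
print(state)
```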

Time-delayed execution

High-impact actions run on a delay (5s, 30s, 5 min) during which a single click cancels. Trades a moment of latency for a wide margin of user control.
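
A sketch of the cancel window using the standard library's `threading.Timer`. The wrapper class `DelayedAction` is a hypothetical name; a production agent would likely persist pending actions rather than hold them in a thread.

```python
import threading
from typing import Callable


class DelayedAction:
    """Runs `action` after `delay_seconds` unless cancel() is called first."""

    def __init__(self, action: Callable[[], None], delay_seconds: float) -> None:
        self._timer = threading.Timer(delay_seconds, action)

    def start(self) -> None:
        self._timer.start()

    def cancel(self) -> None:
        # Only has an effect while the timer is still waiting.
        self._timer.cancel()

    def wait(self) -> None:
        # Block until the action has run or been cancelled.
        self._timer.join()


sent: list[str] = []
pending = DelayedAction(lambda: sent.append("email"), delay_seconds=5.0)
pending.start()
pending.cancel()  # the user's single click
pending.wait()
print(sent)
```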

Readable action diffs

Proposed changes rendered as a diff the user can read line-by-line (for text, for settings, for database rows), not just described in prose.
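
For text and settings, Python's standard `difflib` already produces a line-by-line view. The helper name `render_diff` and the sample settings below are illustrative assumptions.

```python
import difflib


def render_diff(before: str, after: str, name: str) -> str:
    """Return a unified diff of the current and proposed content."""
    lines = difflib.unified_diff(
        before.splitlines(),
        after.splitlines(),
        fromfile=f"{name} (current)",
        tofile=f"{name} (proposed)",
        lineterm="",
    )
    return "\n".join(lines)


current = "timeout: 30\nretries: 3"
proposed = "timeout: 60\nretries: 3"
print(render_diff(current, proposed, "settings.yaml"))
```

An empty result means the proposal changes nothing, which is itself worth showing to the user.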

Autonomy gradient

How independently should the agent act, and who decides?

Autonomy is not a binary. It is a slider, and the slider position should be visible, adjustable, and different for different actions. A well-designed agent gives the user language and controls for the gradient.

Suggest / confirm / execute

Three discrete modes, chosen per task or per session: the agent suggests only, confirms each step, or executes end-to-end. The mode is always visible and takes one click to change.

Per-action autonomy

Autonomy set at the capability level, not the app level: auto-send drafts to known contacts but require approval for new ones; auto-commit to feature branches but never to main.
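
The two examples above can be sketched as per-capability rules. Everything here is hypothetical: the `Decision` values, the contact list, and the assumption that feature branches share a `feature/` prefix.

```python
from enum import Enum


class Decision(Enum):
    AUTO = "auto"  # act without asking
    ASK = "ask"    # require approval
    DENY = "deny"  # never do this autonomously


# Hypothetical contact list; a real agent would load this per user.
KNOWN_CONTACTS = {"alex@example.com"}


def decide_send(recipient: str) -> Decision:
    """Auto-send to known contacts, ask for everyone else."""
    return Decision.AUTO if recipient in KNOWN_CONTACTS else Decision.ASK


def decide_commit(branch: str) -> Decision:
    """Auto-commit to feature branches, never to main, ask otherwise."""
    if branch == "main":
        return Decision.DENY
    if branch.startswith("feature/"):
        return Decision.AUTO
    return Decision.ASK


print(decide_send("new.person@example.com"))
print(decide_commit("feature/login"))
```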

Autonomous mode display

A persistent indicator (color, badge, ambient shape) that makes it unmistakable when the agent is acting without asking. Users should never discover autonomy by accident.

Escalation thresholds

Rules that auto-demote autonomy when risk crosses a line: dollar amount, user impact, novelty. Crossing the threshold hands control back to the human before execution.
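
A sketch of a threshold check that runs before execution and names why control is handed back. The limits, field names, and the `escalation_reasons` helper are assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Risk:
    dollars: float
    users_affected: int
    seen_before: bool  # False when the action type is new for this user


@dataclass(frozen=True)
class Thresholds:
    # Hypothetical defaults; real limits would be user- or org-configured.
    max_dollars: float = 100.0
    max_users: int = 10


def escalation_reasons(risk: Risk, limits: Thresholds) -> list[str]:
    """Return the crossed thresholds; an empty list means no escalation."""
    reasons = []
    if risk.dollars > limits.max_dollars:
        reasons.append("dollar amount")
    if risk.users_affected > limits.max_users:
        reasons.append("user impact")
    if not risk.seen_before:
        reasons.append("novelty")
    return reasons


print(escalation_reasons(Risk(250.0, 3, True), Thresholds()))
```

Returning the reasons, not just a boolean, lets the approval prompt explain why the agent stopped.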

Autonomy budgets

Time- or action-count-bounded grants of autonomy: "run unattended for 30 minutes or 50 actions, whichever comes first, then pause for review." Prevents runaway sessions.
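
The "whichever comes first" rule can be sketched as a small budget object the agent consults before each action. `AutonomyBudget` and `try_spend` are hypothetical names; the injectable clock is only there to make the sketch testable.

```python
import time
from typing import Callable


class AutonomyBudget:
    """Grants autonomy until a time limit or an action count is reached."""

    def __init__(
        self,
        max_seconds: float,
        max_actions: int,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._clock = clock
        self._deadline = clock() + max_seconds
        self._actions_left = max_actions

    def try_spend(self) -> bool:
        """Consume one action; False means pause and ask for review."""
        if self._clock() >= self._deadline or self._actions_left <= 0:
            return False
        self._actions_left -= 1
        return True


# 30 minutes or 50 actions, whichever comes first.
budget = AutonomyBudget(max_seconds=30 * 60, max_actions=50)
actions_run = 0
while budget.try_spend():
    actions_run += 1
print(actions_run)
```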

Cost, latency & budget

What does this action cost, in dollars, in time, in tokens?

Agents are the first class of software where a loop can burn unbounded money or wall-clock time in the background. The UI has to make both legible before, during, and after the run.

Pre-task cost estimate

A predicted cost range (tokens, dollars, minutes) shown before the agent begins, at a precision that matches the scale of what the user is about to spend.

Hard budget ceilings

User-set caps per task, per day, per account. The agent stops when the cap is hit and asks, rather than billing through it.
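
A sketch of a cap that refuses the charge instead of billing through it. `BudgetCap` and `BudgetExceeded` are hypothetical names, and amounts are kept in integer cents to avoid floating-point drift.

```python
class BudgetExceeded(Exception):
    """Raised when a charge would push spending past the user's cap."""


class BudgetCap:
    def __init__(self, cap_cents: int) -> None:
        self.cap_cents = cap_cents
        self.spent_cents = 0

    def charge(self, cents: int) -> None:
        # Check before spending, so the cap is never exceeded.
        if self.spent_cents + cents > self.cap_cents:
            raise BudgetExceeded(
                f"charge of {cents} cents would exceed cap of {self.cap_cents} cents"
            )
        self.spent_cents += cents


cap = BudgetCap(cap_cents=500)
cap.charge(300)
try:
    cap.charge(300)
except BudgetExceeded as stop:
    # This is where the agent would pause and ask the user.
    print(f"Paused: {stop}")
```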

Running meters

Live counters during execution: tokens consumed, dollars spent, steps remaining. Ambient enough to ignore when fine, obvious enough to notice when off.

Cost transparency (post-hoc)

After a run, a breakdown the user can inspect: which tool calls cost what, which steps were retries, where the time and dollars went.
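
The breakdown can be derived from a per-step record. This is a sketch under assumed field names (`tool`, `cost_cents`, `seconds`, `is_retry`); none of them come from the framework.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    tool: str
    cost_cents: int
    seconds: float
    is_retry: bool = False


def cost_breakdown(steps: list[Step]) -> dict:
    """Aggregate a run into per-tool cost, retry cost, and totals."""
    per_tool: dict[str, int] = defaultdict(int)
    for step in steps:
        per_tool[step.tool] += step.cost_cents
    return {
        "per_tool_cents": dict(per_tool),
        "retry_cents": sum(s.cost_cents for s in steps if s.is_retry),
        "total_cents": sum(s.cost_cents for s in steps),
        "total_seconds": sum(s.seconds for s in steps),
    }


run = [
    Step("search", 4, 1.5),
    Step("search", 4, 1.5, is_retry=True),
    Step("summarize", 12, 6.0),
]
print(cost_breakdown(run))
```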

Latency expectations

Honest time estimates for long-running agents (seconds vs. minutes vs. hours), and a way to walk away and come back without losing state.

Cross-session budget

Budgets that persist across sessions, users, and devices. Starting a new chat should not silently reset a cap the user cared about.

Cross-app handoffs

When control crosses a boundary, what travels with it, and what does not?

Most agent value comes from composing across apps: an email agent that books a calendar event, a code agent that opens a pull request, a research agent that writes to a doc. Boundaries are where trust and blame get muddled.

Context portability

A clear mechanism for what context moves when the agent crosses an app boundary: credentials, history, files, preferences. Users can inspect and edit the handoff payload.

Per-target tool permissions

Permissions scoped per destination app, not once globally. Granting Gmail access does not implicitly grant Google Drive access inside the same agent.
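
A sketch of grants keyed by destination and scope, so that access to one app never implies access to another. `PermissionStore` and the target and scope strings are hypothetical and do not correspond to any real provider's scope names.

```python
class PermissionStore:
    """Tracks grants as (target app, scope) pairs; nothing is implied."""

    def __init__(self) -> None:
        self._grants: set[tuple[str, str]] = set()

    def grant(self, target: str, scope: str) -> None:
        self._grants.add((target, scope))

    def revoke(self, target: str, scope: str) -> None:
        self._grants.discard((target, scope))

    def is_allowed(self, target: str, scope: str) -> bool:
        # Exact match only: no wildcard or cross-app inheritance.
        return (target, scope) in self._grants


perms = PermissionStore()
perms.grant("gmail", "read")
print(perms.is_allowed("gmail", "read"))
print(perms.is_allowed("drive", "read"))
```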

Inter-agent handoffs

When one agent invokes another, the UI shows the chain: who called whom, with what payload, under which permissions. Handoffs are first-class citizens, not hidden plumbing.
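
One possible shape for a handoff record and its rendering in the UI, as a sketch. The `Handoff` fields and the agent names are illustrative assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Handoff:
    caller: str
    callee: str
    payload_keys: tuple[str, ...]  # what travelled, by name only
    permissions: tuple[str, ...]   # what the callee may do


def render_chain(handoffs: list[Handoff]) -> list[str]:
    """One human-readable line per handoff, in call order."""
    return [
        f"{h.caller} -> {h.callee}"
        f" | payload: {', '.join(h.payload_keys)}"
        f" | permissions: {', '.join(h.permissions)}"
        for h in handoffs
    ]


chain = [
    Handoff("email-agent", "calendar-agent", ("invitees", "time"), ("calendar:write",)),
    Handoff("calendar-agent", "rooms-agent", ("time",), ("rooms:read",)),
]
for line in render_chain(chain):
    print(line)
```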

Authentication chains

Legible auth flows when an agent acts across identity domains. The user sees which identity is performing each step and can revoke any link in the chain.

Responsibility attribution

Clear records of which agent (and which human) is responsible for each action across the chain. Essential for audits, incident reviews, and shared accountability.

Ambient & voice agents

When is it listening, watching, or running, and how do we tell?

Always-on agents break the classical command-response contract. Users lose the sharp signal that something is happening. The UI compensates with durable, glanceable presence.

Activation boundaries

Explicit, visible starts and stops for listening, watching, or running. Wake words, hardware indicators, and time-boxed sessions, not an ambiguous always-on state.

Ambient presence displays

Low-attention signals (a color, a shape, a sound) that convey agent state without requiring a glance at a screen. Quiet when idle, louder when active.

Interruptibility

A single, known gesture to pause or cancel an ambient agent: a word, a button, a hand sign. Works the same way regardless of what the agent is doing.

Voice confirmation for high-stakes actions

Explicit voice or multimodal confirmation before irreversible actions, not a silent "sure, done that." The user hears or sees the exact action before it commits.

Multi-user awareness

The agent knows who is speaking, acts on the right permissions, and can say so. "I heard Alex ask me to pay the bill; Alex does not have billing access." Shared devices demand it.
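
A sketch of the permission check and the spoken explanation, assuming speaker identification has already happened upstream. The `PERMISSIONS` table and the `respond` helper are hypothetical.

```python
# Hypothetical per-speaker permissions on a shared device.
PERMISSIONS: dict[str, set[str]] = {
    "Alex": {"music"},
    "Sam": {"music", "billing"},
}


def respond(speaker: str, request: str, required: str) -> str:
    """Act only on the speaker's own permissions, and say why if refusing."""
    if required in PERMISSIONS.get(speaker, set()):
        return f"Okay {speaker}, I will {request}."
    return (
        f"I heard {speaker} ask me to {request}; "
        f"{speaker} does not have {required} access."
    )


print(respond("Alex", "pay the bill", "billing"))
print(respond("Sam", "pay the bill", "billing"))
```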

Trust scaffolding & provenance

How do we trust what the agent claims to have done?

Trust is not a vibe. It is the sum of small, durable signals that give the user evidence, and a way to audit the evidence later. Treat trust as infrastructure, not decoration.

Agent identity

Each agent has a stable name, version, and capability set the user can point at. Distinguishes which agent did what when multiple are in play.

Capability disclosure

A plain-language "what I can do and what I cannot" surface, kept current as capabilities change. Users should never have to test the limits to learn them.

Failure disclosure

Honest signaling when the agent hit a wall: unknown, blocked, rate-limited, low confidence. The failure mode is named, not hidden behind a hopeful paraphrase.
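
Naming the failure mode is easier when it is a typed field rather than free prose. A sketch, with `FailureMode`, `StepResult`, and `narrate` as hypothetical names:

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class FailureMode(Enum):
    # The four modes named in the text.
    UNKNOWN = "unknown"
    BLOCKED = "blocked"
    RATE_LIMITED = "rate-limited"
    LOW_CONFIDENCE = "low confidence"


@dataclass(frozen=True)
class StepResult:
    summary: str
    failure: Optional[FailureMode] = None  # None means the step succeeded


def narrate(result: StepResult) -> str:
    """Lead with the failure mode so prose cannot smooth it over."""
    if result.failure is None:
        return result.summary
    return f"Failed ({result.failure.value}): {result.summary}"


print(narrate(StepResult("Searched the CRM for the customer record")))
print(narrate(StepResult("CRM search returned HTTP 429", FailureMode.RATE_LIMITED)))
```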

Action attribution

Every external artifact the agent produces (commit, email, ticket, message) carries legible attribution. Downstream humans can tell what was agent-authored.

Audit export

A one-click, user-owned export of agent activity: inputs, tools called, outputs, costs. Works for compliance and for the user reviewing their own history.
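
A sketch of an export in a plain, portable format. JSON is an assumption here, as are the `AuditRecord` fields; the framework does not prescribe a schema.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class AuditRecord:
    timestamp: str  # ISO 8601 string, kept as text for portability
    tool: str
    inputs: dict
    output: str
    cost_cents: int


def export_audit(records: list[AuditRecord]) -> str:
    """Serialize agent activity to JSON the user can keep and inspect."""
    return json.dumps([asdict(r) for r in records], indent=2, sort_keys=True)


history = [
    AuditRecord(
        timestamp="2026-04-17T09:30:00Z",
        tool="send_email",
        inputs={"to": "alex@example.com"},
        output="queued",
        cost_cents=2,
    )
]
exported = export_audit(history)
print(exported)
```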

Design review

Coverage notes for critique. Most production regressions are variants of one of these.

Anti-patterns

Opaque autopilot
The agent acts without a durable indicator that it is acting. Users discover autonomy by seeing unexpected results, not by choosing it.
Bundled consent
One "allow everything" modal at setup, with no way to revise the grant later without starting over.
Cheerful failure
Prose narration that smooths over blocked tool calls, empty search results, or low-confidence steps, presenting them as success.
Hidden cost
No forecast before a run, no meter during, no breakdown after. The user only learns the cost when the bill arrives.
Silent handoff
Control passes between agents, apps, or identities without a legible record. Attribution becomes impossible after the fact.

Open problems

Consent UIs for recurring, long-horizon agents (e.g. a monthly rebalancer): the right cadence is unknown.
Attribution across multi-agent chains: who is responsible when agent A delegates to agent B, which in turn delegates to a tool?
Universal revoke: when a user revokes an agent, how do downstream caches, forks, and copies get invalidated?
Non-visual presence for ambient agents in shared spaces: sound, light, and haptic vocabularies are nascent.
Voice confirmation under adversarial conditions: how do we prevent replay attacks on spoken approvals?

How to use this framework

When scoping a new agent surface, walk the 7 territories in order. If you cannot answer a territory's core question, the surface is not ready to ship.
In design reviews, use the anti-patterns as a checklist. Most production regressions are variants of one of the five.
Treat this as a living reference. If you ship a new pattern we have not named, send it. We version this framework publicly.

Changelog

v1.0
2026-04-17
Initial public release. 7 territories, 36 patterns, 5 anti-patterns, 5 open problems.

Other frameworks

Chat UX (live, v1.0)
A framework of patterns for designing reliable AI conversations.
Trust Scaffolding (live, v1.0)
A framework of patterns for designing calibrated trust in AI systems.
AI Onboarding (coming soon, planned Q4 2026)
Teaching users what AI can do without a user manual.
Visual AI UX (coming soon, planned 2027)
Generative images, design tools, canvas surfaces.
