Help us improve
Share bugs, ideas, or general feedback.
How this skill is triggered — by the user, by Claude, or both
Slash command
/erpaval:agent-ux-patternsThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
A structured methodology for designing agent user experiences — particularly the human-in-the-loop surfaces (inbox, gates, reviews, trust) where agents and humans collaborate. Optimized for conversations that start with a vague pain point ("users feel overwhelmed by the agent's outputs") and end with a concrete data model, interaction vocabulary, and prioritized backlog.
Designs and builds AI agents for business, research, operations, and creative domains. Covers architecture, capabilities, knowledge, context, planning, and subagents.
Design AI agents with capabilities, knowledge, and context. Guides agent architecture decisions from simple loops to subagents and planning.
Guides building agent-native apps where agents match UI capabilities via tools and achieve features in loops. For autonomous agents, MCP tools, self-modifying systems.
Share bugs, ideas, or general feedback.
A structured methodology for designing agent user experiences — particularly the human-in-the-loop surfaces (inbox, gates, reviews, trust) where agents and humans collaborate. Optimized for conversations that start with a vague pain point ("users feel overwhelmed by the agent's outputs") and end with a concrete data model, interaction vocabulary, and prioritized backlog.
This skill focuses on the agent-side design problem: how agents surface work for review, how autonomy levels shift over time, how trust accrues through observation of human responses. For product-discovery disciplines (customer research, problem framing, PR/FAQ authoring), see the shared references in ${CLAUDE_PLUGIN_ROOT}/skills/product-design-shared/references/ and the consumer skills that wrap them (working-backwards, customer-research).
Start with the user's stated friction, not the solution. Ask clarifying questions to find the root cause.
Pattern: "I'm juggling too many X" usually means the triage/prioritization layer is missing, not that X needs to be reduced.
Research three areas in parallel before proposing solutions:
Vector 1: Internal codebase — What data, models, events, and infrastructure already exist? What's the richest unexploited data source? What patterns has the team already established?
Vector 2: Prior art — What have others built for this problem? Open-source projects, commercial products, academic papers, blog posts. Find the closest direct competitor and study it.
Vector 3: Adjacent platforms — What native capabilities does the deployment platform offer? (Slack features, GitHub Actions, AWS services, browser APIs). Often the best solution is assembling existing primitives, not building from scratch.
Generate 4-6 distinct directions, each with:
Avoid converging too early. Present all directions before recommending. Let the user's reaction guide which to expand.
Map the emerging concept to an established framework or vocabulary. This is critical for:
Sources of good vocabulary:
The vocabulary should be:
Translate the concept into a concrete data model. Include:
Example pattern:
Item:
id: string
intent: enum (from vocabulary)
source: reference to originating system
title: string (8-word summary)
detail: rich_text (expandable)
artifacts: [{type, ref, label}]
status: pending | acted_on | dismissed | expired
created_at: timestamp
decision: {action, reason, duration_s} | null
For Slack-based conversations:
Every agent-to-human interaction is one of five types (A2H vocabulary):
| Intent | User Action | Priority | Color |
|---|---|---|---|
| ESCALATE | Take over, agent is stuck | Highest | Red |
| AUTHORIZE | Approve/reject a gate | High | Amber |
| COLLECT | Provide structured input | Medium | Blue |
| RESULT | Review completed work | Low | Green |
| INFORM | Awareness only | Lowest | Grey |
The inbox composition is itself a trust metric: early on it's AUTHORIZE-heavy (agent asks permission), over time it shifts to INFORM-heavy (agent reports what it did).
Trust is learned, not configured. The mechanism:
Maps to Levels of Autonomy:
Every human action on an inbox item is recorded:
This journal feeds the progressive trust engine and provides an audit trail.
Projects decompose into phases with gates between them:
For sandboxed agent execution:
When running a design thinking session:
See ${CLAUDE_PLUGIN_ROOT}/skills/agent-ux-patterns/references/key-frameworks.md for detailed framework descriptions (Twilio A2H Protocol, Levels of Autonomy, Slack primitives).
| File | Contents |
|---|---|
references/key-frameworks.md | Twilio A2H Protocol, Levels of Autonomy (L0-L5), OpenHands Outer Loop, LangChain Agent Inbox, Slack primitives |