# skill-authoring
Creates and optimizes Claude Code skills following Anthropic's official best practices with emphasis on agent parallelization and script-first determinism. Use when: (1) creating a new skill from scratch, (2) optimizing an existing skill that exceeds 500 lines or has poor discoverability, (3) extracting inline code into scripts/ or reference material into references/, (4) designing orchestrator + sub-agent architectures for complex skills, (5) restructuring a skill directory into SKILL.md + scripts/ + references/ layout, (6) auditing skill cross-references for stale links. Covers: agent-first orchestration, parallel sub-agent design, script-first determinism, frontmatter rules, progressive disclosure, directory layout, description writing, and quality checklist.
```bash
npx claudepluginhub abhattacherjee/claude-code-skills --plugin skill-authoring
```

This skill uses the workspace's default tool permissions.
1. **Decompose into agents** — break complex skills into an orchestrator + specialized sub-agents.
Supported fields: name, description, metadata, compatibility, license.
```yaml
---
name: kebab-case-name  # ≤64 chars, lowercase + hyphens only
description: "Third-person description. Use when: (1) ..., (2) ..."  # ≤1024 chars, single-line quoted
metadata:
  version: 1.0.0  # semver: patch=typos, minor=new content, major=breaking
---
```
Do NOT include: author, date, tags, allowed-tools, category, or
top-level version (use metadata.version instead). Use double-quoted single-line
strings for description — block scalars (description: |) cause VS Code linter errors.
Description rules:
Use when: (1) ..., (2) ..., (3) ...

```
your-skill/
├── SKILL.md              # Required — decision workflow, when-to-use, key rules
├── scripts/              # Optional — executable automation
│   ├── extract.sh        # Pre-processing: deterministic data extraction
│   └── apply-fixes.sh    # Post-processing: apply agent results
└── references/           # Optional — lookup material loaded on-demand
    ├── field-tables.md   # Tables, matrices, lookup data
    └── examples.md       # Code examples, past case studies

# Agent definitions live alongside other agents (not inside the skill):
.claude/agents/
├── your-orchestrator.md  # Pure orchestrator — delegates everything
├── your-sub-agent-a.md   # Focused specialist (NOT user-invocable)
└── your-sub-agent-b.md   # Focused specialist (NOT user-invocable)
```
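Scaffolding this layout takes a few commands; a hedged sketch (the directory and file names are just the example ones from the tree above):

```shell
#!/usr/bin/env bash
# Scaffold the standard skill layout: SKILL.md + scripts/ + references/
set -eu

SKILL_DIR="${1:-your-skill}"

mkdir -p "$SKILL_DIR/scripts" "$SKILL_DIR/references"
touch "$SKILL_DIR/SKILL.md"

# Once scripts exist, remember to make them executable:
# chmod +x "$SKILL_DIR"/scripts/*.sh

echo "Created $SKILL_DIR/{SKILL.md,scripts/,references/}"
```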
| Content Type | Location | Why |
|---|---|---|
| Decision workflow | SKILL.md | Always loaded — guides what to do |
| Trigger conditions | SKILL.md | Must be visible for skill activation |
| Quick-reference commands | SKILL.md | Frequently needed during use |
| Agent orchestration pattern | SKILL.md | Defines how agents coordinate |
| Agent definitions | .claude/agents/ | Reusable across skills, standard location |
| Lookup tables, field refs | references/ | Consulted occasionally, not always |
| Code examples, case studies | references/ | Large blocks that dilute SKILL.md |
| Executable procedures | scripts/ | Predictable, testable, reusable |
Name reference files descriptively: `api-field-reference.md`, not `ref1.md`.

Default: extract a script. Only skip if the skill is purely decision guidance with no deterministic steps.
Extract into scripts/ when ANY apply:
Script requirements:
- `--help` / `-h` with usage examples
- `--fix` mode where applicable (detect + auto-remediate)
- `chmod +x scripts/*.sh`
- `#!/usr/bin/env bash` shebang (portable)
- `set` flags by script purpose (see Pitfall below)

Pitfall: `set -e` interacts badly with bash arithmetic and pipes.
Common triggers: (1) `find | sort | head -N` — `head` closes the pipe, causing SIGPIPE
(exit 141) with `pipefail`, (2) `grep -c` returns exit 1 when the count is 0,
(3) `echo "$var" | while read` in subshells, (4) `((var++))` when `var=0` — `((0))`
evaluates to false, causing `set -e` to terminate the script. Fix: use
`VAR=$((VAR + 1))` instead of `((VAR++))`. Use `set -euo pipefail` for validation
scripts; use `set -eu` (without `pipefail`) for context-gathering scripts.
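A runnable sketch of the arithmetic and grep pitfalls alongside their safe forms (the counter and sample data are illustrative):

```shell
#!/usr/bin/env bash
set -eu  # no pipefail, as recommended for context-gathering scripts

count=0

# UNSAFE under set -e: ((count++)) returns the pre-increment value,
# so when count=0 the command "fails" and the script would exit here:
# ((count++))

# SAFE: plain assignment always succeeds regardless of the value
count=$((count + 1))

# SAFE grep counting: grep -c exits 1 on zero matches, so guard it
matches=$(printf 'a\nb\n' | grep -c 'z' || true)

echo "count=$count matches=$matches"
```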
After writing the script, slim SKILL.md:
Reference from SKILL.md:
## Quick Check
```bash
./scripts/validate.sh /tmp/data.json # Report only
./scripts/validate.sh /tmp/data.json --fix # Auto-remediate
./scripts/validate.sh --help # Usage
```
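A hypothetical skeleton for such a `validate.sh`, following the `--help`/`--fix` conventions above; the validation logic, function names, and file paths are placeholders, not the skill's actual script:

```shell
#!/usr/bin/env bash
# Skeleton for scripts/validate.sh: report by default, remediate with --fix
set -euo pipefail  # validation script: fail loudly, including in pipes

usage() {
  cat <<'EOF'
Usage: validate.sh <file.json> [--fix]
  --fix   auto-remediate detected problems
  --help  show this message
EOF
}

validate() {
  local file="$1" fix="${2:-0}"
  if [ -s "$file" ]; then
    echo "OK: $file"
  else
    echo "FAIL: $file is missing or empty"
    if [ "$fix" = "1" ]; then
      echo '{}' > "$file"   # illustrative auto-fix: write an empty object
      echo "FIXED: $file"
    fi
  fi
}

# Demo run against a scratch file
demo=/tmp/validate-demo.json
rm -f "$demo"
validate "$demo"      # reports FAIL
validate "$demo" 1    # fixes it
validate "$demo"      # now OK
```

The report/fix split keeps the default invocation side-effect-free, which is what makes dry-run testing (below) safe.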
Default: decompose into agents. Only skip if the skill is a single-step check or pure decision guidance. Every skill with 2+ independent subtasks should use parallel agents.
| Signal | Agent Approach | Teams? |
|---|---|---|
| Task has 2+ independent subtasks | Parallel sub-agents for each | No |
| Task requires web search, content reading, or AI judgement | Dedicated agent per domain | No |
| Task processes N items of the same type | Fan-out: one agent per item (or per batch) | No |
| Task has sequential phases with parallel work within | Orchestrator coordinates phase gates | Maybe |
| Task is a single deterministic check | No agent — use a script instead | No |
| Multi-phase workflow with inter-agent feedback | Named teammates via TeamCreate | Yes |
The pure orchestrator pattern is the gold standard for complex skills:
```
Orchestrator (coordinates, decides, reports)
├── Sub-agent A (focused task 1) ─── launched in parallel ──┐
├── Sub-agent B (focused task 2) ─── launched in parallel ──┤ SINGLE message
├── Sub-agent C (focused task 3) ─── launched in parallel ──┘
└── Script (deterministic pre/post-processing)
```
Orchestrator rules:
Agent Teams teammates and sub-agents CANNOT call MCP tools. MCP server connections and tool permissions are session-scoped — they don't propagate to tmux panes or child agent sessions. When a teammate calls an MCP tool, it shows "Permission request sent to team leader" and deadlocks — the lead has no mechanism to approve.
"Lead Reads, Agents Analyze" pattern:
- Never give agents `mcp__*` or `ReadMcpResourceTool` tools

This applies to ALL MCP servers (Figma, Sentry, Railway, etc.) and both Agent Teams
teammates and non-team sub-agents (Agent tool calls). Always design skills that use
MCP data with this constraint in mind.
Each sub-agent should be maximally specialized. Model tiers: `haiku` for fast/simple
tasks (data extraction, formatting), `sonnet` for moderate judgement (code review,
validation), `opus` only when deep reasoning is essential.

Fan-out by item — one agent per catalog, per PR, per test folder:
```
# 3 catalogs → 3 parallel agents (SINGLE message)
Task(agent=general-purpose, prompt="Validate curated catalog URLs...")
Task(agent=general-purpose, prompt="Validate google-places catalog URLs...")
Task(agent=general-purpose, prompt="Validate experiences catalog URLs...")
```
Fan-out by concern — one agent per review dimension:
```
# 3 review concerns → 3 parallel agents (SINGLE message)
Task(agent=code-reviewer, prompt="Review for bugs/correctness...")
Task(agent=code-reviewer, prompt="Review for simplicity/DRY...")
Task(agent=code-reviewer, prompt="Review for project conventions...")
```
Phased parallelism — sequential phases, parallel within each:
```
Phase 1: Script extracts data (deterministic)
Phase 2: 3 parallel agents process data (judgement)
Phase 3: Script applies fixes (deterministic)
Phase 4: 1 agent validates results (judgement)
```
The most powerful pattern combines both:
Example flow: extract-urls.sh → 3 parallel verification agents → apply-fixes.sh
Use Agent Teams when teammates need to communicate with each other across phases — not just report back to an orchestrator.
| Signal | Use Teams | Use Sub-Agents |
|---|---|---|
| Multi-phase workflow with feedback loops | ✓ | |
| Independent parallel tasks (fan-out) | ✓ | |
| Teammates need each other's findings | ✓ | |
| One-shot parallel analysis | ✓ | |
| Iterative creative workflow (design, video) | ✓ | |
| Quick research/validation | ✓ |
```
TeamCreate("my-workflow")
├── TaskCreate tasks for each work item
├── Spawn teammates (Agent tool with team_name + name)
│   ├── Teammate A claims + works tasks
│   ├── Teammate B claims + works tasks
│   └── Teammates communicate via SendMessage
├── Lead monitors progress via TaskList
├── Lead synthesizes results
└── TeamDelete (cleanup)
```
Skills should support both modes — teams when available, sub-agents as fallback:
```markdown
## Orchestration Mode
Check `CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS`:
- **If enabled**: Use TeamCreate for persistent multi-phase coordination
- **If disabled** (default): Use parallel Agent tool calls (existing pattern)

Both modes produce identical results. Teams add inter-agent communication.
```
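A minimal sketch of how a skill's script could branch on that flag (only the variable name comes from the snippet above; the mode labels are illustrative):

```shell
#!/usr/bin/env bash
# Pick orchestration mode based on the experimental-teams flag
set -eu

if [ -n "${CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS:-}" ]; then
  MODE="teams"       # persistent multi-phase coordination via TeamCreate
else
  MODE="sub-agents"  # default: parallel Agent tool calls in a single message
fi

echo "orchestration mode: $MODE"
```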
When using teams instead of anonymous sub-agents:
## Full Workflow (Team Orchestration)
### Step 1: Create Team
TeamCreate("my-workflow") → spawns shared task list.
### Step 2: Define Tasks
TaskCreate for each work item (extraction, validation, enrichment, etc.)
### Step 3: Spawn Named Teammates
Launch via Agent tool with `team_name` + `name` parameters.
Each teammate claims tasks from the shared list.
### Step 4: Monitor & Synthesize
Lead polls TaskList, teammates SendMessage findings to each other.
Lead collects completed results and generates final report.
### Step 5: Cleanup
TeamDelete("my-workflow")
For skills that spawn agents, create .claude/agents/<agent-name>.md:
```markdown
---
name: agent-name
description: "Single-purpose description. NOT user-invocable — spawned by <orchestrator>."
model: sonnet  # or haiku for simple tasks
---

You are a **<Role Name>**. Your mission is to <focused task>.

## Input (provided by orchestrator)
[What the orchestrator passes in the Task prompt]

## Output Format
[Exact JSON/report structure to return]

## Workflow
[Step-by-step procedure]
```
Agent registration: If the skill uses an orchestrator, include a Sub-Agent Registry table in the orchestrator's agent file listing all sub-agents, their concurrency model (parallel/sequential), purpose, and model tier.
Skills with 3+ sequential phases or workflows lasting >2 minutes should include a task manifest — a script that defines the exact TaskCreate checklist for each workflow the skill supports.
| Signal | Required? |
|---|---|
| 3+ sequential phases | Yes — users need visibility |
| Multiple workflows/subcommands | Yes — each workflow gets its own manifest |
| Single-phase script | No — overkill |
| Pure decision guidance (no execution) | No — nothing to track |
Every skill with tracking should include scripts/task-manifest.sh — a bash case
statement that emits a JSON array of tasks per workflow. Each task has subject,
activeForm, and description fields matching TaskCreate parameters.
See references/task-tracking-pattern.md for the full script template with examples.
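A minimal sketch of that case-statement shape (the workflow and task names here are placeholders; the reference file has the real template):

```shell
#!/usr/bin/env bash
# scripts/task-manifest.sh: emit a JSON task array per workflow
set -eu

task_manifest() {
  case "$1" in
    --list)
      printf '%s\n' "full-audit" "quick-check"
      ;;
    full-audit)
      cat <<'EOF'
[
  {"subject": "Extract data",     "activeForm": "Extracting data",    "description": "Run extract.sh against the project"},
  {"subject": "Validate results", "activeForm": "Validating results", "description": "Check extracted data for errors"}
]
EOF
      ;;
    quick-check)
      cat <<'EOF'
[
  {"subject": "Run quick check", "activeForm": "Running quick check", "description": "Single-pass validation"}
]
EOF
      ;;
    *)
      echo "Usage: task-manifest.sh <workflow> | --list" >&2
      return 2
      ;;
  esac
}

task_manifest "${1:---list}"
```

Each object's `subject`, `activeForm`, and `description` map one-to-one onto TaskCreate parameters, so the orchestrator can pipe the array straight into its checklist.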
Key rules:
- Each workflow is a `case` branch emitting a JSON array
- `--list` returns machine-readable workflow names; `--help` shows usage
- Mark tasks `in_progress` before starting, `completed` after, `deleted` on abort

```bash
~/.claude/skills/skill-authoring/scripts/generate-task-manifest.sh \
  --skill-dir /path/to/my-skill \
  --workflows "full-audit:5,quick-check:2"
```

- `--help`, error handling, exit codes
- `scripts/task-manifest.sh` for each workflow
- version `1.0.0`

Ask: "Can this task be split into independent subtasks that run in parallel?"
| Answer | Approach | Example |
|---|---|---|
| Yes — N independent items | Fan-out: one agent per item, orchestrator collects | catalog-url-validator: 3 parallel agents, one per catalog |
| Yes — N independent concerns | Fan-out by concern: one agent per dimension | pr-review-toolkit: parallel reviewers for code, tests, errors, types |
| Partially — phases with parallel steps | Phased: script → parallel agents → script | catalog-maintainer: backup → fetch → enrich → validate → embed |
| No — single sequential task | No orchestrator needed; single agent or script | catalog-embedding-sync: one script checks all |
Ask: "Can the skill's core action be expressed as a deterministic check or procedure?"
| Answer | Approach | Example |
|---|---|---|
| Yes — fully deterministic | Write script first, SKILL.md is thin wrapper | catalog-embedding-sync: script checks timestamps + counts |
| Mostly — with some judgement | Script handles the deterministic parts, agents handle judgement | catalog-url-validator: script extracts URLs, agents verify content |
| No — primarily decision guidance | Prose-first SKILL.md, no script needed | catalog-field-lifecycle: decision tree for field preservation |
Why script-first wins: A 140-line script replaces ~60 lines of prose in SKILL.md while being testable, runnable standalone, and composable with CI/hooks. The SKILL.md drops from "explain everything" to "explain when/why + point to script".
Ask: "Does this skill have 3+ sequential phases or take >2 minutes?"
| Answer | Approach | Example |
|---|---|---|
| Yes — 3+ phases | Add scripts/task-manifest.sh with one entry per phase | review-dependabot-prs: 8 tasks across triage→apply→test→deploy |
| Yes — multiple workflows | Add one case branch per workflow | github-issue-triage: full-audit (5 tasks) + quick-check (2 tasks) |
| No — 1-2 fast phases | Skip task manifest — no tracking needed | catalog-embedding-sync: single script, <30 seconds |
Every skill with scripts MUST be dry-run tested against real project data before release.
Scripts that look correct often fail due to: regex format mismatches, grep pipe chains
where head causes SIGPIPE, find including coverage/build artifacts, and classification
heuristics that misfire on edge cases.
Procedure:
- Run the script against real project data (e.g. with `--json`); scan for zero counts,
  unexpected items, misclassified entries, empty sections
- Compare `--json` output against the human-readable output — verify field values match

Common catches: `grep -rlq ... | head -1` always exits 0 because `-q` produces no output;
`find` without `-not -path "*/coverage/*"` includes test artifacts; a regex assumes one
heading format but the project uses another (e.g., `### Sub-Task` vs `### AC1:`).
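The two mechanical catches can be guarded like this (paths and sample data are illustrative):

```shell
#!/usr/bin/env bash
set -eu

# find: exclude coverage/build artifacts explicitly
mkdir -p /tmp/dryrun-demo/src /tmp/dryrun-demo/coverage
touch /tmp/dryrun-demo/src/a.md /tmp/dryrun-demo/coverage/b.md

files=$(find /tmp/dryrun-demo -name '*.md' -not -path '*/coverage/*')
echo "found: $files"   # only the src/ file survives the filter

# grep -c: guard the exit-1-on-zero-matches behavior
hits=$(printf 'x\ny\n' | grep -c 'z' || true)
echo "hits: $hits"     # reports 0 without killing the script
```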
| Found | Action |
|---|---|
| Nothing related | Create new |
| Same trigger + same fix | Update existing (minor version bump) |
| Same trigger, different root cause | Create new, add See Also links both ways |
| Same domain, different trigger | Update existing with new variant subsection |
| Stale or wrong | Deprecate in Notes, create replacement |
### Simple Skill

````markdown
---
name: descriptive-kebab-name
description: "Third-person description. Use when: (1) ..., (2) ..., (3) .... Covers: topic1, topic2."
metadata:
  version: 1.0.0
---

# Skill Title

## Problem
[2-3 sentences max.]

## Quick Check
```bash
./scripts/check.sh        # Report only
./scripts/check.sh --fix  # Auto-remediate
```

[Decision guidance for non-scripted parts.]
[Cross-references to related skills.]
````
### Complex Skill (orchestrator + parallel agents + scripts + task tracking)
````markdown
---
name: descriptive-kebab-name
description: "Third-person description. Use when: (1) ..., (2) .... Covers: orchestration, parallel agents, topic."
metadata:
  version: 1.0.0
---

# Skill Title

## Problem
[2-3 sentences max.]

## Quick Check
```bash
./scripts/extract.sh --summary       # Pre-processing (deterministic)
./scripts/task-manifest.sh full-run  # Task checklist for full workflow
```

Create task checklist from `scripts/task-manifest.sh full-run` before starting.
Mark `in_progress` → `completed` per phase. On abort, mark remaining `deleted`.

```bash
MANIFEST=$(./scripts/extract.sh --json)
```

Launch N parallel general-purpose agents via the Task tool — one per .
Each agent receives its slice of data and saves results to `/tmp/<skill>-report-<domain>.json`.

```bash
./scripts/apply-fixes.sh --all --dry-run  # Preview
./scripts/apply-fixes.sh --all            # Apply
npm run validate                          # Or whatever validation command applies
```

- `orchestrator-agent.md` — pure orchestrator, delegates everything
- `specialist-agent.md` — focused sub-agent, NOT user-invocable

[Cross-references to related skills.]
````
## Quality Checklist
**Decomposition & agents:**
- [ ] Decomposition evaluated: can this be split into parallel sub-agents?
- [ ] If yes: orchestrator defined as pure delegator (never does the work itself)
- [ ] Independent agents launched in SINGLE Task tool message (not sequentially)
- [ ] Each sub-agent has single focused responsibility
- [ ] Sub-agents use appropriate model tier (haiku/sonnet/opus)
- [ ] Agent definitions include structured output format
**Scripts & determinism:**
- [ ] Script-first evaluated: can deterministic parts be captured in scripts?
- [ ] If yes: script written first, SKILL.md references it (not duplicates it)
- [ ] Scripts have `--help` and `--fix`/`--dry-run` support and are executable
- [ ] Scripts dry-run tested against real project data (2-3 varied inputs)
**Progress tracking:**
- [ ] Progress tracking evaluated: does the skill have 3+ sequential phases?
- [ ] If yes: `scripts/task-manifest.sh` created with one `case` per workflow
- [ ] Each workflow defines tasks with `subject`, `activeForm`, `description`
- [ ] SKILL.md includes "Progress Tracking (MANDATORY)" section with task table
- [ ] Task update rules documented (in_progress → completed, abort → deleted)
**Teams (optional):**
- [ ] Team mode evaluated: does this skill have multi-phase feedback loops?
- [ ] If yes: team pattern documented alongside sub-agent pattern
- [ ] Conditional check for `CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS` documented
- [ ] TeamDelete cleanup documented in workflow
**Structure & content:**
- [ ] SKILL.md body ≤ 500 lines
- [ ] Description ≤ 1024 chars, third person, with trigger conditions, double-quoted single-line
- [ ] Frontmatter has only `name`, `description`, `metadata`
- [ ] References are one level deep from SKILL.md
- [ ] All cross-reference links resolve to existing files
## Anti-Patterns
- **Sequential when parallel is possible** — if agents don't depend on each other's
output, launch them in a SINGLE message. Sequential = N x latency for no reason.
- **Monolithic agent** — one agent doing 5 things. Split into 5 focused agents.
- **Orchestrator doing work** — the orchestrator should delegate, not fetch/validate/enrich.
- **Missing structured output** — agents returning prose instead of parseable JSON/reports
forces the orchestrator to guess at results.
- **Verbose explanations** — Claude knows what PDFs are. Skip the intro paragraph.
- **Too many options** — provide a default with escape hatch, not 5 alternatives.
- **Deeply nested references** — SKILL.md → ref.md → detail.md causes partial reads.
- **Time-sensitive info** — "After August 2025, use X" becomes stale. Use "Current method" / "Legacy" sections.
- **Inconsistent terminology** — pick one term ("endpoint" not alternating "URL/route/path").
- **Non-standard frontmatter** — `author`, `date`, `tags` waste tokens and aren't used.
- **Incomplete CLI templates in agents** — when agents create GitHub artifacts (`gh issue
create`, `gh pr create`), include ALL metadata flags (`--label`, `--assignee`,
`--milestone`) explicitly in the template. Agents improvise missing fields with
plausible-but-wrong values (e.g., `dependencies` label instead of project's `dependabot`
label). Include a selection guide for dynamic fields like priority labels.
- **Silent long-running workflows** — skills with 3+ phases that don't use TaskCreate leave
users staring at a spinner for minutes with no visibility. Always include a task manifest
and update tasks between phases. If a sub-agent takes >30 seconds, the user should see
which task is `in_progress`.
- **Untested scripts shipped as "done"** — scripts that pass code review but fail on real
data. Always dry-run against the current project with varied inputs before declaring
complete. Bugs cluster: if one heuristic is wrong, test the others too.
- **Using teams for one-shot parallel work** — teams add overhead (shared task list, message
routing). For independent fan-out tasks, sub-agents are faster and cheaper.
## Optimizing Existing Skills
When a skill exceeds 500 lines, has poor structure, or runs slowly:
1. **Audit for parallelization** — identify independent subtasks that could be agents
2. **Decompose monolithic agents** — split "does everything" agents into focused specialists
3. **Extract scripts/** — move deterministic procedures out of agent prompts
4. **Extract references/** — move lookup tables and code examples
5. **Replace inline content** with links:
`See **[references/your-file.md](references/your-file.md)** for the full lookup table.`
6. **Validate cross-references** — grep for `references/` and `scripts/` links, verify all exist
7. **Fix stale cross-references** — search for skill names that no longer exist
8. **Bump version** — minor for content reorganization, major for agent architecture changes
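Step 6 can be sketched as a self-contained check; the link pattern and demo files below are assumptions for illustration, not the skill's actual script:

```shell
#!/usr/bin/env bash
# Verify every references/ and scripts/ path mentioned in SKILL.md exists
set -eu

# Demo skill with one resolving link and one stale link
cd "$(mktemp -d)"
mkdir -p references scripts
touch references/examples.md
cat > SKILL.md <<'EOF'
See [references/examples.md](references/examples.md) and run scripts/check.sh.
EOF

missing=0
for path in $(grep -oE '(references|scripts)/[A-Za-z0-9._-]+' SKILL.md | sort -u); do
  if [ ! -e "$path" ]; then
    echo "STALE: $path"
    missing=$((missing + 1))
  fi
done
echo "missing=$missing"
```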
## See Also
- `claudeception` — when to extract knowledge into skills (the WHY/WHEN)
- Anthropic docs: [Skill authoring best practices](https://platform.claude.com/docs/en/agents-and-tools/agent-skills/best-practices)
- GitHub: [skill-authoring](https://github.com/abhattacherjee/claude-code-skills/tree/main/skill-authoring) — install instructions, changelog, and releases