From agents
Create, improve, and audit AI agent skills. 14 structural patterns, deterministic scoring. Use when building or reviewing skills. NOT for agents, MCP servers, or running skills.
npx claudepluginhub wyattowalsh/agents --plugin agentsThis skill uses the workspace's default tool permissions.
Create, improve, and audit AI agent skills. Every skill follows 14 proven structural patterns.
evals/evals.jsonreferences/audit-guide.mdreferences/best-practices.mdreferences/evaluation-rubric.mdreferences/frontmatter-spec.mdreferences/packaging-guide.mdreferences/proven-patterns.mdreferences/refinement-plan.mdreferences/workflow.mdreports/00-broad-sweep-matrix.mdreports/01-standalone-plan.mdscripts/_shared.pyscripts/audit.pyscripts/package.pyscripts/progress.pyscripts/verify.pytemplates/dashboard.htmlCreates isolated Git worktrees for feature branches with prioritized directory selection, gitignore safety checks, auto project setup for Node/Python/Rust/Go, and baseline verification.
Executes implementation plans in current session by dispatching fresh subagents per independent task, with two-stage reviews: spec compliance then code quality.
Dispatches parallel agents to independently tackle 2+ tasks like separate test failures or subsystems without shared state or dependencies.
Create, improve, and audit AI agent skills. Every skill follows 14 proven structural patterns.
Scope: Skills only. NOT for creating agents (wagents new agent), building MCP servers (/mcp-creator), or running existing skills. This repo uses raw SKILL.md format committed directly to skills/.
| $ARGUMENTS | Action | Example |
|---|---|---|
create <name> / new <name> | Develop (new) | /skill-creator create my-analyzer |
create <name> --from <source> | Develop (new, from exemplar) | /skill-creator create my-analyzer --from wargame |
improve <name> / improve <path> | Develop (existing) | /skill-creator improve add-badges |
plan <name> / plan <path> | Plan (existing) | /skill-creator plan honest-review |
plan --all / plan repo | Plan (repo-wide) | /skill-creator plan --all |
audit <name> | Audit | /skill-creator audit honest-review |
audit --all | Audit All | /skill-creator audit --all |
dashboard | Dashboard | /skill-creator dashboard |
package <name> / package --all | Package | /skill-creator package wargame |
| Natural language skill idea | Auto: Develop (new) | "tool that audits Python type safety" |
| Skill name + modification verb | Auto: Develop (existing) | "refactor the wargame skill" |
| Path to SKILL.md | Auto: Develop (existing) | skills/wargame/SKILL.md |
| "MCP server" / "agent" / "run" | Refuse + redirect | — |
| Empty | Gallery | /skill-creator |
If no explicit mode keyword is provided:
SKILL.md or directory under skills/ → Develop (existing)--from <source> in arguments → Develop (new, from exemplar)wagents new skill <name> # Scaffold from template
wagents validate # Check all skills
wagents eval validate # Check eval manifests after eval changes
uv run python skills/skill-creator/scripts/audit.py skills/<name>/ # Score quality
wagents package <name> --dry-run # Check single-skill portability before packaging
Unified process for creating new skills and improving existing ones. Load references/workflow.md for the full procedure.
| Step | New Skill | Existing Skill |
|---|---|---|
| 1. Understand | Define use cases, scope, patterns | Audit + understand user's intent |
| 2. Plan | Structure, description, frontmatter | Gap analysis + improvement plan (approval gate) |
| 3. Scaffold | wagents new skill <name> | Skip |
| 4. Build | Write/edit body, references, scripts, templates, evals | Same |
| 5. Validate | wagents validate + wagents eval validate + audit.py | Same |
| 6. Iterate | Test, identify issues, loop to Step 4 | Same |
Use plan <name> for an existing-skill refinement plan without editing and plan --all or plan repo for a ranked repo-wide planning pass.
Required planning output:
For repo-wide planning, produce a ranked queue plus one standalone refinement plan per promoted skill or skill cluster. Do not edit any skill until the user approves the plan.
Load references/refinement-plan.md when producing the standalone refinement-plan packet.
Score a skill using deterministic analysis + AI review. Load references/audit-guide.md.
Comparative ranking of all repository skills. Load references/audit-guide.md § Audit All.
Render visual creation process monitor or audit quality dashboard. Load references/audit-guide.md § Dashboard.
Auto-detects mode from data: phases field → process monitor; skills array → audit overview.
Present skill inventory with scores and available actions.
Run uv run python skills/skill-creator/scripts/audit.py --all --format table, display results, offer mode menu.
Package skills into portable ZIP files for Claude Code Desktop import. Load references/packaging-guide.md for ZIP structure, manifest schema, portability checks, and cross-agent compatibility.
wagents package <name> --dry-run # Check a single skill before emitting a ZIP
wagents package <name> # Single skill → <name>-v<version>.skill.zip
wagents package --all # All skills → dist/ with manifest.json
wagents package --all --dry-run # Check portability without creating ZIPs
PreToolUse hooks intercept tool calls during skill execution. The hooks: frontmatter field scopes hooks to this skill only — they activate when the skill is loaded and deactivate when it completes.
Post-edit enforcement for this skill:
SKILL.md edits trigger uv run wagents validateevals/*.json edits trigger uv run wagents eval validateuv run wagents hooks validateStop hook enforcement:
uv run python skills/skill-creator/scripts/verify.py stopstop_hook_active: true to avoid recursive Stop-hook loopsCreation progress persists at ~/.{gemini|copilot|codex|claude}/skill-progress/<name>.json. Read/write via scripts/progress.py. Survives session restarts. Use --state-dir to override the default location.
| File | Content | Read When |
|---|---|---|
references/workflow.md | Unified 6-step skill development process for new and existing skills | Develop (new), Develop (existing) |
references/refinement-plan.md | Standalone refinement-plan contract for existing-skill and repo-wide planning output | Plan (existing), Plan (repo-wide) |
references/audit-guide.md | Audit procedure, Audit All, Dashboard rendering, Gallery, grade thresholds | Audit, Audit All, Dashboard, Gallery |
references/proven-patterns.md | 14 structural patterns with examples from repo skills | Step 4 (Build), gap analysis |
references/best-practices.md | Anthropic guide + superpowers methodology + cross-agent awareness | Step 2 (Plan), Step 4 (Build), description writing |
references/frontmatter-spec.md | Full field catalog, invocation matrix, decision tree | Step 3 (Scaffold), frontmatter configuration |
references/packaging-guide.md | ZIP structure, manifest schema, portability checks, import instructions | Package |
references/evaluation-rubric.md | 13 weighted scoring dimensions normalized to 100, grade thresholds, pressure testing | Audit (pressure testing), scoring targets |
Read reference files as indicated by the "Read When" column above. Do not rely on memory or prior knowledge of their contents.
Conciseness is respect — The context window is shared. Every line competes with the agent's working memory. Earn every line or delete it.
Progressive disclosure — Frontmatter for discovery (~100 tokens), body for dispatch (<5K tokens), references for deep knowledge (on demand), scripts/templates for execution (never loaded).
Self-exemplar — This skill follows every pattern it teaches. When in doubt, look at how skill-creator applies it.
uv run wagents validate before declaring any skill completeuv run wagents eval validate after changing evals and before declaring the skill completeuv run python skills/skill-creator/scripts/audit.py after every significant SKILL.md changewagents docs generate — delegate to docs-stewardplan <name> and plan --all are read-only planning modes — never edit during planningstop_hook_active guard — recursive hook loops are implementation bugsCanonical terms (use these exactly throughout):