From plugin-creator
Guides creation of new Claude Code skills from scratch, listing existing user/project skills, samples, current directory, and providing metadata templates for descriptions/tags.
npx claudepluginhub jamie-bitflight/claude_skills --plugin plugin-creatorThis skill uses the workspace's default tool permissions.
**Existing user-level skills:**
LICENSE.txtagents/analyzer.mdagents/comparator.mdagents/grader.mdassets/eval_review.htmleval-viewer/generate_review.pyeval-viewer/viewer.htmlreferences/agent-plugin-ecosystem.mdreferences/ai-audience-writing-rules.mdreferences/authoring-checklist.mdreferences/claude-code-skills-official.mdreferences/evaluation-and-optimization.mdreferences/executable-scripts.mdreferences/frontmatter-experiments/CLAUDE.mdreferences/frontmatter-experiments/E1-2026-02-14-allowed-tools-behavior.mdreferences/instruction-design.mdreferences/schemas.mdreferences/workflows.mdscripts/__init__.pyscripts/aggregate_benchmark.pyGuides developers in creating, improving, and organizing SKILL.md files for Claude Code plugins, covering frontmatter, triggers, structure, progressive disclosure, hooks, budgets, and best practices.
Guides creation of Claude Code skills with principles for conciseness, degrees of freedom, SKILL.md structure, and bundling scripts/references/assets.
Guides creation of effective, portable Claude skills including structure, frontmatter, progressive disclosure, context budgeting, degrees of freedom, and packaging.
Share bugs, ideas, or general feedback.
Existing user-level skills:
!python3 -c "import os, pathlib; home = pathlib.Path.home(); skills = home / '.claude' / 'skills'; print('\\n'.join(sorted([d.name for d in skills.iterdir() if d.is_dir()])[:20]) if skills.exists() else 'No user-level skills found')" 2>/dev/null || echo "No user-level skills found"
Existing project-level skills:
!python3 -c "import os, pathlib; skills = pathlib.Path('.claude/skills'); print('\\n'.join(sorted([d.name for d in skills.iterdir() if d.is_dir()])[:20]) if skills.exists() else 'No project-level skills found')" 2>/dev/null || echo "No project-level skills found"
Sample skill descriptions (for pattern reference):
!python3 -c "import pathlib, re; dirs = [pathlib.Path.home() / '.claude' / 'skills', pathlib.Path('.claude/skills')]; descs = []; [descs.extend([line.strip() for line in (d / 'SKILL.md').read_text(encoding='utf-8', errors='ignore').splitlines() if line.strip().startswith('description:')][:1]) for base in dirs if base.exists() for d in base.iterdir() if d.is_dir() and (d / 'SKILL.md').exists()]; print('\\n'.join(descs[:10]) if descs else 'No skill descriptions found')" 2>/dev/null || echo "No skill descriptions found"
Current directory:
!python3 -c "import os; print(os.getcwd())" 2>/dev/null || echo "Unable to determine current directory"
This skill provides guidance for creating effective skills.
Skills are modular, self-contained packages that extend Claude's capabilities by providing specialized knowledge, workflows, and tools. Think of them as "onboarding guides" for specific domains or tasks—they transform Claude from a general-purpose agent into a specialized agent equipped with procedural knowledge that no model can fully possess.
This skill is for creating NEW skills from scratch. For refactoring EXISTING skills (splitting oversized skills, reorganizing multi-domain skills), use the skill-refactor skill:
Skill(skill: "plugin-creator:refactor-skill")
When to use skill-creator vs skill-refactor:
flowchart TD
Start(["Skill task received"]) --> Q{"Is there an existing skill<br>to modify or split?"}
Q -->|"No — creating from scratch<br>or from requirements"| Creator["Use skill-creator<br>(this skill)"]
Q -->|"Yes — existing skill exceeds<br>warning threshold (SK006/SK007)<br>or covers multiple domains"| Refactor["Use skill-refactor<br>Skill(skill: 'plugin-creator:refactor-skill')"]
Creator --> Together(["Both can combine — create with<br>skill-creator, refactor later<br>with skill-refactor as needs evolve"])
Refactor --> Together
Add automated doc updater when skill wraps external docs (API specs, frameworks, CLI refs) that change regularly. Self-maintaining pipeline: download → process → index upstream docs.
Trigger: Skill provides access to documentation that updates over time.
Add after skill creation: /plugin-creator:add-doc-updater <skill-path>
Candidates: GitLab CI docs, CLI tools (glab, gh, kubectl), frameworks (React, Django), API specs (OpenAPI)
The context window is a public good. Skills share the context window with everything else Claude needs: system prompt, conversation history, other Skills' metadata, and the actual user request.
Default assumption: Claude is already very smart. Only add context Claude doesn't already have. Challenge each piece of information: "Does Claude really need this explanation?" and "Does this paragraph justify its token cost?"
Prefer concise examples over verbose explanations.
Skills use progressive disclosure to manage context efficiently:
<available_skills> block (~100 words per skill)Budget Constraints:
| Resource | Limit | Notes |
|---|---|---|
name field | 64 chars | Lowercase, numbers, hyphens only |
description field | 1024 chars | Critical for skill selection |
<available_skills> block | 2% of context window (fallback 16,000 chars) | Scales dynamically; separate from global context |
| Skills before truncation | ~34-36 | Varies by description complexity and length |
Truncation Behavior:
When total skill metadata exceeds the budget (2% of context window, fallback 16,000 characters):
<available_skills> block/skill-name/context to check for a warning about excluded skillsTo increase the limit, set the SLASH_COMMAND_TOOL_CHAR_BUDGET environment variable.
Fallback Strategy:
If you have many skills, embed pointers in CLAUDE.md as a safeguard:
## Skills Available
- For debugging: use `/scientific-method:scientific-thinking` skill
- For delegation: use `/delegate` skill
This ensures Claude can find skills even if truncated from <available_skills>.
CRITICAL YAML BUG:
Do NOT use YAML multiline indicators in descriptions:
# WRONG - will show ">-" as description
description: >-
This is a multiline description that breaks.
# WRONG - same problem
description: |-
This breaks too.
# CORRECT - single-line string; quote only when YAML syntax requires it (colons, leading special chars, boolean literals)
description: This works correctly without quotes when no special characters are present.
description: 'This works when the value contains colons or other YAML-special characters.'
The Claude Code skill indexer does not parse YAML multiline indicators correctly - the description appears as ">-" instead of actual content.
SOURCE: ../claude-skills-overview-2026/SKILL.md section on Skill Tokenomics.
Match the level of specificity to the task's fragility and variability:
High freedom (text-based instructions): Use when multiple approaches are valid, decisions depend on context, or heuristics guide the approach.
Medium freedom (pseudocode or scripts with parameters): Use when a preferred pattern exists, some variation is acceptable, or configuration affects behavior.
Low freedom (specific scripts, few parameters): Use when operations are fragile and error-prone, consistency is critical, or a specific sequence must be followed.
Think of Claude as exploring a path: a narrow bridge with cliffs needs specific guardrails (low freedom), while an open field allows many routes (high freedom).
Every skill consists of a required SKILL.md file and optional bundled resources:
skill-name/
├── SKILL.md (required)
│ ├── YAML frontmatter metadata (required)
│ │ └── name: (recommended — if omitted, uses directory name; required per agentskills.io spec)
│ │ └── description: (recommended)
│ └── Markdown instructions (required)
└── Bundled Resources (optional)
├── scripts/ - Executable code (Python/Bash/etc.)
├── references/ - Documentation intended to be loaded into context as needed
└── assets/ - Files used in output (templates, icons, fonts, etc.)
Every SKILL.md consists of:
name, description, argument-hint, allowed-tools, model, context, user-invocable, disable-model-invocation, and hooks. The description field (or first paragraph if omitted) is what Claude reads to determine when the skill gets used, thus it is very important to be clear and comprehensive in describing what the skill is and when it should be used.scripts/)Executable code (Python/Bash/etc.) for tasks that require deterministic reliability or are repeatedly rewritten.
scripts/rotate_pdf.py for PDF rotation tasksreferences/)Documentation and reference material intended to be loaded as needed into context to inform Claude's process and thinking.
references/finance.md for financial schemas, references/mnda.md for company NDA template, references/policies.md for company policies, references/api_docs.md for API specificationsassets/)Files not intended to be loaded into context, but rather used within the output Claude produces.
assets/logo.png for brand assets, assets/slides.pptx for PowerPoint templates, assets/frontend-template/ for HTML/React boilerplate, assets/font.ttf for typographyA skill should only contain essential files that directly support its functionality. Do NOT create extraneous documentation or auxiliary files, including:
The skill should only contain the information needed for an AI agent to do the job at hand. It should not contain auxilary context about the process that went into creating it, setup and testing procedures, user-facing documentation, etc. Creating additional documentation files just adds clutter and confusion.
Add context: fork to frontmatter when you want a skill to run in isolation without access to conversation history.
When to use:
When NOT to use:
Agent types:
context: fork
agent: Explore # or Plan, general-purpose, custom-agent-name
| Agent | Model | Tools | Use Case |
|---|---|---|---|
Explore | Haiku | File/web/MCP (read-only) | Verbatim retrieval only — never analysis or reasoning (~50% hallucination rate on reasoning tasks) |
Plan | Inherits | File/web/MCP (read-only) | Research before planning |
general-purpose | Inherits | File/web/MCP + Bash/system | Complex operations (default) |
Tool restrictions:
context: fork)SOURCE: ../claude-skills-overview-2026/SKILL.md section on Context Fork Behavior.
Control who can invoke your skill:
flowchart TD
Start(["Choose invocation mode for skill"]) --> Q{"Who should be able<br>to invoke this skill?"}
Q -->|"Both user and Claude<br>(default behavior)"| Default["Default — no frontmatter flag needed<br>User types /skill-name<br>Claude loads automatically when relevant<br>Description always in context"]
Q -->|"User only — has side effects<br>such as deploy or send-slack-message"| Manual["Set disable-model-invocation: true<br>Only user can invoke with /skill-name<br>Claude cannot load automatically<br>Description NOT in Claude's context<br>Reason — you control timing;<br>Claude won't deploy just because code looks ready"]
Q -->|"Claude only — background knowledge<br>not a meaningful user action"| Background["Set user-invocable: false<br>Only Claude can invoke (automatically when relevant)<br>Not shown in / autocomplete menu<br>Description always in context<br>Full skill loads when Claude activates it<br>Example use — /legacy-system-context"]
Default --> Done(["Invocation mode configured"])
Manual --> Done
Background --> Done
SOURCE: ../claude-skills-overview-2026/SKILL.md section on Invocation Control.
Skills can define hooks in frontmatter to respond to events during the skill's lifecycle:
Events:
PreToolUse - Before tool executesPostToolUse - After successful executionStop - When skill finishesExample:
hooks:
PreToolUse:
- matcher: "Bash" # Regex pattern matching tool name
hooks:
- type: command
command: "./scripts/check.sh"
once: true # Run only once per session
PostToolUse:
- matcher: "Write|Edit"
hooks:
- type: command
command: "./scripts/lint.sh"
Stop:
- hooks:
- type: command
command: "./scripts/cleanup.sh"
Hook I/O:
Complete documentation: Use Skill(skill: "plugin-creator:hooks-guide") for all events, matchers, JSON output control, and examples.
SOURCE: Skill /claude-skills-overview-2026
Skills use a three-level loading system to manage context efficiently:
Keep SKILL.md lean. Run uvx skilllint@latest check <skill-path> to check token complexity. Keep only core workflow and selection guidance in SKILL.md; move variant-specific details into reference files. Reference them from SKILL.md with clear descriptions of when to read each file.
Three patterns: (1) high-level guide with pointers to FORMS.md, REFERENCE.md, etc.; (2) domain-split references (finance.md, sales.md per domain); (3) conditional details (basic inline, advanced via link). Load workflows.md for full examples of all three patterns.
Rules: keep references one level deep from SKILL.md. NEVER add ToC, anchor links, or bold/italic for visual emphasis to reference files — Load ai-audience-writing-rules.md.
Editing an existing SKILL.md? Before treating an unrecognized frontmatter key as an error, check
plugins/plugin-creator/scripts/ecosystem_registry.py. If the key is returned byget_ecosystem_owned_keys()— such asmcp:(OpenCode) — preserve it and all its nested content verbatim. Do not strip, rewrite, or normalize it. Formcp:specifically, see thereferences/agent-plugin-ecosystem.mdreference (OpenCode SKILL.md Extensions section) for the full schema.
flowchart TD
S1["Step 1 — Understand the skill<br>with concrete examples"] --> S2
S2["Step 2 — Plan reusable skill contents<br>(scripts, references, assets)"] --> S3
S3["Step 3 — Determine skill location<br>and distribution strategy"] --> S4
S4["Step 4 — Initialize the skill<br>(run init_skill.py — MANDATORY)"] --> S5
S5["Step 5 — Edit the skill<br>(implement resources and write SKILL.md)"] --> S6Q
S6Q{"Distributing via plugin<br>marketplace?"}
S6Q -->|"Yes — plugin distribution"| S6["Step 6 — Package the skill<br>(validate then package)"]
S6Q -->|"No — project or user level<br>already in final location"| S7
S6 --> S7
S7["Step 7 — Define test cases<br>(evals/evals.json)"] --> S8
S8["Step 8 — Run and evaluate<br>(A/B harness, grading, viewer)"] --> S9
S9["Step 9 — Improve the skill<br>(targeted fixes, re-run tests)"] --> S9Q
S9Q{"Improvements<br>sufficient?"}
S9Q -->|"No — regressions or failures"| S8
S9Q -->|"Yes"| S10
S10["Step 10 — Description optimization<br>(automated trigger tuning)"] --> Done(["Skill creation complete"])
Follow these steps in order, skipping only if there is a clear reason why they are not applicable.
Skip this step only when the skill's usage patterns are already clearly understood. It remains valuable even when working with an existing skill.
To create an effective skill, clearly understand concrete examples of how the skill will be used. This understanding can come from either direct user examples or generated examples that are validated with user feedback.
For example, when building an image-editor skill, relevant questions include:
To avoid overwhelming users, avoid asking too many questions in a single message. Start with the most important questions and follow up as needed for better effectiveness.
Conclude this step when there is a clear sense of the functionality the skill should support.
To turn concrete examples into an effective skill, analyze each example by:
Example: When building a pdf-editor skill to handle queries like "Help me rotate this PDF," the analysis shows:
scripts/rotate_pdf.py script would be helpful to store in the skillExample: When designing a frontend-webapp-builder skill for queries like "Build me a todo app" or "Build me a dashboard to track my steps," the analysis shows:
assets/hello-world/ template containing the boilerplate HTML/React project files would be helpful to store in the skillExample: When building a big-query skill to handle queries like "How many users have logged in today?" the analysis shows:
references/schema.md file documenting the table schemas would be helpful to store in the skillTo establish the skill's contents, analyze each concrete example to create a list of the reusable resources to include: scripts, references, and assets.
flowchart TD
Start(["Determine where the skill will live"]) --> Q{"Is the target location<br>already known?"}
Q -->|"No — location is unclear"| Ask["STOP — ask the user:<br>'Where should this skill be created?<br>(1) Plugin for marketplace distribution,<br>(2) Project-level (.claude/skills/) for team sharing via git,<br>(3) User-level (~/.claude/skills/) for personal use?'"]
Ask --> Q2{"User has answered<br>location question"}
Q -->|"Yes — location known"| Q2
Q2 -->|"Plugin distribution<br>(public or team marketplace)"| Plugin["Location: plugins/*/skills/<br>Packaging step 6 IS required<br>Namespace: plugin-name:skill-name<br>(does not conflict with other levels)"]
Q2 -->|"Project-level<br>(team sharing via git)"| Project["Location: .claude/skills/<br>Skip packaging step 6 entirely<br>Already in final location"]
Q2 -->|"User-level<br>(personal use across projects)"| User["Location: ~/.claude/skills/<br>Skip packaging step 6 entirely<br>Already in final location"]
Plugin --> Priority["Location priority when skills share same name:<br>managed/enterprise > user > project<br>Plugin skills use plugin-name:skill-name namespace"]
Project --> Priority
User --> Priority
Priority --> Discovery["Note: Claude Code auto-discovers skills<br>from nested .claude/skills/ directories<br>(supports monorepo setups)"]
Discovery --> Done(["Location decided — proceed to Step 4"])
SOURCE: ../claude-skills-overview-2026/SKILL.md section on Directory Structure and Location Priority.
CRITICAL: This is the MANDATORY first step for creating ANY skill. Do NOT skip this step.
For ALL skills (plugin, project, and user):
Run the init_skill.py script. This script generates a complete template skill directory with:
scripts/, references/, assets/)Usage:
# The script has executable permissions and a shebang - run it directly
${CLAUDE_PLUGIN_ROOT}/skills/skill-creator/scripts/init_skill.py <skill-name> --path <output-directory>
Run scripts using
uv runwhen a shebang invocation fails due to missing dependencies. Ifuvis not available, read.claude/rules/uv-run-fallback.mdfor install instructions and pip fallback procedure.
Examples:
# Plugin skill
${CLAUDE_PLUGIN_ROOT}/skills/skill-creator/scripts/init_skill.py my-new-skill --path plugins/my-plugin/skills
# Project skill
${CLAUDE_PLUGIN_ROOT}/skills/skill-creator/scripts/init_skill.py my-skill --path .claude/skills
# User skill
${CLAUDE_PLUGIN_ROOT}/skills/skill-creator/scripts/init_skill.py my-skill --path ~/.claude/skills
What the script does:
scripts/, references/, assets/ directoriesIMPORTANT: Do NOT manually create skill directories. The script provides essential scaffolding, validation, and examples that manual creation misses.
Only skip if: The skill directory already exists and you're iterating on an existing skill.
After initialization, customize or remove the generated SKILL.md and example files as needed.
When editing the (newly-generated or existing) skill, remember that the skill is being created for another instance of Claude to use. Include information that would be beneficial and non-obvious to Claude. Consider what procedural knowledge, domain-specific details, or reusable assets would help another Claude instance execute these tasks more effectively.
Consult these helpful guides based on your skill's needs:
These files contain established best practices for effective skill design.
To begin implementation, start with the reusable resources identified above: scripts/, references/, and assets/ files. Note that this step may require user input. For example, when implementing a brand-guidelines skill, the user may need to provide brand assets or templates to store in assets/, or documentation to store in references/.
Documentation-wrapping skills: If the skill provides access to external docs (API specs, framework guides, CLI references), add automated updater BEFORE finalizing implementation:
/plugin-creator:add-doc-updater {skill-path}
This creates self-maintaining doc pipeline: download → process → index with cooldown enforcement.
Added scripts must be tested by actually running them to ensure there are no bugs and that the output matches what is expected. If there are many similar scripts, only a representative sample needs to be tested to ensure confidence that they all work while balancing time to completion.
Any example files and directories not needed for the skill should be deleted. The initialization script creates example files in scripts/, references/, and assets/ to demonstrate structure, but most skills won't need all of them.
Orchestrator role in Step 5: write YAML frontmatter only, pass file paths to the sub-agent, verify output after completion.
Writing Guidelines: Always use imperative/infinitive form.
Every sentence in a skill body must be either a command the agent executes or knowledge the agent recalls. Before marking a skill done, scan for these anti-patterns and fix each one:
| Anti-pattern trigger | What it signals | Fix |
|---|---|---|
| "Open a new terminal" / "Click X" / "Navigate to Y" | Action only a human can take | Replace with the equivalent agent action: export PATH=..., source ~/.bashrc. If no direct equivalent is known, research how to achieve the same outcome programmatically — use a subagent if available, otherwise research and test directly. Do not document the human step as a dead end |
| Troubleshooting table with causes and fixes | Unverified guesses presented as facts | Remove entirely unless each row was derived from an actual observed failure — state the observed evidence, not a theory |
| "You should..." / "Consider..." / "It is recommended..." | Passive advice to a human reader | Rewrite as an imperative command or delete |
| Platform-specific steps with no prior environment check | Assumes an environment that may differ | Precede with a detection command (echo $MSYSTEM, uname, etc.) and branch on its output |
| Subprocess check that inherits the parent process's PATH | False positive — finds a tool via parent env, not the target shell's env | Use an environment-isolated check (e.g., [Environment]::GetEnvironmentVariable('PATH','User') reads the Windows registry directly) |
| "Install X, then verify" with no observed output shown | Gives steps without confirming they work | Run the steps, capture the actual output, include that output as the expected result |
These patterns appear when skill content is drafted from training data or general knowledge rather than from executing the steps in the actual environment. The fix in every case is the same: run the command, observe the output, write what you observed.
Write the YAML frontmatter. All fields are optional, but description is strongly recommended:
name: Optional. The skill name (defaults to directory name if omitted). Lowercase letters, numbers, and hyphens only. Max 64 characters.description: Optional but strongly recommended. This is the primary triggering mechanism for your skill, and helps Claude understand when to use the skill. If omitted, uses the first paragraph of markdown content.
>-, |-, |) - they are broken and will display as ">-" instead of your text. Use single-line quoted strings instead.docx skill: "Comprehensive document creation, editing, and analysis with support for tracked changes, comments, formatting preservation, and text extraction. Use when Claude needs to work with professional documents (.docx files) for: (1) Creating new documents, (2) Modifying or editing content, (3) Working with tracked changes, (4) Adding comments, or any other document tasks"argument-hint: Optional. Hint shown during autocomplete to indicate expected arguments. Example: [issue-number] or [filename] [format].allowed-tools: Optional. Lists tools Claude can use without asking permission when this skill is active, and restricts Claude to only those tools. Also reduces context size by limiting included tool definitions. When omitted, the skill inherits all tool capabilities from the parent agent. (comma-separated). Example: Read, Grep, Glob, Bash(npm run:*)model: Optional. Model to use when this skill is active. Options: claude-opus-4-5-20251101, claude-sonnet-4-20250514, opus, sonnet, haikucontext: Optional. Set to fork to run in a forked subagent context for isolation. See advanced patterns below.agent: Optional. Which subagent type to use when context: fork is set. Options: Explore, Plan, general-purpose, or custom agent name.user-invocable: Optional. Set to false to hide from the / menu. Use for background knowledge users shouldn't invoke directly. Default: true.disable-model-invocation: Optional. Set to true to prevent Claude from automatically loading this skill. Use for workflows you want to trigger manually with /name. Default: false. Note: This field has no effect in -p (headless/Agent SDK CLI) mode — ALL skill invocations via /skill-name are unavailable in -p mode regardless of this setting. Skills used in automation must embed their full workflow in the prompt. See ../claude-skills-overview-2026/resources/headless-agent-sdk.md.hooks: Optional. Hooks scoped to this skill's lifecycle. See hooks documentation for configuration format.Multi-runtime scaffold — when a skill targets multiple runtimes, combine portable fields with runtime-specific extensions. Fields not recognized by a runtime are silently ignored:
---
name: my-skill
description: Does something useful
mcp:
server-name:
command: npx
args: ["-y", "some-mcp-package"]
---
Here mcp: is an OpenCode-only extension — Claude Code ignores it. Use this pattern to ship a single SKILL.md that works on both runtimes without branching.
Complete field reference: See ../claude-skills-overview-2026/SKILL.md for definitive schema documentation, or the references/claude-code-skills-official.md for the authoritative source specification.
Write instructions for using the skill and its bundled resources.
Advanced Body Features:
String Substitutions - Skills support dynamic value replacement:
$ARGUMENTS - All arguments passed when invoking the skill$ARGUMENTS[N] or $N - Specific argument by 0-based index (e.g., $0, $1)${CLAUDE_SESSION_ID} - Current session ID for logging or session-specific filesExample:
---
name: migrate-component
description: Migrate a component from one framework to another
---
Migrate the $0 component from $1 to $2.
Preserve all existing behavior and tests.
Running /migrate-component SearchBar React Vue replaces $0 with SearchBar, $1 with React, and $2 with Vue.
Dynamic Context Injection - Preprocess shell commands before skill content reaches Claude:
Example (summarize GitHub PR):
---
name: pr-summary
description: Summarize GitHub pull request changes
---
Pull Request Data:
\`\!\`gh pr view $0 --json title,body,commits\`\`\`
Create a summary of the changes above.
Extended Thinking Mode - Include the word "ultrathink" anywhere in skill content to enable extended thinking mode for complex reasoning tasks.
SOURCE: ../claude-skills-overview-2026/SKILL.md sections on String Substitutions and Dynamic Context Injection.
flowchart TD
Start(["Skill development complete<br>Enter Step 6"]) --> Q{"Distributing via<br>plugin marketplace?"}
Q -->|"No — skill is in .claude/skills/<br>or ~/.claude/skills/<br>already in final location"| Skip(["Skip Step 6 entirely<br>Proceed to Step 7"])
Q -->|"Yes — plugin distribution planned"| Q2{"Prefer standalone .skill file<br>or bundle directly in plugin?"}
Q2 -->|"Bundle directly in plugin<br>(recommended)"| Bundle["Place skill directory under<br>plugin's skills/ directory<br>Claude Code auto-discovers all skills under skills/<br>No plugin.json update needed"]
Q2 -->|"Standalone .skill file"| RunPkg["Run: scripts/package_skill.py path/to/skill-folder<br>Optional: scripts/package_skill.py path/to/skill-folder ./dist"]
RunPkg --> Validate["Script validates automatically:<br>YAML frontmatter format and fields<br>Skill naming conventions and directory structure<br>Description completeness and quality<br>File organization and resource references"]
Validate --> VQ{"Validation<br>exit code?"}
VQ -->|"0 — validation passed"| Package["Script packages the skill<br>Creates my-skill.skill (zip with .skill extension)<br>Includes all files with proper directory structure"]
VQ -->|"non-zero — validation failed<br>script reports errors and exits"| Fix["Fix reported validation errors<br>then run packaging command again"]
Fix --> RunPkg
Bundle --> SkillReg["Skills auto-discovered when no<br>skills field present in plugin.json<br>Do NOT add skill entries to plugin.json<br>(auto_sync_manifests.py skips skills;<br>adding skills field opts into manual allowlist mode —<br>SK009 fires as INFO reminder)"]
Package --> AgentReg["Agents require explicit registration<br>in the agents array in plugin.json<br>Claude Code does not auto-discover agents<br>⚠️ ALL-OR-NOTHING — when agents array exists,<br>EVERY agent must be listed or it becomes invisible<br>List all existing agents BEFORE adding the new one"]
SkillReg --> Done(["Step 6 complete — proceed to Step 7"])
AgentReg --> Done
See claude-plugins-reference-2026 for plugin creation documentation.
After creating the skill, test it with real prompts, grade results with the A/B evaluation harness, iterate on failures, and optimize the description for triggering accuracy.
Read references/evaluation-and-optimization.md for the complete workflow covering:
evals/evals.json)agents/grader.md, viewer via eval-viewer/generate_review.py)agents/comparator.md and agents/analyzer.md)scripts/run_loop.py with train/test split)Load schemas.md for evals.json and grading.json formats.
| Directory | Contents |
|---|---|
agents/ | agents/grader.md (assertion grading), agents/comparator.md (blind A/B), agents/analyzer.md (post-hoc analysis) |
references/ | references/schemas.md (JSON schemas), references/evaluation-and-optimization.md (Steps 7-10), references/claude-code-skills-official.md (spec), references/workflows.md (patterns) |
eval-viewer/ | viewer.html (interactive eval viewer), generate_review.py (HTML generator) |
assets/ | eval_review.html (trigger eval review template) |
scripts/ | init_skill.py, package_skill.py, quick_validate.py, run_eval.py, run_loop.py, improve_description.py, generate_report.py, aggregate_benchmark.py |