Help us improve
Share bugs, ideas, or general feedback.
From plugin-eval
Help engineers evaluate a local skill or plugin, explain why it scored that way, show what to fix first, measure real token usage, benchmark starter scenarios, or decide what to run next. Use when the user says things like "evaluate this skill", "give me an analysis of the game dev skill", "why did this score that way", "what should I fix first", "measure the real token usage of this skill", or "what should I run next?".
npx claudepluginhub robinebers/converted-plugins --plugin plugin-evalHow this skill is triggered — by the user, by Claude, or both
Slash command
/plugin-eval:plugin-evalThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Use this as the beginner-friendly umbrella entrypoint for local Codex skill and plugin evaluation.
Mandates invoking relevant skills via tools before any response in coding sessions. Covers access, priorities, and adaptations for Claude Code, Copilot CLI, Gemini CLI.
Share bugs, ideas, or general feedback.
Use this as the beginner-friendly umbrella entrypoint for local Codex skill and plugin evaluation.
plugin-eval start <path> --request "<user request>" --format markdown
plugin-eval analyze <path> --format markdown, then initialize a benchmark and show the setup questions needed to tailor benchmark.jsonplugin-eval analyze <path> --format markdownplugin-eval analyze <path> --format markdownplugin-eval analyze <path> --format markdownplugin-eval explain-budget <path> --format markdownplugin-eval measurement-planplugin-eval start <path> --request "What should I run next?" --format markdown../improve-skill/SKILL.md.../metric-pack-designer/SKILL.md.~/.codex/skills/<skill-name> firstskills/<skill-name> directory.plugin-eval/benchmark.jsonGive me an analysis of the game dev skill.Evaluate this skill.Evaluate this plugin.Why did this score that way?What should I fix first?Explain the token budget for this skill.Measure the real token usage of this skill.Help me benchmark this plugin.What should I run next?plugin-eval start <path> --request "Evaluate this skill." --format markdown
plugin-eval start <path> --request "Give me a full analysis of this skill, including benchmark setup." --format markdown
plugin-eval analyze <path> --format markdown
plugin-eval explain-budget <path> --format markdown
plugin-eval measurement-plan <path> --format markdown
plugin-eval init-benchmark <path>
plugin-eval benchmark <path> --dry-run
plugin-eval benchmark <path>
At a Glance, Why It Matters, Fix First, and Recommended Next Step.why content terse and easy to skim.plugin-eval start command that routes it, and the first local workflow command behind it.../evaluate-skill/SKILL.md.../evaluate-plugin/SKILL.md.../../references/chat-first-workflows.md../../references/technical-design.md../../references/evaluation-result-schema.md