By laurigates
Evaluate and benchmark Claude Code skills/plugins: run behavioral test cases on skills, grade outputs against assertions with evidence, aggregate plugin-level reports, compare versions blindly, analyze patterns for strengths/weaknesses, track quality trends over time, and get prioritized suggestions to improve instructions/examples/structure.
npx claudepluginhub laurigates/claude-plugins --plugin evaluate-pluginAnalyze evaluation results to identify patterns, weaknesses, and improvement opportunities. Operates in comparison mode (with-skill vs baseline) or benchmark mode (trends across runs). Use after grading to generate suggestions.
Blind comparison of two outputs without knowing their origin. Rates content quality and structure quality to objectively determine which output is better. Use to compare with-skill vs baseline runs without bias.
Grade evaluation runs against predefined assertions. Examines execution transcripts and outputs to determine pass/fail with cited evidence. Use as a subagent from evaluation orchestration skills.
Analyze evaluation results and suggest concrete skill improvements. Use after running evaluations to get actionable recommendations for improving skill quality, descriptions, or instructions.
Batch evaluate all skills in a plugin. Runs /evaluate:skill for each skill that has eval cases, then produces a plugin-level report. Use when auditing an entire plugin's quality or before a release.
View evaluation results and benchmark reports. Use when you want to see past eval results, compare benchmark runs, or review quality trends for a skill or plugin.
Evaluate a skill's effectiveness by running test cases and grading results. Use when you want to test whether a skill produces correct guidance, validate skill improvements, or benchmark a skill before release.
Comprehensive .NET development skills for modern C#, ASP.NET, MAUI, Blazor, Aspire, EF Core, Native AOT, testing, security, performance optimization, CI/CD, and cloud-native applications
Uses power tools
Uses Bash, Write, or Edit tools
Unity Development Toolkit - Expert agents for scripting/refactoring/optimization, script templates, and Agent Skills for Unity C# development
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, rules, and legacy command shims evolved over 10+ months of intensive daily use
Complete collection of battle-tested Claude Code configs agents, skills, hooks, rules, and legacy command shims evolved over 10+ months of intensive daily use
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, and rules evolved over 10+ months of intensive daily use
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.