Browse the full directory of Claude Code plugins — commands, agents, skills, MCP servers, and more.
Browse plugins →The Claude Code testing plugins category is huge, yet few write tests—here's what each of eight actually does and which one to install first.
Writing and maintaining tests is the chore developers put off, and the Claude Code testing plugins in this directory all promise to take it off your hands — but only a few actually generate tests, and several were tagged testing by accident.
Here's the short answer before the evidence:
The testing category on ClaudePluginHub lists 5,532 plugins out of 43,258 in the full directory, but "testing" is a loose tag. Most of what surfaces does review, standards enforcement, or spec workflows — not unit-test generation. Browse the testing category yourself and the pattern is obvious.
Installs are 7-day counts; where the snapshot has no install figure, the cell reads n/a.
| Plugin | Components | Installs (7d) | Stars | Job it does |
|---|---|---|---|---|
| ponytail | skills, hooks | 556 | 68,012 | Review-time code-quality guardrails |
| everything-claude-code | commands, agents, skills, hooks, MCP | 108 | 2 | Full dev kit; tests are one part |
| agentic-sdd | commands, agents, skills, hooks | n/a | 0 | Spec-driven gates |
| guidelines | skills | n/a | 0 | Coding-standards skill |
| nature-skills | skills | 110 | 20,042 | Scientific manuscript writing |
| ketch | MCP | n/a | 231 | Web research server |
| wayfinder | skills, MCP | n/a | 0 | Unproven listing |
| hotline | MCP | n/a | 0 | Unproven listing |
The spread is stark: ponytail and nature-skills carry real adoption, ketch has moderate stars from its CLI, and the rest are effectively unrated. Stars measure repo popularity, not test quality, so treat them as a signal of maturity, not correctness.
ponytail ships skills and hooks that scan for over-engineering, dead code, and speculative abstractions while you review or write code, and it tracks the shortcuts you take on purpose. Reach for it when your problem is not "I have no tests" but "my code and tests are accumulating cruft nobody prunes." Unlike everything-claude-code, it does one job with just skills and hooks rather than bundling a full agent-and-command suite, and with 556 installs in 7 days and 68,012 stars it is the only plugin here with meaningful adoption. Start at its plugin page.
burgebj's everything-claude-code is the broadest entry — commands, agents, skills, hooks, and an MCP server in one install — so test workflows sit alongside code generation, review, and automation rather than standing alone. Use it when you want a single opinionated kit for the whole loop instead of stitching together single-purpose plugins. It differs from ponytail by spanning every component type — commands, agents, skills, hooks, and an MCP server — where ponytail ships only skills and hooks, and from agentic-sdd by adding an MCP server. At 108 installs in 7 days and 2 stars it is widely tried but young; if you like this shape, pair it with the ideas in automating your Claude Code workflows. See its plugin page.
agentic-sdd ships commands, agents, skills, and hooks aimed at spec-driven development, where testing is a gate in a plan-then-build loop rather than a standalone step. Use it if you already write specs first and want verification wired into that flow. It overlaps everything-claude-code on components but drops the MCP server, keeping the footprint narrower. With 0 stars it is unproven — treat it as an early option to trial, not a dependency. Its plugin page has the details.
guidelines is a skills-only plugin for enforcing coding standards; with no enriched description available, its component set is all we can judge it on. It fits teams that want Claude to apply a house style during reviews, which is QA-adjacent rather than test generation. It differs from ponytail — the other skills-based option — by shipping no hooks, so it will not fire automatically the way ponytail's do, and it has 0 stars. See its plugin page.
The remaining plugins answer a different question than "how do I test this."
nature-skills generates Nature-style manuscripts, figures, response letters, and patents, and runs a simulated peer review — the "review" that likely landed it in this category. Use it for academic writing, not test suites. Despite 110 installs in 7 days and 20,042 stars, it has no bearing on software testing; it is here by keyword collision. Its plugin page confirms the scope.
ketch exposes a single MCP server that searches the web and codebases, fetches docs, and crawls sites through the ketch CLI. It is useful when your gap is research or documentation lookup, not writing tests. Its 231 stars come from the CLI project, not test capability, and unlike ponytail it ships no skills or hooks. Its plugin page has setup notes.
wayfinder combines skills and an MCP server but carries 0 stars and no install data or description, so there is nothing to judge its testing value on beyond its components. Consider it only if you are deliberately trialing new listings. Its plugin page is the place to look.
hotline ships a single MCP server, 0 stars, and no description — the thinnest entry in the set. Unlike ketch, another MCP-only plugin, it has no adoption signal at all, so there is no evidence it does testing work. Check its plugin page before relying on it.
If your real problem is test coverage today, none of these writes a full unit suite for you — the honest move is ponytail to keep the code and tests you already have clean, plus a broad kit like everything-claude-code for generation across the loop. If you work spec-first, agentic-sdd gates checks where you already make decisions. If you want a house style enforced, guidelines does that one thing. And if a plugin above turned out to solve research or writing instead of testing, that is the data telling you the category is thinner than its 5,532-plugin count suggests.
For a broader treatment of test-focused setups, read our earlier testing plugins guide; then filter the testing category by component type and install the one whose components match the job you actually have.
A walk through the 4,106-plugin testing category on ClaudePluginHub: which plugins focus on QA workflows like Playwright, TDD, and CLAUDE.md audits — and which ones are tagged testing but cover broader full-stack work.
8 plugins that use hooks, agents, and skills to automate Claude Code workflows — compared by install data, component coverage, and automation approach.
Few Claude Code plugins for memory actually persist context across sessions. Here are the two that do — and the “memory” results that are false matches.
Enforce minimal, lazy-correct code by automatically scanning for over-engineering, dead code, and speculative abstractions during reviews and coding sessions. Also tracks deliberate shortcuts and provides quick-reference commands.
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, rules, and legacy command shims evolved over 10+ months of intensive daily use
Spec-Driven Development lifecycle with TDD and E2E testing. Ships lifecycle agents (spec, tdd, implement, e2e, review, refactor, architect) and stack experts (React, React Native, Node/Express/Fastify, NestJS, Next.js, C#/ASP.NET Core, Avalonia/XAML desktop, PostgreSQL, MongoDB), reusable skills, slash commands, and enforcing quality-gate hooks for clean, maintainable, scalable code. Cross-platform incl. macOS Apple Silicon.
Coding guidelines skill covering input validation, error handling, DDD structure, type safety, testing, and more.
Generate Nature-style scientific manuscripts, figures, response letters, patent drafts, and presentations from research materials. Conduct multi-source literature searches, citation management, paper reading with Chinese-English side-by-side rendering, and simulated peer review.
Run web research workflows directly from Claude: search the web and codebases, fetch documentation, scrape pages, and crawl sites through a local MCP server powered by the ketch CLI.
Wayfinder retirement-planning Agent Skills (create-plan, create-scenarios, retirement-advising) plus the hosted Wayfinder MCP server — build, stress-test, and get advice on plans (life-paths + Monte Carlo) with live data from your account.
Wires up the hotline MCP server so Claude Code sessions get the Telegram channel: reply, react, edit_message, and download_attachment tools plus inbound message relay. Requires the hotline binary on PATH (go install github.com/1broseidon/hotline@latest) and a configured bot token — see the README quickstart at https://github.com/1broseidon/hotline.