Skill

policy-core

Global TDD governance policy. Enforces plan-first development, behavior-driven test quality, and strict completion gates.

npx claudepluginhub xiaolai/claude-plugin-marketplace --plugin tdd-guardian

Popularity

Stars

Forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/tdd-guardian:policy-core

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

1. Plan-first: map all work to explicit work items and acceptance criteria before edits.

SKILL.md

129 lines · ~1.6k tokens

Similar Skills

skill-lookup

162.5k

Searches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.

prompts.chat

karpathy-guidelines

135.7k

Provides behavioral guidelines to reduce common LLM coding mistakes, focusing on simplicity, surgical changes, assumption surfacing, and verifiable success criteria.

andrej-karpathy-skills

ui-ux-pro-max

80.0k

Provides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.

ui-ux-pro-max

Stats

LanguageJavaScript

Stars3

Forks1

MaintenanceExcellent

Last CommitMay 19, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

Policy Core

Required behavior

Plan-first: map all work to explicit work items and acceptance criteria before edits.
Scope lock: implement only requested scope; document extras as deferred notes.
Small batches: complete one work item at a time with immediate verification.
Regression safety: every bug fix includes a failing reproducer test before the fix.
Findings-first review: report defects and risks before summary.

Test quality requirements

The core principle: Test BEHAVIOR, not WIRING

A test must verify what the code does (its observable output, side effects, or state changes), not how it does it (which internal functions it calls). If you can refactor the internals and the test still passes despite broken behavior, the test is worthless.

Assertion hierarchy (prefer higher, justify lower)

Level	Assertion type	Example	Quality
1	Output verification	`expect(result).toEqual({ id: "mx-foo-abc123", port: 7700 })`	Best
2	Side-effect verification	`expect(await db.query("SELECT ...")).toHaveLength(1)`	Best
3	Real integration	`const res = await app.inject({ method: "GET", url: "/healthz" })`	Best
4	State verification	`expect(container.State.Running).toBe(true)`	Good
5	Mock return + output	Mock returns data, assert caller produces correct output from it	Good
6	Mock call args	`expect(mockFn).toHaveBeenCalledWith(...)`	Weak
7	Mock was called	`expect(mockFn).toHaveBeenCalled()`	Unacceptable alone

Mandatory rules

Every test must have at least one Level 1-5 assertion. A test that only verifies mock call arguments (Level 6-7) is a wiring test and MUST be upgraded.
Mock call assertions are supplements, not replacements. You may assert expect(mockCreate).toHaveBeenCalledWith(opts) but ONLY if you also verify the observable result (return value, formatted output, error thrown).
Prefer real objects over mocks. Use real implementations when feasible:
- Real Fastify with app.inject() (not mocked HTTP)
- Real SQLite in-memory DB (not mocked queries)
- Real file I/O with tmpdir() (not mocked fs)
- Real streams with actual write/read (not mocked EventEmitter)
- Real Zod parse (not mocked validation)
Mock only at boundaries. Acceptable mock targets: Docker daemon, network I/O, child processes, Date.now(), crypto.randomBytes(). Unacceptable: mocking your own modules, mocking types/schemas, mocking pure functions.
Security properties must be tested behaviorally. Do NOT verify security by asserting mock call args like expect(callArgs.HostConfig.CapDrop).toEqual(["ALL"]). Instead, inspect the actual created resource or use integration tests.
Add tests for success, boundaries, invalid input, guard clauses, and error paths.
Include state-transition/idempotency tests when behavior is stateful.
Include timeout/retry/concurrency tests when logic is async or distributed.
Avoid assertion-free tests and snapshot-only logic verification.

Anti-patterns (flag these in review)

// BAD: Wiring-only test — would pass even if createContainer was gutted
it("creates container", async () => {
  await mechaUp(client, opts);
  expect(mockCreateContainer).toHaveBeenCalledWith(client, {
    containerName: "mecha-mx-foo-abc123",
    image: "mecha-runtime:latest",
    // ... 10 lines of expected args
  });
});

// GOOD: Behavior test — verifies the observable result
it("creates and starts a mecha, returning its ID and port", async () => {
  const result = await mechaUp(client, { projectPath: "/tmp/test" });
  expect(result.id).toMatch(/^mx-/);
  expect(result.port).toBeGreaterThanOrEqual(1024);
  expect(result.authToken).toHaveLength(64);
  expect(result.name).toBe(`mecha-${result.id}`);
});

// BAD: Security check via mock args — would miss if defaults were silently dropped
it("applies security defaults", async () => {
  await createContainer(client, opts);
  const callArgs = mockCreate.mock.calls[0][0];
  expect(callArgs.HostConfig.ReadonlyRootfs).toBe(true);
});

// GOOD: Security check via integration — verifies Docker actually received the config
it("applies security defaults", async () => {
  await createContainer(client, opts);
  const info = await inspectContainer(client, name);
  expect(info.HostConfig.ReadonlyRootfs).toBe(true);
  expect(info.HostConfig.CapDrop).toContain("ALL");
});

// BAD: Mock-was-called as sole assertion
it("stops the container", async () => {
  await mechaStop(client, "test-id");
  expect(mockStopContainer).toHaveBeenCalled();
});

// GOOD: Verify state change
it("stops a running container", async () => {
  await mechaStop(client, "test-id");
  const info = await inspectContainer(client, containerName("test-id"));
  expect(info.State.Running).toBe(false);
});

Coverage ignore directives

When excluding lines from coverage, always use range comments, never the line-count form:

// BAD: /* v8 ignore next N */ silently fails on ??, ternaries, catch bodies, and short-circuit operators
/* v8 ignore next 3 */
const value = input ?? defaultValue;

// GOOD: range comments work reliably on all constructs
/* v8 ignore start */
const value = input ?? defaultValue;
/* v8 ignore stop */

/* v8 ignore next N */ silently miscounts on these constructs — coverage appears covered but the ignore is not applied, or worse, it skips the wrong lines. The start/stop form is explicit and immune to this class of bugs.

Mandatory rule: flag any /* v8 ignore next */ or /* v8 ignore next N */ in review as a potential silent failure. Replace with /* v8 ignore start */ / /* v8 ignore stop */.

Completion gates

Test command must pass.
Coverage command must pass.
Coverage totals must satisfy thresholds for lines/functions/branches/statements.
Test quality audit: no test file may have ONLY Level 6-7 assertions. Every test must include at least one Level 1-5 assertion.
Mutation gate must pass when enabled.
High-severity findings must be resolved or explicitly waived with rationale.

policy-core

Popularity

Invocation

Context Preview

SKILL.md

Similar Skills

Help us improve

Help us improve

Find plugins for your project

policy-core

Popularity

Invocation

Context Preview

SKILL.md

Policy Core

Required behavior

Test quality requirements

The core principle: Test BEHAVIOR, not WIRING

Assertion hierarchy (prefer higher, justify lower)

Mandatory rules

Anti-patterns (flag these in review)

Coverage ignore directives

Completion gates

Similar Skills

Help us improve

Policy Core

Required behavior

Test quality requirements

The core principle: Test BEHAVIOR, not WIRING

Assertion hierarchy (prefer higher, justify lower)

Mandatory rules

Anti-patterns (flag these in review)

Coverage ignore directives

Completion gates