Skill

nw-property-based-testing

From nw

Guides property-based testing patterns, shrinking, mutation testing tools, and workflows for test suite validation in Python, JS/TS, Rust, Java, C#.

Python

npx claudepluginhub nwave-ai/nwave --plugin nw

Tool Access

This skill uses the workspace's default tool permissions.

Preview

> Deferred to Phase 2.25: Mutation testing runs ONCE per feature as final quality gate at orchestrator Phase 2.25 (after all steps complete). Do NOT run mutation testing during inner TDD loop.

SKILL.md

Similar Skills

nw-pbt-fundamentals

484

Provides property-based testing fundamentals: core concepts, property taxonomy with decision tables/trees, and selection strategies. Language-agnostic.

property-based-testing

5.1k

Guides property-based testing for serialization roundtrips, idempotence, invariants, parsing, validation, and smart contracts across languages.

8 files

property-based-testing

206

Guides property-based testing for serialization, validation, normalization, and pure functions with property catalog, pattern detection, and library references.

ed3d-house-style

Stats

Parent Repo Stars484

Parent Repo Forks49

Last CommitMar 20, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Property-Based Testing and Mutation Testing

Deferred to Phase 2.25: Mutation testing runs ONCE per feature as final quality gate at orchestrator Phase 2.25 (after all steps complete). Do NOT run mutation testing during inner TDD loop.

Property-Based Testing (PBT)

Instead of examples ("given X, expect Y"), write properties ("for all valid inputs, condition Z holds"). Framework generates hundreds/thousands of inputs checking property. Dramatically expands test coverage.

Property Patterns

Invariants: "for all inputs, condition holds" (sorted list is ordered, balance >= 0)
Roundtrip: "encode then decode = original" (serialize/deserialize, compress/decompress)
Oracle: "compare against reference implementation" (optimized vs correct-but-slow)
Metamorphic: "different operations, same result" (add(a,b)==add(b,a), filter can't increase size)

Shrinking

When property fails, framework auto-finds minimal failing input. Dramatically accelerates debugging. Algorithm: find failing input -> try simpler variants -> if still fails, use as new candidate -> repeat.

PBT Tools by Language

Language	Framework
Python	Hypothesis
JavaScript/TypeScript	fast-check
Haskell	QuickCheck
Rust	quickcheck
Java	jqwik
C#	FsCheck

Adopted by Amazon, Volvo, Stripe, Jane Street (ICSE 2024 study).

When PBT Adds Value

PBT + TDD Integration

Start with example-based TDD for specific cases (drives detailed design)
Once basic implementation works, write properties to generalize
If property fails: found bug or need refined implementation
Refactor freely - properties verify behavior preservation

Properties = higher-level spec that survives refactoring better than examples.

Mutation Testing

Evaluates test suite quality by introducing artificial bugs (mutations) and checking if tests catch them. Mutation score = killed mutants / total mutants. Stronger metric than code coverage.

Mutation Score Targets

Score	Quality
< 60%	Weak suite, significant gaps
60-80%	Moderate, some gaps
> 80%	Strong, few gaps

Target: 75-80% minimum. Not all survivors indicate bad tests (equivalent mutants exist).

Mutation Operators

Mutation Testing Tools

Language	Tool
Java	PIT
JavaScript/TypeScript/C#	Stryker
Python	mutmut, Cosmic Ray

Computationally expensive. Use incremental: on changed code in PRs, full codebase weekly.

Combined PBT + Mutation Workflow

Write example-based tests (TDD) -> cover known scenarios
Apply mutation testing -> identify assertion gaps -> write more tests
Add PBT for complex logic -> cover input space systematically
Mutation testing again -> verify properties are comprehensive

Quality ratchet: each technique exposes gaps others miss. Prioritize critical paths and complex algorithms.

PBT Performance Guidance

Fast feedback: ~100 examples | CI/CD: ~1000 examples | Nightly builds: ~10000+ examples

Modern frameworks allow configuring example count per context.