Mutation Testing

Expert knowledge for mutation testing - validating that your tests actually catch bugs by introducing deliberate code mutations.

Core Expertise

Mutation Testing Concept

Mutants: Small code changes (mutations) introduced automatically
Killed: Test fails with mutation (good - test caught the bug)
Survived: Test passes with mutation (bad - weak test)
Coverage: Tests execute mutated code but don't catch it
Score: Percentage of mutants killed (aim for 80%+)

What Mutation Testing Reveals

Tests that don't actually verify behavior
Missing assertions or edge cases
Overly permissive assertions
Dead code or unnecessary logic
Areas needing stronger tests

TypeScript/JavaScript (Stryker)

Installation

# Using Bun
bun add -d @stryker-mutator/core

# Using npm
npm install -D @stryker-mutator/core

# For Vitest
bun add -d @stryker-mutator/vitest-runner

# For Jest
bun add -d @stryker-mutator/jest-runner

Configuration

// stryker.config.mjs
/** @type {import('@stryker-mutator/api/core').PartialStrykerOptions} */
export default {
  packageManager: 'bun',
  reporters: ['html', 'clear-text', 'progress', 'dashboard'],
  testRunner: 'vitest',
  coverageAnalysis: 'perTest',

  // Files to mutate
  mutate: [
    'src/**/*.ts',
    '!src/**/*.test.ts',
    '!src/**/*.spec.ts',
    '!src/**/*.d.ts',
  ],

  // Thresholds for CI
  thresholds: {
    high: 80,
    low: 60,
    break: 60, // Fail build if below this
  },

  // Incremental mode (faster)
  incremental: true,
  incrementalFile: '.stryker-tmp/incremental.json',

  // Concurrency
  concurrency: 4,

  // Timeout
  timeoutMS: 60000,
}

Running Stryker

# Run mutation testing
npx stryker run

# Incremental mode (only changed files)
npx stryker run --incremental

# Specific files
npx stryker run --mutate "src/utils/**/*.ts"

# With specific configuration
npx stryker run --configFile stryker.prod.config.mjs

# Generate HTML report
npx stryker run --reporters html,clear-text

# Open HTML report
open reports/mutation/html/index.html

Understanding Results

Mutation score: 82.5%
- Killed: 66 (tests caught the mutation)
- Survived: 14 (tests passed despite mutation - weak tests!)
- No Coverage: 0 (mutated code not executed)
- Timeout: 0 (tests took too long)
- Runtime Errors: 0 (mutation broke the code)
- Compile Errors: 0 (mutation caused syntax error)

Example: Weak Test

// Source code
function calculateDiscount(price: number, percentage: number): number {
  return price - (price * percentage / 100)
}

// ❌ WEAK: Test passes even if we mutate the calculation
test('applies discount', () => {
  const result = calculateDiscount(100, 10)
  expect(result).toBeDefined() // Too weak!
})

// Stryker mutates the code:
// function calculateDiscount(price: number, percentage: number): number {
//   return price + (price * percentage / 100)  // Changed - to +
// }
// Test still passes! ❌ Survived mutant

// ✅ STRONG: Test catches mutation
test('applies discount correctly', () => {
  expect(calculateDiscount(100, 10)).toBe(90)
  expect(calculateDiscount(100, 20)).toBe(80)
  expect(calculateDiscount(50, 10)).toBe(45)
})
// Mutation causes test to fail ✅ Killed mutant

Common Mutation Types

// Arithmetic Operator
// Original: a + b
// Mutants: a - b, a * b, a / b

// Relational Operator
// Original: a > b
// Mutants: a >= b, a < b, a <= b, a === b

// Conditional Boundary
// Original: a >= 10
// Mutants: a > 10, a < 10

// Logical Operator
// Original: a && b
// Mutants: a || b, a, b

// Unary Operator
// Original: !condition
// Mutants: condition

// String Literal
// Original: 'hello'
// Mutants: '', 'Stryker was here!'

// Boolean Literal
// Original: true
// Mutants: false

// Array Declaration
// Original: [1, 2, 3]
// Mutants: []

Ignoring Specific Mutations

// Disable mutation for a block
// Stryker disable all
function debugOnlyCode() {
  console.log('This won\'t be mutated')
}
// Stryker restore all

// Disable specific mutator
// Stryker disable next-line ArithmeticOperator
const total = price + tax

CI/CD Integration

# .github/workflows/mutation.yml
name: Mutation Testing

on:
  pull_request:
    branches: [main]

jobs:
  mutation:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      - name: Setup Bun
        uses: oven-sh/setup-bun@v1

      - name: Install dependencies
        run: bun install

      - name: Run mutation tests
        run: bun run stryker run

      - name: Upload mutation report
        uses: actions/upload-artifact@v3
        if: always()
        with:
          name: mutation-report
          path: reports/mutation/html/

Python (mutmut)

Installation

# Using uv
uv add --dev mutmut

# Using pip
pip install mutmut

Basic Configuration

# pyproject.toml
[tool.mutmut]
paths_to_mutate = "src/"
backup = false
runner = "python -m pytest -x --assert=plain -q"
tests_dir = "tests/"

Running mutmut

# Run mutation testing
uv run mutmut run

# Run on specific paths
uv run mutmut run --paths-to-mutate=src/calculator.py

# Show results
uv run mutmut results

# Show summary
uv run mutmut summary

# Show specific mutant
uv run mutmut show 1

# Apply a mutant (to test it manually)
uv run mutmut apply 1

# Generate HTML report
uv run mutmut html
open html/index.html

Understanding Results

Status: 45/50 mutants killed (90%)
- Killed: 45 (tests caught the mutation)
- Survived: 5 (tests passed despite mutation)
- Suspicious: 0 (inconsistent results)
- Timeout: 0 (tests took too long)

Example: Improving Weak Test

# Source code
def calculate_discount(price: float, percentage: float) -> float:
    return price - (price * percentage / 100)

# ❌ WEAK: Test passes with mutations
def test_applies_discount():
    result = calculate_discount(100, 10)
    assert result is not None  # Too weak!
    # mutmut can change the calculation and test still passes

# ✅ STRONG: Test catches mutations
def test_applies_discount_correctly():
    assert calculate_discount(100, 10) == 90.0
    assert calculate_discount(100, 20) == 80.0
    assert calculate_discount(50, 10) == 45.0
    assert calculate_discount(0, 10) == 0.0

Common Mutation Types (Python)

# Arithmetic operators
# Original: a + b → a - b, a * b, a / b, a // b, a % b

# Comparison operators
# Original: a > b → a >= b, a < b, a <= b, a == b, a != b

# Logical operators
# Original: a and b → a or b, a, b

# Assignment operators
# Original: a += 1 → a -= 1, a *= 1, a /= 1

# Boolean literals
# Original: True → False

# Number literals
# Original: 42 → 43, 41

# String literals
# Original: "hello" → "XXhelloXX", ""

Filtering Results

# Show only survived mutants
uv run mutmut results | grep "^Survived"

# Show killed mutants
uv run mutmut results | grep "^Killed"

# Show mutants for specific file
uv run mutmut show src/calculator.py

Excluding Code from Mutation

# Exclude entire function
# pragma: no mutate
def legacy_code():
    pass

# Exclude specific line
result = expensive_computation()  # pragma: no mutate

CI/CD Integration

# .github/workflows/mutation.yml
name: Mutation Testing

on:
  pull_request:
    branches: [main]

jobs:
  mutation:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      - name: Install uv
        uses: astral-sh/setup-uv@v1

      - name: Install dependencies
        run: uv sync

      - name: Run mutation tests
        run: uv run mutmut run

      - name: Show results
        if: always()
        run: uv run mutmut results

      - name: Generate HTML report
        if: always()
        run: uv run mutmut html

      - name: Upload report
        uses: actions/upload-artifact@v3
        if: always()
        with:
          name: mutation-report
          path: html/

Interpreting Results

Mutation Score Targets

Score	Quality	Action
90%+	Excellent	Maintain quality
80-89%	Good	Small improvements
70-79%	Acceptable	Focus on weak areas
60-69%	Needs work	Add missing tests
< 60%	Poor	Major test improvements needed

Survived Mutants Analysis

Common Reasons Mutants Survive:

Missing assertions

// Mutant survives
test('processes data', () => {
  processData(input)
  // No assertion!
})

// Fix: Add assertion
test('processes data', () => {
  const result = processData(input)
  expect(result).toEqual(expected)
})

Weak assertions

# Mutant survives
def test_calculation():
    result = calculate(10, 5)
    assert result > 0  # Too weak!

# Fix: Specific assertion
def test_calculation():
    result = calculate(10, 5)
    assert result == 15

Dead code

function example(x: number) {
  if (x > 10) {
    return 'large'
  }
  if (x < 0) {
    return 'negative'
  }
  return 'small'
  // Unreachable code mutated but never executed
}

// Fix: Remove dead code or add test
test('handles edge case', () => {
  expect(example(0)).toBe('small')
})

Equivalent mutants

// Original
const result = x + 0

// Mutant (mathematically equivalent)
const result = x - 0

// Both produce same result - valid survivor

Best Practices

When to Run Mutation Tests

After achieving high code coverage (80%+)
Before major releases
On critical business logic modules
When test quality is questioned
In CI for important branches

Incremental Approach

Start with core business logic modules
Fix survived mutants
Gradually expand coverage
Integrate into CI pipeline
Maintain high mutation score

Performance Optimization

// Stryker
export default {
  // Only mutate changed files
  incremental: true,

  // Focus on important files
  mutate: ['src/core/**/*.ts', 'src/utils/**/*.ts'],

  // Skip slow tests
  disableTypeChecks: '{src,test}/**/*.ts',

  // Adjust concurrency
  concurrency: 4,
}

# mutmut - incremental runs
uv run mutmut run --paths-to-mutate=src/core/

Common Pitfalls

❌ Running mutation tests before basic coverage
❌ Expecting 100% mutation score
❌ Not excluding generated code
❌ Treating equivalent mutants as failures
❌ Running full mutation suite on every commit

Integration with Coverage

# 1. First, ensure good coverage
vitest --coverage
# Target: 80%+ coverage

# 2. Then, validate test quality with mutations
npx stryker run
# Target: 80%+ mutation score

Workflow Example

TypeScript/Vitest/Stryker

# 1. Write tests with coverage
bun test --coverage

# 2. Check coverage report
open coverage/index.html
# Ensure 80%+ coverage

# 3. Run mutation testing
npx stryker run

# 4. Check mutation report
open reports/mutation/html/index.html

# 5. Fix survived mutants by improving tests
# (Review each survived mutant)

# 6. Re-run mutation tests
npx stryker run --incremental

Python/pytest/mutmut

# 1. Write tests with coverage
uv run pytest --cov

# 2. Check coverage report
uv run pytest --cov --cov-report=html
open htmlcov/index.html
# Ensure 80%+ coverage

# 3. Run mutation testing
uv run mutmut run

# 4. Check results
uv run mutmut results

# 5. Investigate survived mutants
uv run mutmut show <id>

# 6. Fix weak tests

# 7. Re-run on updated files
uv run mutmut run --paths-to-mutate=src/fixed_module.py

Improving Weak Tests

Pattern: Insufficient Assertions

// Before: Mutation survives
test('calculates sum', () => {
  const result = sum([1, 2, 3])
  expect(result).toBeGreaterThan(0) // Weak!
})

// After: Mutation killed
test('calculates sum correctly', () => {
  expect(sum([1, 2, 3])).toBe(6)
  expect(sum([0, 0, 0])).toBe(0)
  expect(sum([-1, 1])).toBe(0)
  expect(sum([])).toBe(0)
})

Pattern: Missing Edge Cases

# Before: Mutation survives
def test_divide():
    result = divide(10, 2)
    assert result == 5

# After: Mutation killed
def test_divide():
    assert divide(10, 2) == 5
    assert divide(9, 3) == 3
    assert divide(7, 2) == 3.5
    with pytest.raises(ZeroDivisionError):
        divide(10, 0)

Pattern: Boundary Conditions

// Before: Mutation survives
test('validates age', () => {
  expect(isValidAge(25)).toBe(true)
})

// After: Mutation killed (tests boundaries)
test('validates age boundaries', () => {
  expect(isValidAge(18)).toBe(true)   // Min valid
  expect(isValidAge(17)).toBe(false)  // Just below
  expect(isValidAge(100)).toBe(true)  // Max valid
  expect(isValidAge(101)).toBe(false) // Just above
  expect(isValidAge(0)).toBe(false)
  expect(isValidAge(-1)).toBe(false)
})

Troubleshooting

Mutation tests taking too long

// Stryker: Increase concurrency
export default {
  concurrency: Math.max(os.cpus().length - 1, 1),
  timeoutMS: 30000,
}

# mutmut: Use caching
uv run mutmut run --use-coverage

Too many survived mutants

Focus on critical modules first
Add specific assertions
Test boundary conditions
Review mutation details

All tests failing

# Verify tests pass before mutations
bun test  # or uv run pytest

False positives (equivalent mutants)

// Ignore mathematically equivalent mutants
// Stryker disable next-line all
const result = x + 0

References

Stryker: https://stryker-mutator.io/
mutmut: https://github.com/boxed/mutmut
Mutation Testing Intro: https://en.wikipedia.org/wiki/Mutation_testing

mutation-testing

Mutation Testing

Core Expertise

TypeScript/JavaScript (Stryker)

Installation

Configuration

Running Stryker

Understanding Results

Example: Weak Test

Common Mutation Types

Ignoring Specific Mutations

CI/CD Integration

Python (mutmut)

Installation

Basic Configuration

Running mutmut

Understanding Results

Example: Improving Weak Test

Common Mutation Types (Python)

Filtering Results

Excluding Code from Mutation

CI/CD Integration

Interpreting Results

Mutation Score Targets

Survived Mutants Analysis

Best Practices

Workflow Example

TypeScript/Vitest/Stryker

Python/pytest/mutmut

Improving Weak Tests

Pattern: Insufficient Assertions

Pattern: Missing Edge Cases

Pattern: Boundary Conditions

Troubleshooting

See Also

References

Similar Skills