Skill

theo-calvin-testing

Differential testing with Theodore Calvin's framework (tc). Use when writing tc tests, reasoning about test scenarios, creating input/expected JSON pairs, or debugging test failures.

npx claudepluginhub a3lem/my-claude-plugins --plugin theo-calvin-testing

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/theo-calvin-testing:theo-calvin-testing

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

> "In the AI age, specifications and tests are permanent while implementations are disposable."

SKILL.md

142 lines · ~945 tokens

Similar Skills

testing-strategies

Provides test design patterns, coverage strategies (80-100% targets), types (unit/integration/E2E), organization, and best practices for comprehensive test suites. Use for new suites, coverage improvement, or test design.

autonomous-agent

test

Generates tests for files, functions, or modules that catch real bugs. Auto-detects framework via package.json or configs, analyzes code, writes test files autonomously.

4 files7 tools

universe

test-writer

517

Writes comprehensive tests for new or changed code using git diff to detect changes. Covers happy paths, edge cases, errors, concurrency, UI components, APIs, and database ops.

test-writer

Stats

LanguagePython

Parent stars0

MaintenanceGood

Last CommitMar 5, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

Theodore Calvin's Testing Framework (tc)

"In the AI age, specifications and tests are permanent while implementations are disposable."

What is Differential Testing?

Differential testing compares actual output against expected output. No assertions, no matchers, no framework magic - just a diff.

Philosophy

Tests are nothing more than a script that takes input.json and produces output.json, which is then diffed against expected.json. This simple model:

Makes tests language-agnostic - the same test suite validates any implementation
Separates what (the spec) from how (the implementation)
Treats test suites as specifications, not just validation

Core Concept

input.json → run → output.json ←→ diff ←→ expected.json

Your code reads input.json from stdin
Your code writes JSON to stdout
The framework compares output with expected.json

JSON Comparison Rules

Objects are order-invariant - {"a":1,"b":2} equals {"b":2,"a":1}
Arrays maintain order - [1,2,3] does NOT equal [3,2,1]
Comparison is deep and semantic, not string-based

Directory Structure

tests/
└── my-test-suite/
    ├── run                    # Executable (any language)
    └── data/
        └── scenario-name/
            ├── input.json     # Test input
            └── expected.json  # Expected output

The `run` Executable

The run file is any executable that:

Reads JSON from stdin
Writes JSON to stdout
Exit code 0 = success, non-zero = error

Example (bash):

#!/usr/bin/env bash
jq '.value * 2'

Example (python):

#!/usr/bin/env python3
import json, sys
data = json.load(sys.stdin)
print(json.dumps({"result": data["value"] * 2}))

Pattern Matching

For dynamic values, use patterns in expected.json:

Pattern	Matches
`<uuid>`	UUID v4 format
`<timestamp>`	ISO 8601 datetime
`<number>`	Any number
`<string>`	Any string
`<boolean>`	true/false
`<any>`	Any value
`<null>`	null

Example expected.json:

{
  "id": "<uuid>",
  "created_at": "<timestamp>",
  "count": "<number>"
}

Commands

tc                    # Run all tests in current directory
tc run path/to/suite  # Run specific suite
tc new path/to/suite  # Create new test suite scaffold
tc list               # List available test suites
tc tags               # Show available tags
tc explain suite      # Describe what a test does
tc --parallel         # Run tests in parallel
tc --tags=unit        # Filter by tag

Creating a New Test

tc new tests/user-creation

This creates:

tests/user-creation/
├── run
└── data/
    └── basic/
        ├── input.json
        └── expected.json

Best Practices

One concept per test suite - each run tests one thing
Multiple scenarios per suite - use data/ subdirectories for variations
Descriptive scenario names - data/empty-input/, data/unicode-handling/
Keep run minimal - just wire input to your code and format output
Use patterns for non-deterministic values - timestamps, UUIDs, etc.

Why This Approach

Portable: Tests don't depend on any test framework or language
Diffable: JSON diffs are clear and human-readable
AI-friendly: Easy for AI to generate and verify tests
Implementation-agnostic: Rewrite in any language, tests still pass

Reference

Repository: https://github.com/ahoward/tc
Requires: bash 4.0+, jq

theo-calvin-testing

Invocation

Context Preview

SKILL.md

Similar Skills

Help us improve

Help us improve

Find plugins for your project

theo-calvin-testing

Invocation

Context Preview

SKILL.md

Theodore Calvin's Testing Framework (tc)

What is Differential Testing?

Philosophy

Core Concept

JSON Comparison Rules

Directory Structure

The run Executable

Pattern Matching

Commands

Creating a New Test

Best Practices

Why This Approach

Reference

Similar Skills

Help us improve

Theodore Calvin's Testing Framework (tc)

What is Differential Testing?

Philosophy

Core Concept

JSON Comparison Rules

Directory Structure

The run Executable

Pattern Matching

Commands

Creating a New Test

Best Practices

Why This Approach

Reference

The `run` Executable

The `run` Executable