`npx claudepluginhub bang9/ai-tools --plugin whip`

This skill uses the workspace's default tool permissions.
Run multi-agent simulations from a user-provided scenario. Concretize the scenario into test cases, spawn agents, and analyze output patterns for consistency.
Extract from `$ARGUMENTS`:

- `--runs N`: number of simulation runs (default: 5)
- `--agent`: use inline mode (see Execution Mode below)

`$ARGUMENTS` determines which dispatch mode this skill uses. The two modes are mutually exclusive:
| Mode | Activates when | Dispatch mechanism |
|---|---|---|
| Tracked (default) | --agent is absent from $ARGUMENTS | /whip-start Team Flow — IRC, workspace, polling |
| Inline | --agent is present in $ARGUMENTS | Agent tool directly — no whip, no IRC, no lifecycle |
Strict rules:
- No `--agent` in arguments → tracked mode. No exceptions, no inference.
- `--agent` in arguments → inline mode. /whip-start, IRC, and lifecycle steps are all skipped.
- A `--backend` specification (e.g., the user says "use codex") implies tracked mode. Backend selection is a whip concept and is incompatible with `--agent`.
- Never infer `--agent` from task simplicity, speed preference, or any other heuristic. The flag must be explicitly present in the user's input.

If running inside an active whip workspace, use `whip workspace view <workspace-name>` to get the worktree path for reading code artifacts referenced in the scenario. In tracked mode, simulation tasks go in the global workspace (ephemeral; do not pollute the active workspace).
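As a minimal sketch, the dispatch rule reduces to a single explicit-flag check. The flag names come from this skill; the function itself is hypothetical, not part of whip:

```python
# Illustrative only: mode selection as code.
def pick_mode(arguments: list[str]) -> str:
    if "--agent" in arguments:
        return "inline"   # Agent tool directly; no /whip-start, no IRC, no lifecycle
    return "tracked"      # default; --backend (a whip concept) also implies tracked

assert pick_mode(["--runs", "10"]) == "tracked"
assert pick_mode(["--agent", "--runs", "3"]) == "inline"
```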
Read any files, git refs, or codebase artifacts referenced in the scenario, then transform it into concrete test cases:
| Field | Description |
|---|---|
| Name | Short identifier (e.g., deprecated-move-1) |
| Setup | Context the agent receives (file contents, code, instructions) |
| Action | What the agent executes |
| Output contract | Structured format the agent must produce |
The output contract is critical — all agents must produce the same structure so results are mechanically comparable:
### Result
- pattern: [short label for the approach taken]
- output:
[code block, JSON, or other structured output]
- decisions: [key judgment calls made]
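For example, a filled-in test case (contents hypothetical) might look like:

| Field | Example |
|---|---|
| Name | deprecated-move-1 |
| Setup | Contents of a module whose `move()` helper is marked deprecated, pasted inline |
| Action | Refactor the calling code to stop using the deprecated helper |
| Output contract | The `### Result` structure above |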
For A/B comparisons, choose a strategy:
| Strategy | When to use | Agent count |
|---|---|---|
| Sequential | Outputs are structured (code, configs) — one agent runs A then B | N |
| Isolated | Outputs involve judgment or prose — separate agents per version | 2N |
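For example, with 2 test cases and `--runs 5`, N = 10: sequential dispatch spawns 10 agents (each producing both the A and B outputs), while isolated dispatch spawns 20 (one agent per version per run).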
Present the test plan (test cases, output contract, run counts, and dispatch strategy), then wait for user approval before executing.
Hand off dispatch to /whip-start. Prepare one task spec per simulation run and let /whip-start handle IRC, creation, assignment, and monitoring.
Each simulation run becomes one task:
- Name: `sim-{test-case}-{run}`
- Workspace: `global`
- Difficulty: `easy`

After all tasks complete, collect outputs and proceed to analysis.
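For illustration, one such task spec might look like this (the field labels are assumptions for readability, not whip's actual schema):

```
Name: sim-deprecated-move-1
Workspace: global
Difficulty: easy
Prompt: the full self-contained prompt for run 1 of deprecated-move,
        including setup, action, and the output contract
```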
In inline mode (`--agent`), spawn one Agent tool call per run, named `sim-{test-case}-{run}`.
Each prompt must be self-contained — embed all context inline, not file paths:
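A hypothetical self-contained prompt (structure illustrative, not a fixed whip format) might read:

```
You are simulation run sim-deprecated-move-1. Do not read any files.

Setup:
<relevant file contents pasted verbatim>

Action:
<the exact task to execute>

Respond using this exact format:
### Result
- pattern: [short label for the approach taken]
- output: [structured output]
- decisions: [key judgment calls made]
```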
Batching: dispatch runs in parallel with `run_in_background: true`.

Classify outputs into patterns (a mechanical sketch follows the report template below):
## Simulation Report
### Consistency: X/N (Y%)
### Output Patterns
| Pattern | Count | Runs | Description |
|---------|-------|------|-------------|
| A | 8 | #1-6,#8,#10 | [dominant behavior] |
| B | 2 | #7,#9 | [variant behavior] |
### Divergence Analysis
For each non-dominant pattern:
- Runs: [list]
- Root cause: [why]
- Severity: cosmetic | functional | breaking
- Diff from dominant: [key differences]
### Summary
- Total: N runs across M test cases
- Dominant pattern: A (X%)
- Key findings: ...
- Recommendation: [if applicable]
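As a sketch, once each run's `- pattern:` line has been parsed out, consistency can be computed mechanically. This is plain Python, not a whip API; the labels match the example table above:

```python
from collections import Counter

# Hypothetical parsed pattern labels, one per run.
patterns = ["A", "A", "A", "A", "A", "A", "B", "A", "B", "A"]

counts = Counter(patterns)                       # pattern label -> run count
dominant, dominant_n = counts.most_common(1)[0]  # most frequent pattern
consistency = dominant_n / len(patterns)         # share of runs matching it

print(f"Consistency: {dominant_n}/{len(patterns)} ({consistency:.0%})")
# -> Consistency: 8/10 (80%)
```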
Save the full report with raw agent outputs to /tmp/simulate-{slug}-{timestamp}.md and tell the user the path.
In tracked mode, tasks live in the global workspace and dispatch is delegated to /whip-start; once the report is saved, remove the ephemeral simulation tasks with `whip task clean`.