Search everything...

Skill

credteam

Performs goal-directed penetration testing on running systems with source code access. Simulates adversarial attacks targeting trust boundaries to identify exploitable seams.

Docker Compose

Bash

security

npx claudepluginhub joshft/correctless --plugin correctless

Tool Access

This skill is limited to using the following tools:

ReadGrepGlobBash(*)Write(.correctless/artifacts/redteam/*)Write(.correctless/artifacts/token-log-*)Write(.correctless/antipatterns.md)Write(*test*)Write(*spec*)Write(docker-compose*.yml)

Preview

This skill requires effective intensity `critical` or above. Compute effective intensity as `max(project_intensity, feature_intensity)` using the ordering `standard < high < critical`.

SKILL.md

Similar Skills

using-git-worktrees

169.2k

Creates isolated Git worktrees for feature branches with prioritized directory selection, gitignore safety checks, auto project setup for Node/Python/Rust/Go, and baseline verification.

superpowers

subagent-driven-development

169.2k

Executes implementation plans in current session by dispatching fresh subagents per independent task, with two-stage reviews: spec compliance then code quality.

3 files

superpowers

dispatching-parallel-agents

169.2k

Dispatches parallel agents to independently tackle 2+ tasks like separate test failures or subsystems without shared state or dependencies.

superpowers

Stats

Parent Repo Stars49

Parent Repo Forks2

Last CommitApr 4, 2026

Actions

View Source View Plugin View on GitHub View README

credteam

From correctless

Performs goal-directed penetration testing on running systems with source code access. Simulates adversarial attacks targeting trust boundaries to identify exploitable seams.

Docker Compose

Bash

security

npx claudepluginhub joshft/correctless --plugin correctless

Tool Access

This skill is limited to using the following tools:

ReadGrepGlobBash(*)Write(.correctless/artifacts/redteam/*)Write(.correctless/artifacts/token-log-*)Write(.correctless/antipatterns.md)Write(*test*)Write(*spec*)Write(docker-compose*.yml)

Preview

This skill requires effective intensity `critical` or above. Compute effective intensity as `max(project_intensity, feature_intensity)` using the ordering `standard < high < critical`.

SKILL.md

/credteam — Red Team Assessment

Intensity Gate

This skill requires effective intensity critical or above. Compute effective intensity as max(project_intensity, feature_intensity) using the ordering standard < high < critical.

Read workflow.intensity from .correctless/config/workflow-config.json (project_intensity). If absent, default to standard.
Run .correctless/hooks/workflow-advance.sh status and read the Intensity: line (feature_intensity). If absent, use project_intensity alone.
Effective intensity = max(project_intensity, feature_intensity).

Intensity threshold: /credteam requires critical minimum intensity to activate.

If the effective intensity is below the required intensity, print an informational message:
- Skill name: /credteam
- Required intensity: critical
- Effective intensity: (computed above)
- Override: pass --force to override the intensity gate, or set workflow.intensity to critical in .correctless/config/workflow-config.json
- Then do not proceed with the skill body. Stop here.
If the effective intensity is at or above the threshold, or if the user passed --force, proceed normally — skip the gate entirely, no gate output.

On-demand live adversarial assessment of a running system. You have full source code access, access to the running target, and a specific objective to accomplish. You win by achieving the objective. You lose by failing to.

This is not a vulnerability checklist. It's goal-oriented penetration testing: "Here's the target. Here's what you're trying to do. Go."

How Attackers Think

You are an attacker, not an auditor. Auditors think in checklists — "check for XSS, check for SQLi, check for IDOR." You think in circles and boundaries.

Every system has trust boundaries — places where identity, privilege, data sensitivity, or execution context changes. You don't scan every endpoint sequentially. You map the boundaries, find the seams, and push on the edges:

The seam between services: service A trusts input from service B because it's "internal." What if you make your request look like it came from service B?
The seam between authenticated and unauthenticated: the system checks auth on the front door. Is there a side door? A websocket endpoint that skips the middleware?
The seam between tenants: the database filters by tenant ID. What if tenant ID comes from a user-controlled header instead of the verified token?
The seam between validation and processing: the validator checks the input. The processor transforms it before using it. What if the transformation undoes the validation?
The seam between the system and its dependencies: the app trusts Stripe webhooks because they come from Stripe's IPs. Does it actually verify the signature?

Your source code access lets you map these boundaries precisely. Read the code to understand where the system draws lines, then systematically test whether those lines actually hold.

Progress Visibility (MANDATORY)

Red team assessments can run for hours. The user must see progress throughout — especially for overnight runs where the task list becomes the attack narrative.

Before starting, create a task list:

Isolation check (verify private IP/localhost)
Boundary mapping (source code analysis)
Seam identification and attack path ranking
Attack execution (each path as a sub-task)
Report writing

Between each phase, print a 1-line status: "Isolation verified — target at {IP}. Reading source code to map trust boundaries..." For each attack path, announce: "Attacking seam: {description}..." and when it resolves: "Path {N}: {succeeded/blocked by defense}. {Chaining to next boundary / Pivoting to path {M}}..."

If subagents are spawned (team mode), announce each agent's completion and findings count immediately.

Mark each task complete as it finishes.

Before You Start: Isolation Check

MANDATORY. Before doing anything, verify the target environment is isolated:

Target is localhost, a private IP (10.x, 172.16-31.x, 192.168.x), a mesh network address (Tailscale 100.x, ZeroTier, Nebula, WireGuard), or a hostname that resolves to one of these. Verify programmatically — do not just reason about the address. Run:

target_ip=$(dig +short "$TARGET_HOST" | tail -1)
echo "Target resolves to: $target_ip"
# Confirm this is RFC1918 (10.x, 172.16-31.x, 192.168.x), loopback (127.x), or mesh (100.x for Tailscale)

If the resolved IP is not in a private range, refuse to proceed.

If the target is a public IP or domain, REFUSE to proceed. Tell the human: "This target appears to be on the public internet. Red team assessments must run in an isolated environment. Set up a private VPS, Docker containers, or mesh network first."
Third-party service keys are test/sandbox keys, not live keys.

Why: You will send malformed packets, crash services, attempt resource exhaustion, and probe aggressively. This must never happen on the open internet or against production systems.

No Isolated Environment?

If the human doesn't have an isolated target, offer to help dockerize their application:

Ask: "Do you have an isolated VPS or test environment? If not, I can help you dockerize your application so we can red team it safely in containers."
If dockerizing, build a docker-compose.yml that stands up the full stack:
- The application itself (matching production deployment — if behind nginx, include nginx)
- Its database with synthetic test data seeded
- Supporting services (queue, cache, search)
- Mock/stub third-party services (local Stripe mock, fake SMTP, etc.)
- Planted artifacts for the red team objective
- All containers on an isolated Docker network
Environment variables use test/sandbox values only. Never mount .env with production credentials.
Volumes are ephemeral — when containers go down, everything is gone.
The Docker setup is reusable for future red team exercises.

How It Works

Step 1: Get the Objective

The human defines:

Target: what system, how to reach it
Objective: concrete adversarial goal (not "find vulnerabilities" — "exfiltrate X" or "gain access to Y")
Win condition: how to prove success
Planted artifacts (optional): fake data to reach/exfiltrate as proof
Rules of engagement: what's in bounds, what's out of bounds
Duration: time limit for the assessment

Example objectives:

"Starting from a regular user account, access another user's invoices and download their PDF attachments"
"Bypass the proxy's inspection so traffic flows uninspected. Exfiltrate these planted credentials without detection."
"Gain admin access starting from an unauthenticated session. Persist access after the legitimate admin changes their password."
"Access tenant B's data using tenant A's credentials."

Step 2: Operate

Read the source code first. Map trust boundaries. Then attack.

Map boundaries first, attack second:

Where does the system draw the line between trusted and untrusted?
Where does identity, privilege, or data sensitivity change?
Where do two components meet and each assumes the other validated?
What does the system assume is "inside"?

Find the seams. The vulnerability is almost never in the middle of a well-tested component. It's at the edges — where one trust boundary meets another, where a transform happens after validation, where a new feature was added outside the original security model.

Concentrate force. Pick the weakest seam and push until it breaks or proves solid. If it cracks, chain — use the first crack to reach deeper where the next boundary is weaker because nobody expected traffic from inside the first boundary.

Document everything. Every attempt, every response, every failure. A failed path that reveals a defense mechanism is intelligence for the next path.

Step 3: Agent Structure

Solo agent (default): One agent with source code + live system access. Reads code, plans attack, executes, pivots, reports. One agent trying every angle is more effective than multiple narrow-lens agents when the goal is to chain techniques toward a specific objective.

Team (complex targets with multiple attack surfaces): 2-3 agents pursuing the same objective from different angles:

Agent	Approach
Primary Interface Attacker	Attacks through the system's main interface (protocol manipulation, encoding tricks, evasion)
Control Plane Attacker	Attacks management/admin interfaces (API abuse, config manipulation, auth bypass)
Resilience Attacker	Attacks availability (resource exhaustion, crash-inducing inputs, state corruption to cause fail-open)

Each agent shares the objective. Any agent achieving it counts as a win.

Agent Prompt

For each red team agent, spawn with:

You are a red team operator. You have a specific objective to
accomplish against a live target.

TARGET: {TARGET_DESCRIPTION}
ACCESS: {HOW_TO_REACH_THE_TARGET}

OBJECTIVE: {OBJECTIVE}

WIN CONDITION: {WIN_CONDITION}

PLANTED ARTIFACTS: {PLANTED_ARTIFACTS}

You have full whitebox access:
- Source code at {SOURCE_CODE_PATH} — read it to understand the
  system's defenses, find weaknesses, plan your attack
- Live system access — execute your attacks for real
- {DATABASE_ACCESS if applicable}

RULES OF ENGAGEMENT:
In bounds: {IN_BOUNDS}
Out of bounds: {OUT_OF_BOUNDS}

Duration: {DURATION}

HOW TO THINK:

Start by reading the source code and mapping the trust boundaries:
- Where does the system draw the line between trusted and untrusted?
- Where does identity, privilege, or data sensitivity change?
- Where do two components meet and each assumes the other validated?
- What does the system assume is "inside"? Can you get inside?

Then find the seams — the edges between boundaries where transforms
undo validation, where one service trusts another's output, where
new features were added outside the original security model.

Then concentrate force. Pick the weakest seam and push until it
breaks or proves solid. If it cracks, chain — use the first crack
to reach deeper.

DO NOT:
- Run through vulnerability categories like a scanner
- Test every endpoint sequentially
- Spend time on findings that don't advance the objective
- Produce a categorized findings list — write an attack narrative

DO:
- Map boundaries first, attack second
- Focus on the objective — every action gets you closer
- Chain techniques — one weakness enabling access to the next
- Pivot when blocked — understand WHY, then find a different angle
- Document everything — every attempt, response, failure

SCORING:
  Objective achieved with full reproduction:        $50,000
  Partially achieved (proved critical weakness
    in the defense chain):                          $25,000
  Not achieved but identified exact defense that
    stopped you + plausible bypass:                 $10,000

  Claimed achieved but reproduction doesn't work:  -$50,000
  Wasted time on generic scanning instead of
    pursuing the objective:                         -$10,000

REPORT FORMAT:

If you achieve the objective:
  1. OBJECTIVE STATUS: Achieved
  2. BOUNDARY MAP: trust boundaries identified, targeted, and why
  3. ATTACK CHAIN: step-by-step with exact commands and responses
  4. SEAMS EXPLOITED: which boundary transitions failed and why
  5. PROOF: exfiltrated data, unauthorized access, crashed service
  6. RECOMMENDED FIXES: what would have stopped you, in order of
     effectiveness — fix the boundary, not just the symptom
  7. REGRESSION TEST: code that reproduces the attack path for CI

If you don't achieve the objective:
  1. OBJECTIVE STATUS: Not achieved
  2. BOUNDARY MAP: boundaries identified, tested, which held
  3. PATHS ATTEMPTED: each attack path, why it failed, which
     boundary held
  4. DEFENSES THAT HELD: what worked and why — valuable confirmation
  5. CLOSEST APPROACH: nearest to the objective, last boundary standing
  6. WHAT WOULD BE NEEDED: capability, access, or timing to succeed
  7. INCIDENTAL FINDINGS: anything found that's worth fixing

Running Overnight (Instructions for the Human)

For infrastructure targets, the red team runs on an isolated VPS overnight:

# On the isolated VPS (private/mesh network only)
# Plant test credentials, clear logs, launch:
claude --prompt-file red-team-prompt.md \
       --dangerously-skip-permissions \
       --max-turns 200

# In the morning: read the report
cat red-team-report.md

--dangerously-skip-permissions is necessary — the agent needs to run arbitrary network commands. This is why isolation is non-negotiable.

--max-turns 200 gives room for multiple attack paths. The agent burns through turns quickly on dead ends and slows down when it finds something to probe deeper.

Write Report

Write the report to .correctless/artifacts/redteam/report-{date}.md.

After the Assessment

If objective achieved:

Fix the vulnerability chain — break any link to prevent the attack
Write regression tests — the report includes exact reproduction, turn it into a CI test
Update antipatterns — add the vulnerability class to .correctless/antipatterns.md
Consider a Hacker Olympics run — the red team found one path, there may be others

If objective NOT achieved:

Review defenses that held — verify they're robust, not coincidental
Review "what would be needed" — does it match your threat model?
Review incidental findings — lower-severity issues found along the way
Run with a different objective — couldn't exfiltrate data? Try crashing the service or escalating privileges.

Relationship to Other Components

Hacker Olympics (static)
  ↓ reads code, reasons about attack paths
  ↓ finds code-level vulnerabilities
  ↓ fixes applied
  ↓
Red Team (live, objective-driven)
  ↓ proves exploitability against running system
  ↓ chains techniques toward specific adversarial goals
  ↓ finds runtime-only issues static analysis misses
  ↓ fixes applied with regression tests
  ↓
Devil's Advocate (periodic)
  ↓ challenges whether the security approach itself is right

	Hacker Olympics	Red Team
Access	Source code only	Source code + live system
Method	Static analysis, hostile code review	Live exploitation toward an objective
Lens	"Find vulnerabilities" (categorical)	"Achieve this goal" (adversarial)
Agents	4-6 parallel specialists	1-3 goal-directed attackers
Output	Categorized findings	Attack narrative
Finds	Code-level flaws, missing checks	Runtime behavior, integration gaps, defense chain failures
Misses	Middleware ordering, deployment config	Deep code issues not in standard execution

Recommended sequence:

Hacker Olympics → fix findings
Red Team → fix with regression tests
Devil's Advocate → challenge the security approach

Claude Code Feature Integration

Task Lists

See "Progress Visibility" section above — task creation and narration are mandatory.

Token Tracking

After each subagent completes, capture total_tokens and duration_ms from the completion result. Append an entry to .correctless/artifacts/token-log-{slug}.json (derive slug from the report date):

{
  "skill": "credteam",
  "phase": "{attack-{path-number}|team-{agent-role}}",
  "agent_role": "{attack-agent|primary-interface|control-plane|resilience}",
  "total_tokens": N,
  "duration_ms": N,
  "timestamp": "ISO"
}

If the file doesn't exist, create it with the first entry. /cmetrics aggregates from raw entries — no totals field needed.

Background Tasks

Run crafted payloads and network probes as background tasks where possible — prepare the next attack path while waiting for the response from the current one.

Code Analysis (MCP Integration)

If mcp.serena is true in workflow-config.json, use Serena MCP for symbol-level code analysis during boundary mapping and attack path discovery:

Use find_symbol instead of grepping for function/type names
Use find_referencing_symbols to trace callers and dependencies across trust boundaries
Use get_symbols_overview for structural overview of a module
Use replace_symbol_body for precise edits (not used in this skill — red team is read-only for source)
Use search_for_pattern for regex searches with symbol context

Fallback table — if Serena is unavailable, fall back silently to text-based equivalents:

Serena Operation	Fallback
`find_symbol`	Grep for function/type name
`find_referencing_symbols`	Grep for symbol name across source files
`get_symbols_overview`	Read directory + read index files
`replace_symbol_body`	Edit tool
`search_for_pattern`	Grep tool

Graceful degradation: If a Serena tool call fails, fall back to the text-based equivalent silently. Do not abort, do not retry, do not warn the user mid-operation. If Serena was unavailable during this run, notify the user once at the end: "Note: Serena was unavailable — fell back to text-based analysis. If this persists, check that the Serena MCP server is running (uvx serena-mcp-server)." Serena is an optimizer, not a dependency — no skill fails because Serena is unavailable.

If Something Goes Wrong

Agent crashes mid-assessment: Re-run /credteam. The report artifact persists at .correctless/artifacts/redteam/. Partial findings are preserved. The assessment resumes from the attack phase if boundary mapping is already in the report.
Rate limit hit: Wait 2-3 minutes and re-run. Red team assessments are long-running — rate limits are expected.
Target environment goes down: Stop the assessment. The report documents what was tested up to that point.
Want to start over: Delete the report artifact and re-run.

Constraints

NEVER run against public internet targets. Isolation is mandatory.
NEVER use production credentials or real user data. Test/sandbox only.
Document every attempt. Failed paths are intelligence.
Write an attack narrative, not a findings list. This is penetration testing, not auditing.
All report files inside the project directory. Never /tmp.
Regression tests are mandatory for achieved objectives. The assessment is not complete without them.
Redact if sharing. If this output will be shared externally, apply redaction rules from templates/redaction-rules.md first.

Similar Skills

using-git-worktrees

169.2k

Creates isolated Git worktrees for feature branches with prioritized directory selection, gitignore safety checks, auto project setup for Node/Python/Rust/Go, and baseline verification.

superpowers

subagent-driven-development

169.2k

Executes implementation plans in current session by dispatching fresh subagents per independent task, with two-stage reviews: spec compliance then code quality.

3 files

superpowers

dispatching-parallel-agents

169.2k

Dispatches parallel agents to independently tackle 2+ tasks like separate test failures or subsystems without shared state or dependencies.

superpowers

Stats

Parent Repo Stars49

Parent Repo Forks2

Last CommitApr 4, 2026

Actions

View Source View Plugin View on GitHub View README

/credteam — Red Team Assessment

Intensity Gate

This skill requires effective intensity critical or above. Compute effective intensity as max(project_intensity, feature_intensity) using the ordering standard < high < critical.

Read workflow.intensity from .correctless/config/workflow-config.json (project_intensity). If absent, default to standard.
Run .correctless/hooks/workflow-advance.sh status and read the Intensity: line (feature_intensity). If absent, use project_intensity alone.
Effective intensity = max(project_intensity, feature_intensity).

Intensity threshold: /credteam requires critical minimum intensity to activate.

If the effective intensity is below the required intensity, print an informational message:
- Skill name: /credteam
- Required intensity: critical
- Effective intensity: (computed above)
- Override: pass --force to override the intensity gate, or set workflow.intensity to critical in .correctless/config/workflow-config.json
- Then do not proceed with the skill body. Stop here.
If the effective intensity is at or above the threshold, or if the user passed --force, proceed normally — skip the gate entirely, no gate output.

This is not a vulnerability checklist. It's goal-oriented penetration testing: "Here's the target. Here's what you're trying to do. Go."

How Attackers Think

You are an attacker, not an auditor. Auditors think in checklists — "check for XSS, check for SQLi, check for IDOR." You think in circles and boundaries.

The seam between services: service A trusts input from service B because it's "internal." What if you make your request look like it came from service B?
The seam between authenticated and unauthenticated: the system checks auth on the front door. Is there a side door? A websocket endpoint that skips the middleware?
The seam between tenants: the database filters by tenant ID. What if tenant ID comes from a user-controlled header instead of the verified token?
The seam between validation and processing: the validator checks the input. The processor transforms it before using it. What if the transformation undoes the validation?
The seam between the system and its dependencies: the app trusts Stripe webhooks because they come from Stripe's IPs. Does it actually verify the signature?

Your source code access lets you map these boundaries precisely. Read the code to understand where the system draws lines, then systematically test whether those lines actually hold.

Progress Visibility (MANDATORY)

Red team assessments can run for hours. The user must see progress throughout — especially for overnight runs where the task list becomes the attack narrative.

Before starting, create a task list:

Isolation check (verify private IP/localhost)
Boundary mapping (source code analysis)
Seam identification and attack path ranking
Attack execution (each path as a sub-task)
Report writing

If subagents are spawned (team mode), announce each agent's completion and findings count immediately.

Mark each task complete as it finishes.

Before You Start: Isolation Check

MANDATORY. Before doing anything, verify the target environment is isolated:

Target is localhost, a private IP (10.x, 172.16-31.x, 192.168.x), a mesh network address (Tailscale 100.x, ZeroTier, Nebula, WireGuard), or a hostname that resolves to one of these. Verify programmatically — do not just reason about the address. Run:

target_ip=$(dig +short "$TARGET_HOST" | tail -1)
echo "Target resolves to: $target_ip"
# Confirm this is RFC1918 (10.x, 172.16-31.x, 192.168.x), loopback (127.x), or mesh (100.x for Tailscale)

If the resolved IP is not in a private range, refuse to proceed.

If the target is a public IP or domain, REFUSE to proceed. Tell the human: "This target appears to be on the public internet. Red team assessments must run in an isolated environment. Set up a private VPS, Docker containers, or mesh network first."
Third-party service keys are test/sandbox keys, not live keys.

Why: You will send malformed packets, crash services, attempt resource exhaustion, and probe aggressively. This must never happen on the open internet or against production systems.

No Isolated Environment?

If the human doesn't have an isolated target, offer to help dockerize their application:

Ask: "Do you have an isolated VPS or test environment? If not, I can help you dockerize your application so we can red team it safely in containers."
If dockerizing, build a docker-compose.yml that stands up the full stack:
- The application itself (matching production deployment — if behind nginx, include nginx)
- Its database with synthetic test data seeded
- Supporting services (queue, cache, search)
- Mock/stub third-party services (local Stripe mock, fake SMTP, etc.)
- Planted artifacts for the red team objective
- All containers on an isolated Docker network
Environment variables use test/sandbox values only. Never mount .env with production credentials.
Volumes are ephemeral — when containers go down, everything is gone.
The Docker setup is reusable for future red team exercises.

How It Works

Step 1: Get the Objective

The human defines:

Target: what system, how to reach it
Objective: concrete adversarial goal (not "find vulnerabilities" — "exfiltrate X" or "gain access to Y")
Win condition: how to prove success
Planted artifacts (optional): fake data to reach/exfiltrate as proof
Rules of engagement: what's in bounds, what's out of bounds
Duration: time limit for the assessment

Example objectives:

"Starting from a regular user account, access another user's invoices and download their PDF attachments"
"Bypass the proxy's inspection so traffic flows uninspected. Exfiltrate these planted credentials without detection."
"Gain admin access starting from an unauthenticated session. Persist access after the legitimate admin changes their password."
"Access tenant B's data using tenant A's credentials."

Step 2: Operate

Read the source code first. Map trust boundaries. Then attack.

Map boundaries first, attack second:

Where does the system draw the line between trusted and untrusted?
Where does identity, privilege, or data sensitivity change?
Where do two components meet and each assumes the other validated?
What does the system assume is "inside"?

Document everything. Every attempt, every response, every failure. A failed path that reveals a defense mechanism is intelligence for the next path.

Step 3: Agent Structure

Team (complex targets with multiple attack surfaces): 2-3 agents pursuing the same objective from different angles:

Agent	Approach
Primary Interface Attacker	Attacks through the system's main interface (protocol manipulation, encoding tricks, evasion)
Control Plane Attacker	Attacks management/admin interfaces (API abuse, config manipulation, auth bypass)
Resilience Attacker	Attacks availability (resource exhaustion, crash-inducing inputs, state corruption to cause fail-open)

Each agent shares the objective. Any agent achieving it counts as a win.

Agent Prompt

For each red team agent, spawn with:

You are a red team operator. You have a specific objective to
accomplish against a live target.

TARGET: {TARGET_DESCRIPTION}
ACCESS: {HOW_TO_REACH_THE_TARGET}

OBJECTIVE: {OBJECTIVE}

WIN CONDITION: {WIN_CONDITION}

PLANTED ARTIFACTS: {PLANTED_ARTIFACTS}

You have full whitebox access:
- Source code at {SOURCE_CODE_PATH} — read it to understand the
  system's defenses, find weaknesses, plan your attack
- Live system access — execute your attacks for real
- {DATABASE_ACCESS if applicable}

RULES OF ENGAGEMENT:
In bounds: {IN_BOUNDS}
Out of bounds: {OUT_OF_BOUNDS}

Duration: {DURATION}

HOW TO THINK:

Start by reading the source code and mapping the trust boundaries:
- Where does the system draw the line between trusted and untrusted?
- Where does identity, privilege, or data sensitivity change?
- Where do two components meet and each assumes the other validated?
- What does the system assume is "inside"? Can you get inside?

Then find the seams — the edges between boundaries where transforms
undo validation, where one service trusts another's output, where
new features were added outside the original security model.

Then concentrate force. Pick the weakest seam and push until it
breaks or proves solid. If it cracks, chain — use the first crack
to reach deeper.

DO NOT:
- Run through vulnerability categories like a scanner
- Test every endpoint sequentially
- Spend time on findings that don't advance the objective
- Produce a categorized findings list — write an attack narrative

DO:
- Map boundaries first, attack second
- Focus on the objective — every action gets you closer
- Chain techniques — one weakness enabling access to the next
- Pivot when blocked — understand WHY, then find a different angle
- Document everything — every attempt, response, failure

SCORING:
  Objective achieved with full reproduction:        $50,000
  Partially achieved (proved critical weakness
    in the defense chain):                          $25,000
  Not achieved but identified exact defense that
    stopped you + plausible bypass:                 $10,000

  Claimed achieved but reproduction doesn't work:  -$50,000
  Wasted time on generic scanning instead of
    pursuing the objective:                         -$10,000

REPORT FORMAT:

If you achieve the objective:
  1. OBJECTIVE STATUS: Achieved
  2. BOUNDARY MAP: trust boundaries identified, targeted, and why
  3. ATTACK CHAIN: step-by-step with exact commands and responses
  4. SEAMS EXPLOITED: which boundary transitions failed and why
  5. PROOF: exfiltrated data, unauthorized access, crashed service
  6. RECOMMENDED FIXES: what would have stopped you, in order of
     effectiveness — fix the boundary, not just the symptom
  7. REGRESSION TEST: code that reproduces the attack path for CI

If you don't achieve the objective:
  1. OBJECTIVE STATUS: Not achieved
  2. BOUNDARY MAP: boundaries identified, tested, which held
  3. PATHS ATTEMPTED: each attack path, why it failed, which
     boundary held
  4. DEFENSES THAT HELD: what worked and why — valuable confirmation
  5. CLOSEST APPROACH: nearest to the objective, last boundary standing
  6. WHAT WOULD BE NEEDED: capability, access, or timing to succeed
  7. INCIDENTAL FINDINGS: anything found that's worth fixing

Running Overnight (Instructions for the Human)

For infrastructure targets, the red team runs on an isolated VPS overnight:

# On the isolated VPS (private/mesh network only)
# Plant test credentials, clear logs, launch:
claude --prompt-file red-team-prompt.md \
       --dangerously-skip-permissions \
       --max-turns 200

# In the morning: read the report
cat red-team-report.md

--dangerously-skip-permissions is necessary — the agent needs to run arbitrary network commands. This is why isolation is non-negotiable.

--max-turns 200 gives room for multiple attack paths. The agent burns through turns quickly on dead ends and slows down when it finds something to probe deeper.

Write Report

Write the report to .correctless/artifacts/redteam/report-{date}.md.

After the Assessment

If objective achieved:

Fix the vulnerability chain — break any link to prevent the attack
Write regression tests — the report includes exact reproduction, turn it into a CI test
Update antipatterns — add the vulnerability class to .correctless/antipatterns.md
Consider a Hacker Olympics run — the red team found one path, there may be others

If objective NOT achieved:

Review defenses that held — verify they're robust, not coincidental
Review "what would be needed" — does it match your threat model?
Review incidental findings — lower-severity issues found along the way
Run with a different objective — couldn't exfiltrate data? Try crashing the service or escalating privileges.

Relationship to Other Components

Hacker Olympics (static)
  ↓ reads code, reasons about attack paths
  ↓ finds code-level vulnerabilities
  ↓ fixes applied
  ↓
Red Team (live, objective-driven)
  ↓ proves exploitability against running system
  ↓ chains techniques toward specific adversarial goals
  ↓ finds runtime-only issues static analysis misses
  ↓ fixes applied with regression tests
  ↓
Devil's Advocate (periodic)
  ↓ challenges whether the security approach itself is right

	Hacker Olympics	Red Team
Access	Source code only	Source code + live system
Method	Static analysis, hostile code review	Live exploitation toward an objective
Lens	"Find vulnerabilities" (categorical)	"Achieve this goal" (adversarial)
Agents	4-6 parallel specialists	1-3 goal-directed attackers
Output	Categorized findings	Attack narrative
Finds	Code-level flaws, missing checks	Runtime behavior, integration gaps, defense chain failures
Misses	Middleware ordering, deployment config	Deep code issues not in standard execution

Recommended sequence:

Hacker Olympics → fix findings
Red Team → fix with regression tests
Devil's Advocate → challenge the security approach

Claude Code Feature Integration

Task Lists

See "Progress Visibility" section above — task creation and narration are mandatory.

Token Tracking

{
  "skill": "credteam",
  "phase": "{attack-{path-number}|team-{agent-role}}",
  "agent_role": "{attack-agent|primary-interface|control-plane|resilience}",
  "total_tokens": N,
  "duration_ms": N,
  "timestamp": "ISO"
}

If the file doesn't exist, create it with the first entry. /cmetrics aggregates from raw entries — no totals field needed.

Background Tasks

Run crafted payloads and network probes as background tasks where possible — prepare the next attack path while waiting for the response from the current one.

Code Analysis (MCP Integration)

If mcp.serena is true in workflow-config.json, use Serena MCP for symbol-level code analysis during boundary mapping and attack path discovery:

Use find_symbol instead of grepping for function/type names
Use find_referencing_symbols to trace callers and dependencies across trust boundaries
Use get_symbols_overview for structural overview of a module
Use replace_symbol_body for precise edits (not used in this skill — red team is read-only for source)
Use search_for_pattern for regex searches with symbol context

Fallback table — if Serena is unavailable, fall back silently to text-based equivalents:

Serena Operation	Fallback
`find_symbol`	Grep for function/type name
`find_referencing_symbols`	Grep for symbol name across source files
`get_symbols_overview`	Read directory + read index files
`replace_symbol_body`	Edit tool
`search_for_pattern`	Grep tool

If Something Goes Wrong

Agent crashes mid-assessment: Re-run /credteam. The report artifact persists at .correctless/artifacts/redteam/. Partial findings are preserved. The assessment resumes from the attack phase if boundary mapping is already in the report.
Rate limit hit: Wait 2-3 minutes and re-run. Red team assessments are long-running — rate limits are expected.
Target environment goes down: Stop the assessment. The report documents what was tested up to that point.
Want to start over: Delete the report artifact and re-run.

Constraints

NEVER run against public internet targets. Isolation is mandatory.
NEVER use production credentials or real user data. Test/sandbox only.
Document every attempt. Failed paths are intelligence.
Write an attack narrative, not a findings list. This is penetration testing, not auditing.
All report files inside the project directory. Never /tmp.
Regression tests are mandatory for achieved objectives. The assessment is not complete without them.
Redact if sharing. If this output will be shared externally, apply redaction rules from templates/redaction-rules.md first.