Use when verifying vulnerabilities with Dynamic Exploit Generation (Phase 3)
From perseusnpx claudepluginhub kaivyy/perseus --plugin perseusThis skill uses the workspace's default tool permissions.
Designs and optimizes AI agent action spaces, tool definitions, observation formats, error recovery, and context for higher task completion rates.
Enables AI agents to execute x402 payments with per-task budgets, spending controls, and non-custodial wallets via MCP tools. Use when agents pay for APIs, services, or other agents.
Compares coding agents like Claude Code and Aider on custom YAML-defined codebase tasks using git worktrees, measuring pass rate, cost, time, and consistency.
This skill executes the Verification Phase with controlled exploit simulation. Instead of blind payload spraying, Perseus validates each finding with environment-aware, low-risk proofs.
Goal: Reduce false positives while preserving production safety and service stability.
Safety Rule: Never run destructive actions. Never degrade availability.
| Mode | Environment | Verification Depth |
|---|---|---|
PRODUCTION_SAFE | Live production | Passive-first checks + minimal non-disruptive proofs |
STAGING_ACTIVE | Staging/pre-prod | Targeted active verification with strict throttling |
LAB_FULL | Isolated lab | Full dynamic verification and payload mutation |
LAB_RED_TEAM | Dedicated security lab | Multi-step chain simulation and resilience stress tests |
If mode is not explicitly provided, default to PRODUCTION_SAFE.
VERIFIED, FAILED, or POTENTIAL-PROD-BLOCKED.Immediately pause verification and mark run as ABORTED-SAFETY when any threshold is exceeded:
Create/update deliverables/verification_scope.md with:
PRODUCTION_SAFE, limit to replay-like checks and non-invasive timing tests.alert(1) pattern detection).PRODUCTION_SAFE: static and configuration-based verification only.STAGING_ACTIVE/LAB_FULL/LAB_RED_TEAM: controlled callback validation using approved endpoint only.LAB_RED_TEAM only)Run controlled attack-chain validation on synthetic test data:
Record where the chain is broken and which control stopped progression.
PRODUCTION_SAFE: maximum 2 attempts per finding, then mark POTENTIAL-PROD-BLOCKED.STAGING_ACTIVE: allow targeted payload variation with throttling.LAB_FULL: allow broader mutation strategies, still non-destructive.LAB_RED_TEAM: allow chain-based scenario replay with hard caps per scenario.For each verified exploit, log the result in deliverables/exploitation_report.md:
PRODUCTION_SAFE | STAGING_ACTIVE | LAB_FULL | LAB_RED_TEAMVERIFIED | FAILED | POTENTIAL-PROD-BLOCKED | ABORTED-SAFETYHigh | Medium | LowNext Step: Proceed to perseus:report to document the verified findings.