From code-auditor-agent
Trigger with "ecaa self test", "run efficacy gate". Use when verifying the ultracode engine catches every seeded bug in the bundled fixtures — before release or on a schedule. Runs the pytest gate plus one engine pass.
How this skill is triggered — by the user, by Claude, or both
Slash command
/code-auditor-agent:ecaa-self-test-24The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Efficacy self-test for the migrated **ultracode engine** (`scripts/workflows/caa-engine.js`). Two halves:
references/fixtures/agent-01/buggy.pyreferences/fixtures/agent-02/actual_diff.patchreferences/fixtures/agent-02/pr_description.mdreferences/fixtures/agent-03/callers.pyreferences/fixtures/agent-03/diff.patchreferences/fixtures/agent-04/auth.pyreferences/fixtures/agent-17/orders.pyreferences/fixtures/agent-17/payments.pyreferences/fixtures/agent-17/products.pyreferences/fixtures/agent-17/users.pyreferences/fixtures/agent-18/cache.pyreferences/fixtures/agent-20-api/routes.pyreferences/fixtures/agent-20-docker/container_spec.txtreferences/fixtures/agent-20-frontend/App.tsxreferences/fixtures/agent-20-graphql/schema.graphqlreferences/fixtures/agent-20-jwt/signer.pyreferences/fixtures/agent-20-prompt/prompts.pyreferences/fixtures/agent-21-elixir/user_service.exreferences/fixtures/agent-21-i18n/messages.pyreferences/fixtures/agent-21-ios/NetworkManager.swiftEfficacy self-test for the migrated ultracode engine (scripts/workflows/caa-engine.js). Two halves:
tests/integration/test_pipeline_efficacy.py) — encoding, lint, structure, etc.caa-engine once over the bundled seeded-bug fixtures
(references/fixtures/, one per former specialist domain) with ALL domain lenses active, and
assert the consolidated report flags a finding in EVERY fixture (each fixture contains ≥1 seeded
defect). Replaces the old per-agent dispatches — combined + domain lenses embody every former agent's logic.uv + pytest installed; session effort max/xhigh (the engine is opus-only).<main-repo-root>/reports/code-auditor-agent/efficacy-audit/.MAIN_ROOT="$(git worktree list | head -n1 | awk '{print $1}')", TS="$(date +%Y%m%d_%H%M%S%z)".$MAIN_ROOT run uv run pytest tests/integration/test_pipeline_efficacy.py -v --tb=short > "/tmp/ecaa-$TS-pytest.log" 2>&1 (do NOT abort on non-zero — record it).Workflow tool, SKIP this half — record SKIPPED (ultracode required) in the verdict and assert only on the script half (the simple-scan fallback can't substitute;
this gate tests the ENGINE). Else resolve the fixture files (git -C "$MAIN_ROOT" ls-files "skills/ecaa-self-test-24/references/fixtures" → absolute paths) and invoke the engine with EVERY domain lens active:
Workflow({scriptPath: "${CLAUDE_PLUGIN_ROOT}/scripts/workflows/caa-engine.js", args: {root: "$MAIN_ROOT", files: [<fixture abs paths>], mode: "scan", reportType: "audit", reportSuffix: "ecaa-self-test", runId: "ecaa-$TS", domainLenses: ["docker","solidity","ios-native","graphql","elixir","frontend","monorepo","i18n","l10n","jwt","prompt-injection","logging","mcp-server","api-design","type-design","assumption","function-contract","pre-mortem","architecture-consistency"], lensDir: "${CLAUDE_PLUGIN_ROOT}/scripts/workflows/lenses", conc: 6}}).
No lensSet (default combined; the engine fail-fasts on unknown values).finalReport, every seeded-bug fixture (all EXCEPT those under
references/fixtures/clean-suspicious/) must appear with ≥1 confirmed finding; zero on a seeded
fixture is a MISS.references/fixtures/clean-suspicious/ must have ZERO
confirmed CRITICAL/MAJOR (a flagged-then-REFUTED/DOWNGRADED finding passes — see the "Refuted /
downgraded" section). A confirmed CRITICAL/MAJOR on a clean fixture is a FALSE-POSITIVE failure.[PASS] ecaa-self-test-24 — recall <N>/<N> · precision <C>/<C> · pytest <P>/<P> / [PARTIAL] … / [FAIL] … — and write the report + a verdict JSON to <main-repo-root>/reports/code-auditor-agent/efficacy-audit/<TS>-self-test.{md,json}. Report BOTH recall and precision.The consolidated report + verdict line/JSON under reports/code-auditor-agent/efficacy-audit/. PASS = every seeded fixture flagged + pytest detectors passed.
[FAIL] ecaa-self-test-24 — pytest not installed.[PARTIAL].[FAIL] immediately. The engine itself is robust by construction (.catch + rate-limit re-queue)."run ecaa self test" → pytest gate + one engine pass over references/fixtures/ → verdict
scripts/workflows/caa-engine.js — the engine under test.references/fixtures/ — seeded-bug fixtures (one per former specialist domain).tests/integration/test_pipeline_efficacy.py — the script-detector half.npx claudepluginhub emasoft/emasoft-plugins --plugin code-auditor-agentCreates bite-sized, testable implementation plans from specs or requirements, with file structure and task decomposition. Activates before coding multi-step tasks.