From soundcheck
Detects and prevents autonomous LLM agents from taking irreversible or high-impact actions without human approval. Use when building agentic workflows with tool use.
How this skill is triggered — by the user, by Claude, or both
Slash command
/soundcheck:excessive-agencyThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Prevents autonomous agents from taking irreversible or high-impact actions without
Prevents autonomous agents from taking irreversible or high-impact actions without human oversight. When an LLM can directly write files, send emails, or modify databases, a single compromised or hallucinated step can cause unrecoverable damage.
Flag the vulnerable code and explain the risk. Then suggest a fix that establishes these properties:
run_sql that accepts arbitrary queries violates
this; a tool named archive_record(id) that only issues a scoped update does not.Translate these principles to the audited file's language, agent framework, and tool-dispatch surface. Use the framework's documented approval / human-in-the-loop hook — do not invent ad-hoc confirmation prompts inside the prompt template.
Confirm these properties hold regardless of language or framework:
npx claudepluginhub thejefflarson/soundcheck --plugin soundcheckConstrain LLM agent capabilities with least-privilege controls and human-in-the-loop gates to prevent excessive agency in autonomous AI systems.
Audit applications for AI prompt injection, agent security, and LLM permission boundary vulnerabilities. Use when securing AI features or agents.
Detects code patterns where LLM output is treated as authoritative fact without human review. Flags missing disclaimers, confidence gates, and audit logging in high-stakes contexts.