Search everything...

Stats

Actions

Available In

mega-security

Name: mega-security
Author: mega-edo

By mega-edo

Hardens LLM agent pipelines and system prompts by running 100 attack simulations (injection, jailbreak, PII leak), auditing code against OWASP Top 10, and autonomously optimizing workflows with compliance and safety checks.

Publisher marketplacemega-security@mega-edo · marketplace and plugin share one repository (mega-edo/mega-security)

npx claudepluginhub mega-edo/mega-security --plugin mega-security

Popularity

Stars

Top 25%

Med: 0·Avg: 826

Copy clicks

Med: 0·Avg: 2

What's Inside

Agents13

attack-suite-curator

/attack-suite-curator

Curates the Red Team simulation suite for mega-security (the "attack" half of dual-axis evaluation) from public benchmarks, respecting activated threat-simulation tiers, product profile, and contamination weighting. Outputs JSONL probe sets with train/val split + manifest. The val split is distributionally distant (multilingual / rephrased / novel-entity) from train to enforce defense generalization and prevent cherry-picking.

benign-utility-curator

/benign-utility-curator

Curates the Blue Team (legitimate-use / usability-regression) suite for mega-security — the second axis of dual-axis evaluation that catches Red Team hardening collapsing legitimate product flows. Prefers to reuse the user's existing data-eval (audited), falls back to domain-tied synthesis or general benchmarks. Outputs JSONL stratified by intent and edge-case proximity.

judge-auditor

/judge-auditor

Audits judge verdict correctness on a sample of failed/refused traces from an iter-0 baseline before agent-optimize enters its main loop. Computes false-positive rate per axis (attack DSR, benign FRR) and gates loop entry.

mas-code-reviewer

/mas-code-reviewer

Reviews code changes via git diff — verifies intent vs reality, runs syntactic and runtime checks, applies caller-supplied review criteria

mas-commit

/mas-commit

Commit current changes on the MEGA branch with a context-aware message. Use after completing a MEGA pipeline step.

Skills2

agent-meta-learning

/agent-meta-learning

Internal report writer. Auto-invoked by agent-optimize at loop completion; not a user-facing entry point. Reads loop history from .mega_security/feedback/ and writes .mega_security/MEGA_SECURITY.md (final audit-grade hardening report) plus .mega_security/meta/security-learnings.md.

prompt-check

/prompt-check

Lightweight security check for a single chat system prompt (no agent loop, no tools, no RAG). Runs 100 attack tests across 4 attack types (prompt injection, jailbreak, PII disclosure, system prompt leak) as the held-out scoring set, plus a parallel tuning set of 100 used only by the optimizer. The attacks ship with the tool as a vetted set — each one previously got past or barely held against a capable baseline AI, so meaningful differences between models actually surface. Plus 16+16 legitimate-use tests for over-blocking detection. Writes MEGA_PROMPT_CHECK.md with block rate per attack type, failure examples, and weakness analysis.

Hooks1

Event Hooks

All tools

13 hooks across 6 events

Stats

Version1.1.1

LanguagePython

Stars30

MaintenanceExcellent

LicenseApache-2.0

Last CommitMay 7, 2026

AddedMay 18, 2026

Actions

View on GitHub View README Plugin Marketplace JSON Homepage

Own this plugin?

Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).

Available In

mega-edo30

Safety Signals

Critical

Matches all tools

Hooks run on every tool call, not just specific ones

Caution

Uses power tools

Uses Bash, Write, or Edit tools

README

MEGA Security

The evaluation-driven approach to LLM system-prompt and agent security.
Define the attack surface, measure it, harden to pass — for chat prompts and full agent pipelines.

Quick Start · What it does · Agent Security · Benchmark · Leaderboard ↗ · megacode.ai ↗

✨ Why?

[!WARNING] Routing through OpenClaw, Hermes, LiteLLM, or OpenRouter? Your system prompt runs on whichever model the router picks at request time and defense rates swing from 0.50 to 0.91 across vendors. Untuned, you ship the worst case.

[!IMPORTANT] Your system prompt is your trust asset. In production it has been breaking repeatedly: EchoLeak (zero-click M365 Copilot exfiltration), the Gap chatbot jailbreak, the Chevy "$1 Tahoe" persona override, and 7+ vendor system prompts now public on GitHub. A static prompt is no longer enough — and once tools, RAG, and memory enter the picture, the attack surface widens beyond what any single prompt can hold.

The common pain points teams hit shipping LLM products:

🧨 Attacks evolve faster than benchmarks — HarmBench, DAN, PII catalogs all live in separate repos, English-only, and lag behind real-world techniques.
⚖️ Defense vs. usability is unmeasured — teams regress into "block-everything" prompts that frustrate legitimate users (high false-refusal rate).
🎯 No reproducible stop condition — there's no objective signal for "is this prompt ship-ready?"
🔁 Manual review is the only feedback loop — you can't tell whether a prompt edit actually helped.
🧰 Agent-shaped products break the prompt model — tools, RAG corpora, and rendered output add categories (tool abuse, RAG poisoning, output handling) that a single-prompt benchmark can't see.

mega-security is an example of evaluation-driven development applied to LLM security. It ships four Claude Code commands that diagnose and harden chat system prompts and full agent pipelines, fail-closed, reproducible, and never modifying your code without your explicit approval.

🚀 Quick Start

Inside any Claude Code session:

/plugin marketplace add https://github.com/mega-edo/mega-security

/plugin install mega-security@mega-edo

That's it. Commands become available immediately:

Chat system prompts — single prompt.txt / system-message scope:

/prompt-check                  # 5–10 min diagnosis of a single system prompt

/prompt-optimize               # iterative hardening with no-regression guarantees

Full agent pipelines — products with tools, RAG, memory, or multi-archetype orchestration:

/agent-check                   # static OWASP review + Red/Blue Team baseline (~10–20 min)

/agent-optimize                # source-level hardening loop with Pareto acceptance gates

To pull updates later: /plugin upgrade mega-security.

[!TIP] Not sure which one you want? If your product has tools, a vector store, or rendered output, run /agent-check. If it's a pure text-in/text-out chat with one system prompt, /prompt-check is faster and ships the same defensive posture for that scope.

Local development install (contributors only)

git clone https://github.com/mega-edo/mega-security ~/mega-agent-security
claude --plugin-dir ~/mega-agent-security

--plugin-dir is session-scoped and additive. To load multiple plugins in one session, repeat the flag. After editing plugin files mid-session, run /reload-plugins to refresh.

📊 Proven across 4 vendors × 2 tiers × 3 scenarios

View full README on GitHub

mega-security

Popularity

What's Inside

Confidence

README

MEGA Security

✨ Why?

🚀 Quick Start

📊 Proven across 4 vendors × 2 tiers × 3 scenarios

Similar Plugins

trustabl

agentguard

adrian-cc

bridgeward

MEGA Security

✨ Why?

🚀 Quick Start

📊 Proven across 4 vendors × 2 tiers × 3 scenarios

Popularity

Health & Quality

Similar Plugins

trustabl

agentguard

adrian-cc

bridgeward

Setup Eval

reins