By JoranHonig
Orchestrate security research workflows: annotate codebases, define vulnerability patterns, hunt for bugs, draft structured findings, deduplicate results, generate PoCs, and encode findings into reusable detection modules — all within an agentic audit workspace.
QA gatekeeper and triage partner. This agent should be invoked when the user or another agent says "triage this finding", "verify this vulnerability", "check if this is real", "is this a false positive", "validate this hypothesis", "review these sigil results", "triage findings", "familiar", "run triage", "double check this", "sanity check", "quality check this finding", "review this PoC", "evaluate this finding", "triage all findings", "batch triage", "process sigil output", "check this PoC", "is this PoC correct", or when sigil agents produce findings that need validation before presenting to the user. Three modes: finding triage (validate a single finding or hypothesis), batch triage (process multiple sigil findings), and PoC review (evaluate proof-of-concept quality and completeness).
Worker agent that builds artifacts from explicit plans. This agent should be invoked when another agent says "delegate to gnome", "spawn a gnome", "have a gnome build this", or when a parent agent (Scribe or Familiar) needs isolated execution of a clearly-defined build task. Also invoked when the user says "gnome", "build this check", "build a semgrep rule", "build a slither detector", "implement this detection module", or "create this detection module". For PoC construction, invoked when a parent agent delegates with an explicit plan — the user-facing PoC workflow is the write-poc skill. Four modes: build check (agentic detection module), build semgrep rule, build slither detector, and build PoC.
Adversarial red-teamer of findings from a bug-bounty-host perspective. This agent should be invoked when the user or another agent says "red team this finding", "red-team this report", "hag", "run the hag", "challenge this finding", "what would a bounty host say", "disprove this finding", "stress test this PoC", "would this get paid", "attack this report", "find holes in this finding", "adversarial review", "bounty triage", "decline or pay", "devil's advocate this", or when a confirmed finding needs final prosecutorial review before submission or payout. The hag goes beyond verification — she prosecutes the finding, assumes it's overstated, and rules only for payout when the evidence forces her hand.
External research specialist. This agent should be invoked when the user or another agent says "look up", "research", "find documentation for", "what does the spec say about", "check if this is a known vulnerability", "study the specification", "find prior audit findings", "how does protocol X handle Y", "search for known issues with", "fact check this", or whenever information cannot be found in the current codebase. Covers documentation lookups, protocol specifications, vulnerability databases (solodit), prior audit reports, GitHub repositories, and security knowledge bases. Two modes: directed questions (specific Q&A with citations) and generic study (broad topic context priming).
Detection module builder and spellbook manager. This agent should be invoked when the user or another agent says "distill this finding", "create a detection module", "build a sigil from this", "encode this as automation", "scribe", "update my spellbook", "merge sigils", "what sigils do I have", "clean up sigils", "garbage collect", "scribe-gc", "promote sigils", "end of audit merge", "encode finding as detection", "what detectors do I have", "show my spellbook", "list my sigils", or when a confirmed finding should be assessed for automated detection potential. Three modes: distill (finding to detection module), spellbook management (merge, promote, garbage collect), and query (list and describe sigils).
This skill should be used when the user says "find annotations", "list audit tags", "show @audit comments", "compile annotations", "/annotation", "find todos", "find audit comments", "what did I annotate", "annotation summary", "list audit findings", "what's annotated", or wants to discover, list, or filter @audit-* comment annotations scattered throughout a codebase. This skill is for annotation discovery only — how annotations are used downstream (spawning subagents, cross-referencing findings, etc.) is out of scope.
This skill should be used when the user says "build context on a flow", "trace a flow", "map how X works", "cartography", "/cartography", "document a flow", "create a flow map", "trace how authentication works", "map the data flow", or wants to explore and document how a specific code flow works so that context can be quickly rebuilt on future visits. This is the primary skill for creating cartography files in grimoire/cartography/.
This skill should be used when the user says "create a check", "write a check", "add a check", "apply checks", "run checks", "/checks", "vulnerability pattern", "detection check", "check for common bugs", "scan with checks", or wants to create, apply, or manage simple vulnerability pattern files that agents use to find flaws. Checks are the simplest unit of agentic vulnerability detection — markdown files describing what to look for and how to assess matches. This skill is NOT for general code review or ad-hoc vulnerability analysis.
This skill should be used when the user says "deduplicate findings", "dedup findings", "compare findings", "find duplicate findings", "merge findings", "clean up findings", "/finding-dedup", or wants to identify and resolve duplicate or overlapping security findings in a project. Classifies finding pairs as duplicate (can delete one) or similar (different scope, may merge). This skill is NOT for drafting new findings (use /finding-draft) or reviewing individual findings (use /finding-review).
This skill should be used when the user says "draft a finding", "write a finding", "create a finding", "document a vulnerability", "write up this bug", "finding template", "report a vulnerability", "/finding-draft", or wants to construct a new structured security finding from a vulnerability observation. This skill is NOT for reviewing existing findings (use /finding-review) or deduplicating findings (use /finding-dedup).
Requires secrets
Needs API keys or credentials to function
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
A security research toolkit that learns.
Grimoire takes the raw agent experience and tunes it for security research. Clean, readable and reproducible PoCs,
automatic static analysis module distillation, and more.
There are many audit agents and vulnerability discovery skills.
These are great, but the real power of agents is in amplifying operator skill. Grimoire embraces that philosophy and implements several skills that make your agent a better co-auditor.
Some skills (such as the cartography) skill come with a small workflow adaption. However, most features such as the librarian are designed to just work within whatever workflow you follow.
The librarian is an agent that looks for documentation and references (e.g. previous audit findings, docs, blog posts, etc.) . It is very focussed on providing only information backed up by reference and keeps the main context clear from large mcp descriptions.
The cartography skill provides instructions to claude on how it can document a mapping from features / flows to code locations. This allows you to say
hey load context on the authentication flow, claude will review the file and very quickly load the relevant context.
Grimoire is built on a few hard convictions from real-world security research:
Grimoire is still at a very early stage and under continuous development expect there to be major changes.
Grimore is set up as a claude plugin that's easy to set up.
At first you should just use the summon command and get going, grimoire will automatically hop in once you ask claude to write a finding, proof of concept, or a similar task.
Once you're ready to dive deep we would suggest having a look at the scribe and cartography skills.
git clone https://github.com/JoranHonig/grimoire.git
claude --plugin-dir /path/to/grimoire
Skills and agents auto-discover via the plugin manifest. Some features require API keys:
| Service | Key | Purpose |
|---|---|---|
| Solodit | SOLODIT_API_KEY | Audit findings search via claudit |
| Context7 | CONTEXT7_API_KEY | Library documentation lookups |
Set these in your Claude Code settings (~/.claude/settings.json):
{
"env": {
"SOLODIT_API_KEY": "your-key-here",
"CONTEXT7_API_KEY": "your-key-here"
}
}
Both are optional — the librarian will fall back to web search if they are not set. You can also export them as regular shell environment variables (e.g. in ~/.zshrc) instead of using settings.json.
Start any engagement with summon:
You: "Summon grimoire on this codebase"
Grimoire: → Analyzes project structure, architecture, integrations
→ Identifies crown jewels and attack surface
→ Writes GRIMOIRE.md contextual map
→ Spawns detection checks across the codebase
→ Surfaces initial findings for triage
Then work naturally — Grimoire's skills trigger from context:
"Map the authentication flow" → cartography
"Write a PoC for the reentrancy I found" → write-poc
"Document this as a finding" → finding-draft
"Review my findings before submission" → finding-review
"Check for duplicates" → finding-dedup
Learn more about grimoire by reading docs
npx claudepluginhub joranhonig/grimoireSpecialized security review subagent
Security testing skills for AI-assisted IDEs and coding agents. 25 vulnerability patterns across code execution, prompt injection, data exfiltration, and trust persistence.
Security best practices advisor with vulnerability detection and fixes
Build deep architectural context through ultra-granular code analysis before vulnerability hunting
Web vulnerability testing methodology distilled from 88,636 real-world cases from the WooYun vulnerability database (2010-2016)
71-skill bug-hunting & external red-team bundle for Claude Code — 48 hunt-* web/vuln-class + framework skills, enterprise platform attack chains (M365/Entra, Okta, SharePoint, vCenter, SSL-VPN, APK), recon/OSINT, reporting & validation gates, and Burp MCP integration. Skills auto-load by topic; 15 slash commands included.