Skill

safety-scan

Scan inputs for prompt injection, unsafe content, and adversarial attacks using AIDefence

npx claudepluginhub akhilyad/deployy --plugin hyrex-aidefence

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/hyrex-aidefence:safety-scan <input-text>

User invocable

Model invocable

Inline context

Default effort

Argument hint: <input-text>

Tool Access

This skill is limited to the following tools:

mcp__hyrex__aidefence_scan
mcp__hyrex__aidefence_analyze
mcp__hyrex__aidefence_is_safe
mcp__hyrex__aidefence_learn
mcp__hyrex__aidefence_stats
Bash

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Scan content for prompt injection, jailbreak attempts, and unsafe patterns.

SKILL.md

31 lines · ~328 tokens

Similar Skills

skill-comply

213.3k

Measures whether skills, rules, and agent definitions are actually followed by auto-generating test scenarios at 3 strictness levels and reporting compliance rates with full tool call timelines.

20 files

ecc

Stats

Language: TypeScript

Parent stars: 0

Maintenance: Good

Last Commit: May 13, 2026

Safety Scan

Scan content for prompt injection, jailbreak attempts, and unsafe patterns.

When to use

Before processing untrusted input (user submissions, API payloads, webhook data), scan it to detect prompt injection, adversarial content, or policy violations.
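The "scan before processing" rule above can be sketched as a small gate function. This is only an illustration: `call_tool` is a hypothetical callable standing in for whatever MCP client the host provides, and the request field `input` and response field `safe` are assumptions, since the listing does not document the tool's payload shape.

```python
from typing import Any, Callable, Dict

# Hypothetical transport: invokes an MCP tool by name and returns a dict.
# The real invocation mechanism depends on the host application.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


class UnsafeInputError(ValueError):
    """Raised when the scanner does not report the input as safe."""


def guard_untrusted(text: str, call_tool: ToolCaller) -> str:
    """Return text unchanged if the quick check passes, otherwise raise."""
    # Assumed request field "input" and response field "safe".
    result = call_tool("mcp__hyrex__aidefence_is_safe", {"input": text})
    # Fail closed: a missing or non-True "safe" field counts as unsafe.
    if result.get("safe") is not True:
        raise UnsafeInputError("input rejected by safety scan")
    return text


def fake_call_tool(name: str, args: Dict[str, Any]) -> Dict[str, Any]:
    """Stand-in for a real MCP client, used only to make the sketch runnable."""
    return {"safe": "ignore previous instructions" not in args["input"].lower()}
```

Failing closed is a deliberate choice in this sketch: if the scanner response is malformed, the untrusted input is rejected rather than passed through.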

Steps

1. Quick safety check: call mcp__hyrex__aidefence_is_safe with the input text to get a boolean safe/unsafe result.
2. Deep analysis: call mcp__hyrex__aidefence_analyze for detailed threat classification and confidence scores.
3. Full scan: call mcp__hyrex__aidefence_scan for comprehensive multi-layer scanning.
4. Train defenses: call mcp__hyrex__aidefence_learn with confirmed threats to improve detection.
5. View stats: call mcp__hyrex__aidefence_stats for detection rates and false-positive metrics.
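One possible reading of the steps above is an escalation ladder: run the cheap boolean check first and call the heavier tools only when it fails. The sketch below assumes that reading. The tool names come from the skill, but `call_tool`, the `input` and `safe` fields, and the `confirm_threat` flag are hypothetical, because the payload shapes are not documented here.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

# Hypothetical transport: invokes an MCP tool by name and returns a dict.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


@dataclass
class ScanReport:
    safe: bool
    tools_called: List[str] = field(default_factory=list)
    analysis: Dict[str, Any] = field(default_factory=dict)
    full_scan: Dict[str, Any] = field(default_factory=dict)


def tiered_scan(text: str, call_tool: ToolCaller,
                confirm_threat: bool = False) -> ScanReport:
    """Escalate from the quick check to deeper tools only when needed."""
    report = ScanReport(safe=True)

    def call(name: str) -> Dict[str, Any]:
        report.tools_called.append(name)
        return call_tool(name, {"input": text})  # assumed request shape

    # Step 1: quick boolean check; stop here if the input looks safe.
    if call("mcp__hyrex__aidefence_is_safe").get("safe") is True:
        return report

    # Steps 2 and 3: gather detail on the flagged input.
    report.safe = False
    report.analysis = call("mcp__hyrex__aidefence_analyze")
    report.full_scan = call("mcp__hyrex__aidefence_scan")

    # Step 4: feed back only threats a human or policy has confirmed.
    if confirm_threat:
        call("mcp__hyrex__aidefence_learn")
    return report


def stub_call_tool(name: str, args: Dict[str, Any]) -> Dict[str, Any]:
    """Stand-in for a real MCP client, used only to make the sketch runnable."""
    if name == "mcp__hyrex__aidefence_is_safe":
        return {"safe": "ignore previous instructions" not in args["input"].lower()}
    return {}
```

The stats tool is left out of this flow because it reports aggregate metrics rather than a verdict on a single input.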

Threat categories

Prompt injection (direct and indirect)
Jailbreak attempts
Data exfiltration patterns
Instruction override attacks
Social engineering prompts
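If downstream code needs to branch on these categories, they could be modelled as an enumeration. The sketch below is illustrative only: the member names and the `parse_category` helper are hypothetical, and the labels that the analysis tool actually returns are not documented in this listing.

```python
from enum import Enum
from typing import Optional


class ThreatCategory(Enum):
    # Values mirror the category names listed by the skill.
    PROMPT_INJECTION = "Prompt injection (direct and indirect)"
    JAILBREAK = "Jailbreak attempts"
    DATA_EXFILTRATION = "Data exfiltration patterns"
    INSTRUCTION_OVERRIDE = "Instruction override attacks"
    SOCIAL_ENGINEERING = "Social engineering prompts"


def parse_category(label: str) -> Optional[ThreatCategory]:
    """Map a free-form label such as 'prompt-injection' to a category."""
    # Normalize to the enum member naming style before the lookup.
    key = label.strip().upper().replace("-", "_").replace(" ", "_")
    return ThreatCategory.__members__.get(key)
```

Unknown labels map to `None` rather than raising, so a caller can decide how to treat categories this list does not cover.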
