Skill

verify

Phase 7: Final holistic verification. Automatically invoked on G6 failure, or run manually. Checks for omissions, conflicts, speculation, and consistency, then reports confidence. Artifacts are not modified — only the verification log is recorded. Keywords: verification, holistic check, confidence, feedback loop, omission, conflict, speculation

npx claudepluginhub seokrae/context-engineering --plugin context-engineering

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/context-engineering:verify

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

ReadWriteBashGrepGlob

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Final Verification phase. Checks whether the compose output actually satisfies the Phase 1 criteria. Does not modify the output — only reports verification results. Records the verification log (`_verify-log.md`).

SKILL.md

198 lines · ~1.7k tokens

Similar Skills

receiving-code-review

221.0k

Guides technical evaluation of code review feedback: read fully, restate for understanding, verify against codebase, respond with reasoning or pushback before implementing.

superpowers

Stats

Stars0

MaintenanceExcellent

Last CommitApr 30, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

Verify (Phase 7)

Prerequisites

Confirm what to verify:

compose output (execution instruction / KB entry / project artifacts)
Phase 1 analysis results (purpose / constraints / success criteria)

If no output exists:

"Please run /context-engineering:compose first." Stop.

Trigger

Automatic: compose auto-invokes when G6 is not met
Manual: run /context-engineering:verify directly (re-verify existing artifacts)

Phase 7. Final Verification

Purpose: Check the output against 4 criteria and report confidence.

4 Checks

Check	Criteria
Omission	Are all Phase 1 purpose/constraints reflected in the output?
Conflict	Are there no contradictions within the output?
Speculation	Are there no claims unsupported by collected sources? Review `[source: general]` tagged items first.
Consistency	Does the output format match the Phase 1 success criteria?

Check Output Format

[Phase 7 Verification Result]

Omission: {N items reflected / Unreflected: {item name} — {description}}
Conflict: {none / {itemA} ↔ {itemB} — {description}}
Speculation: {none / {item name} — no source evidence}
Consistency: {match / {mismatch detail} — Phase 1 success criteria: {original text}}

[Confidence: H/M/L]
{confidence description}

Confidence Criteria

Level	Condition	Meaning
H (High)	3+ sources, no conflicts, no speculation	Output is grounded in sufficient evidence
M (Medium)	1-2 sources or minor gaps present	Reasonable but can be improved with additional information
L (Low)	Insufficient sources or conflicts present	Relies on general knowledge — review required

M level: "Gathering more information can improve accuracy." L level: "⚠ Sources are insufficient. Re-run gather or provide additional information."

Feedback Loop

Guide the return to the appropriate Phase based on the issue type:

Issue Type	Return Phase	Command
Omission — purpose/constraints not reflected	Phase 1-2	`/context-engineering:gather`
Conflict — contradiction between Phase 3 selected items	Phase 3	`/context-engineering:gather` (re-select)
Conflict — contradiction during Phase 4 structuring	Phase 4	`/context-engineering:build` re-run
Speculation — insufficient source evidence	Phase 2	`/context-engineering:gather` (add sources)
Consistency — output format mismatch	Phase 6	`/context-engineering:compose` re-run

Automatic Recovery Support

Once the return Phase is determined in the feedback loop, propose automatic execution to the user:

[Phase 7 Feedback]
Issue: {issue type} — {detailed description}
Return target: Phase {N}
Recommended action: {specific correction}

Run `{command}` automatically? (Y/n)

If the user responds "Y", "네" (yes), "진행" (proceed), or no response → automatically run the command
If the user responds "n", "아니오" → provide the command only and stop
When running automatically, pass the issue context along:
- Failed check items
- Specific unmet content
- What was lacking in the previous attempt

Returning without context risks the target Phase repeating the same mistake.

When all criteria are met:

Verification complete. [Confidence: H] The output is ready to use.

G7 Gate

AI auto-evaluates the following criteria:

Criteria	Evaluation
Confidence determined	Is one of H/M/L clearly assigned?
Feedback Loop complete	Was return guidance given per issue type, or was "all criteria met" concluded?
Self-scoring rubric complete	Was a rubric score (0-100) produced?
Log recorded	Was the result recorded in `_verify-log.md`?

G7 Pass: Verification complete — output is ready to use. Delete _phase1-result.md if it exists.

G7 Failure: Return to the appropriate Phase per feedback loop

Phase 7 result: {verification summary}
Unmet items: {check item} — {reason}
Recommended action: {specific correction direction}
Run `{command}` automatically? (Y/n)

Saving Verification Results

On each verification run, append the result to _verify-log.md:

### {YYYY-MM-DD HH:mm} — {output format: A/B/C}

| Check | Result | Detail |
|-------|--------|--------|
| Omission | Pass/Fail | {content} |
| Conflict | Pass/Fail | {content} |
| Speculation | Pass/Fail | {content} |
| Consistency | Pass/Fail | {content} |

Confidence: {H/M/L}
Action: {none / feedback loop → Phase N}

Comparing Previous Results

If _verify-log.md already exists:

Read the previous verification results
Compare Pass/Fail changes for the same check items
If a repeated failure pattern is found, state it explicitly:

[Repeated Failure Pattern] {check item} has failed {N} consecutive times — estimated cause: {cause} Recommended action: {specific Phase return guidance}

Related: Circuit Breaker Protocol — present user options when the same gate fails ~3 times consecutively.

Self-scoring Rubric

After verification, calculate the overall score using the rubric below:

Criteria	Points	Score
Phase 1 purpose coverage	30	{0-30}
Source evidence sufficiency	25	{0-25}
Internal consistency	25	{0-25}
Output format appropriateness	20	{0-20}
Total	100	{total}

Score interpretation: 80+ = H, 50-79 = M, 49- = L

Memory Consolidation (Manual Run)

Running /context-engineering:verify --consolidate manually inspects the KB:

Duplicate detection: Entries within the same domain with 70%+ overlapping tags → Suggest merge
Staleness detection: Entries with date older than 90 days → Suggest re-verification or archive
Orphan detection: Entries not referenced in any other entry's related → Suggest linking

Report results to the user only. Actual deletion/modification proceeds after user approval.

verify

Invocation

Tool Access

Context Preview

SKILL.md

Similar Skills

Help us improve

Help us improve

Find plugins for your project

verify

Invocation

Tool Access

Context Preview

SKILL.md

Verify (Phase 7)

Prerequisites

Trigger

Phase 7. Final Verification

4 Checks

Check Output Format

Confidence Criteria

Feedback Loop

Automatic Recovery Support

G7 Gate

Saving Verification Results

Comparing Previous Results

Self-scoring Rubric

Memory Consolidation (Manual Run)

Similar Skills

Help us improve

Verify (Phase 7)

Prerequisites

Trigger

Phase 7. Final Verification

4 Checks

Check Output Format

Confidence Criteria

Feedback Loop

Automatic Recovery Support

G7 Gate

Saving Verification Results

Comparing Previous Results

Self-scoring Rubric

Memory Consolidation (Manual Run)