Help us improve
Share bugs, ideas, or general feedback.
From context-engineering
Phase 7: Final holistic verification. Automatically invoked on G6 failure, or run manually. Checks for omissions, conflicts, speculation, and consistency, then reports confidence. Artifacts are not modified — only the verification log is recorded. Keywords: verification, holistic check, confidence, feedback loop, omission, conflict, speculation
npx claudepluginhub seokrae/context-engineering --plugin context-engineeringHow this skill is triggered — by the user, by Claude, or both
Slash command
/context-engineering:verifyThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Final Verification phase. Checks whether the compose output actually satisfies the Phase 1 criteria. Does not modify the output — only reports verification results. Records the verification log (`_verify-log.md`).
Guides technical evaluation of code review feedback: read fully, restate for understanding, verify against codebase, respond with reasoning or pushback before implementing.
Share bugs, ideas, or general feedback.
Final Verification phase. Checks whether the compose output actually satisfies the Phase 1 criteria. Does not modify the output — only reports verification results. Records the verification log (_verify-log.md).
Confirm what to verify:
compose output (execution instruction / KB entry / project artifacts)
Phase 1 analysis results (purpose / constraints / success criteria)
If no output exists:
"Please run
/context-engineering:composefirst." Stop.
/context-engineering:verify directly (re-verify existing artifacts)Purpose: Check the output against 4 criteria and report confidence.
| Check | Criteria |
|---|---|
| Omission | Are all Phase 1 purpose/constraints reflected in the output? |
| Conflict | Are there no contradictions within the output? |
| Speculation | Are there no claims unsupported by collected sources? Review [source: general] tagged items first. |
| Consistency | Does the output format match the Phase 1 success criteria? |
[Phase 7 Verification Result]
Omission: {N items reflected / Unreflected: {item name} — {description}}
Conflict: {none / {itemA} ↔ {itemB} — {description}}
Speculation: {none / {item name} — no source evidence}
Consistency: {match / {mismatch detail} — Phase 1 success criteria: {original text}}
[Confidence: H/M/L]
{confidence description}
| Level | Condition | Meaning |
|---|---|---|
| H (High) | 3+ sources, no conflicts, no speculation | Output is grounded in sufficient evidence |
| M (Medium) | 1-2 sources or minor gaps present | Reasonable but can be improved with additional information |
| L (Low) | Insufficient sources or conflicts present | Relies on general knowledge — review required |
M level: "Gathering more information can improve accuracy." L level: "⚠ Sources are insufficient. Re-run gather or provide additional information."
Guide the return to the appropriate Phase based on the issue type:
| Issue Type | Return Phase | Command |
|---|---|---|
| Omission — purpose/constraints not reflected | Phase 1-2 | /context-engineering:gather |
| Conflict — contradiction between Phase 3 selected items | Phase 3 | /context-engineering:gather (re-select) |
| Conflict — contradiction during Phase 4 structuring | Phase 4 | /context-engineering:build re-run |
| Speculation — insufficient source evidence | Phase 2 | /context-engineering:gather (add sources) |
| Consistency — output format mismatch | Phase 6 | /context-engineering:compose re-run |
Once the return Phase is determined in the feedback loop, propose automatic execution to the user:
[Phase 7 Feedback]
Issue: {issue type} — {detailed description}
Return target: Phase {N}
Recommended action: {specific correction}
Run `{command}` automatically? (Y/n)
Returning without context risks the target Phase repeating the same mistake.
When all criteria are met:
Verification complete. [Confidence: H] The output is ready to use.
AI auto-evaluates the following criteria:
| Criteria | Evaluation |
|---|---|
| Confidence determined | Is one of H/M/L clearly assigned? |
| Feedback Loop complete | Was return guidance given per issue type, or was "all criteria met" concluded? |
| Self-scoring rubric complete | Was a rubric score (0-100) produced? |
| Log recorded | Was the result recorded in _verify-log.md? |
G7 Pass: Verification complete — output is ready to use. Delete _phase1-result.md if it exists.
G7 Failure: Return to the appropriate Phase per feedback loop
Phase 7 result: {verification summary}
Unmet items: {check item} — {reason}
Recommended action: {specific correction direction}
Run `{command}` automatically? (Y/n)
On each verification run, append the result to _verify-log.md:
### {YYYY-MM-DD HH:mm} — {output format: A/B/C}
| Check | Result | Detail |
|-------|--------|--------|
| Omission | Pass/Fail | {content} |
| Conflict | Pass/Fail | {content} |
| Speculation | Pass/Fail | {content} |
| Consistency | Pass/Fail | {content} |
Confidence: {H/M/L}
Action: {none / feedback loop → Phase N}
If _verify-log.md already exists:
Read the previous verification results
Compare Pass/Fail changes for the same check items
If a repeated failure pattern is found, state it explicitly:
[Repeated Failure Pattern] {check item} has failed {N} consecutive times — estimated cause: {cause} Recommended action: {specific Phase return guidance}
Related: Circuit Breaker Protocol — present user options when the same gate fails ~3 times consecutively.
After verification, calculate the overall score using the rubric below:
| Criteria | Points | Score |
|---|---|---|
| Phase 1 purpose coverage | 30 | {0-30} |
| Source evidence sufficiency | 25 | {0-25} |
| Internal consistency | 25 | {0-25} |
| Output format appropriateness | 20 | {0-20} |
| Total | 100 | {total} |
Score interpretation: 80+ = H, 50-79 = M, 49- = L
Running /context-engineering:verify --consolidate manually inspects the KB:
tags → Suggest mergedate older than 90 days → Suggest re-verification or archiverelated → Suggest linkingReport results to the user only. Actual deletion/modification proceeds after user approval.