From terraphim-engineering-skills
Phase 1.5/2.5 of disciplined development. Evaluates document quality using the KLS (Krogstie-Lindland-Sindre) 6-dimension framework. Produces structured ratings, identifies gaps, suggests specific revisions, and blocks phase transitions when quality is below threshold. Applies to Phase 1 research docs and Phase 2 design docs.
npx claudepluginhub terraphim/terraphim-skills --plugin terraphim-engineering-skillsThis skill uses the workspace's default tool permissions.
You evaluate Research Documents (Phase 1) and Implementation Plans (Phase 2) using the KLS framework before they proceed to next phases.
Mandates invoking relevant skills via tools before any response in coding sessions. Covers access, priorities, and adaptations for Claude Code, Copilot CLI, Gemini CLI.
Share bugs, ideas, or general feedback.
You evaluate Research Documents (Phase 1) and Implementation Plans (Phase 2) using the KLS framework before they proceed to next phases.
The Krogstie-Lindland-Sindre framework evaluates document quality across six dimensions:
| Dimension | Question | Evaluation Focus |
|---|---|---|
| Physical | Is it readable, well-formatted, accessible? | Formatting, structure, accessibility |
| Empirical | Can it be understood by intended audience? | Clarity, terminology, examples, linguistic precision (no nominalisations standing in for undefined processes, no unspecified actors, no universal quantifiers without evidence) |
| Syntactic | Is it internally consistent and well-structured? | Consistency, organization, completeness |
| Semantic | Does it accurately represent the domain? | Accuracy, correctness, domain fit |
| Pragmatic | Does it enable the intended decisions/actions? | Actionability, usefulness, guidance |
| Social | Do stakeholders agree with its content? | Consensus, review status, approvals |
| Score | Meaning | Characteristics |
|---|---|---|
| 1 | Poor | Major issues, blocks understanding or use |
| 2 | Below Standard | Significant gaps, needs substantial work |
| 3 | Adequate | Meets minimum bar, minor improvements needed |
| 4 | Good | Clear, useful, few issues |
| 5 | Excellent | Exemplary, no issues, could be a template |
minimum_dimension_score: 3 # No dimension below 3
minimum_average_score: 3.5 # Average across all dimensions
blocking: true # Fail blocks phase transition
In addition to KLS dimensions, evaluate essentialism alignment:
| Check | Question | Evaluation |
|---|---|---|
| Vital Few Focus | Does this focus on 5 or fewer essential items? | Count major scope items |
| Eliminated Noise | Is there a clear "out of scope" section? | Check for elimination documentation |
| Effortless Path | Is the proposed path the simplest possible? | Look for over-engineering |
| 90% Rule | Does each item pass the "HELL YES" test? | Challenge marginal inclusions |
For each dimension:
Apply GO/NO-GO rules to determine status.
# Quality Evaluation: [Document Name]
**Document Type**: Research Document / Implementation Plan
**Phase Transition**: Phase X -> Phase Y
**Status**: PASS / CONDITIONAL PASS / FAIL
**Evaluator**: [Name]
**Date**: [YYYY-MM-DD]
## Executive Summary
[2-3 sentences on overall quality and decision]
## KLS Dimension Scores
| Dimension | Score | Justification | Required Fix |
|-----------|-------|---------------|--------------|
| Physical | X/5 | [Evidence-based reasoning] | [If <3, specific fix] |
| Empirical | X/5 | [Evidence-based reasoning] | [If <3, specific fix] |
| Syntactic | X/5 | [Evidence-based reasoning] | [If <3, specific fix] |
| Semantic | X/5 | [Evidence-based reasoning] | [If <3, specific fix] |
| Pragmatic | X/5 | [Evidence-based reasoning] | [If <3, specific fix] |
| Social | X/5 | [Evidence-based reasoning] | [If <3, specific fix] |
**Average Score**: X.X/5
**Minimum Score**: X/5 ([dimension])
## Essentialism Evaluation
| Check | Status | Evidence |
|-------|--------|----------|
| Vital Few Focus (<=5 items) | Pass/Fail | [Count and list] |
| Eliminated Noise | Pass/Fail | [Out of scope section exists?] |
| Effortless Path | Pass/Fail | [Simplicity assessment] |
| 90% Rule | Pass/Fail | [Marginal items identified] |
## Decision
**GO/NO-GO**: [PASS / CONDITIONAL PASS / FAIL]
**Rationale**: [Brief explanation of decision]
### Required Actions (if FAIL)
1. [Specific, actionable fix]
2. [Specific, actionable fix]
### Recommended Actions (if CONDITIONAL PASS)
1. [Improvement suggestion]
2. [Improvement suggestion]
### Commendations (if PASS)
- [What was done well]
## Re-Evaluation
After fixes are applied:
- [ ] All required actions addressed
- [ ] Re-score affected dimensions
- [ ] Update decision status
disciplined-research -> disciplined-quality-evaluation -> disciplined-design
disciplined-design -> disciplined-quality-evaluation -> disciplined-implementation
The quality-gate skill delegates document quality evaluation to this skill when reviewing Research or Design documents.
When evaluating documents for ZDP (Zestic AI Development Process) gate transitions, add this optional 7th dimension to the KLS framework. This dimension can be ignored for standalone usage or non-gate documents.
| Aspect | Question | Evaluation Focus |
|---|---|---|
| Uncertainty Classification | Does the document explicitly classify what is known vs. unknown vs. contested? | Look for epistemic status labels on key claims |
| Bounded Commitments | Are commitments scoped, time-limited, and reversible where possible? | Check for open-ended or irreversible decisions |
| Escalation Paths | Does the document identify what should be escalated vs. decided locally? | Look for escalation criteria and routing |
| Forced Closure Check | Does the document avoid faking certainty to produce clean answers? | Check for hedged language where evidence is thin |
Scoring: Same 1-5 scale as other KLS dimensions.
When to apply: This dimension is optional for standard Phase 1/2 documents but recommended for ZDP gate-transition documents (PFA, LCO, LCA, IOC, FOC, CLR reviews).
Threshold: When applied, the governance dimension follows the same minimum score (3/5) as other dimensions.
If available, use perspective-investigation skill for governance-grade assessment of contested findings.