Skill

context-engineering

From mtk

Loads minimal relevant context when starting, switching phases, or entering unfamiliar code. Prevents output drift by refreshing scope and rules.

developer-tools

documentation

npx claudepluginhub moberghr/mtk-agent-toolkit --plugin mtk

Popularity

Stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/mtk:context-engineering

Not user invocable

Model invocable

Inline context

Default effort

Uses dynamic context injection — preprocesses shell commands at runtime

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

```!

SKILL.md

208 lines · ~2.7k tokens

Similar Skills

using-superpowers

198.3k

Mandates invoking relevant skills via tools before any response in coding sessions. Covers access, priorities, and adaptations for Claude Code, Copilot CLI, Gemini CLI.

3 files

superpowers

Stats

LanguageShell

Stars6

MaintenanceExcellent

Last CommitMay 18, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

Context Engineering

Active Stack

echo "--- Tech Stack ---"
cat .claude/tech-stack 2>/dev/null || echo "(not set)"
if [ -f .claude/tech-stack-pm ]; then echo "--- Package Manager ---"; cat .claude/tech-stack-pm; fi

Overview

Good output depends on good context. Load the minimum relevant context needed to act correctly, then refresh it when the task shifts.

When To Use

Starting a new session
Switching from planning to implementation or implementation to review
Entering an unfamiliar area of the codebase
When the model starts making assumptions or drifting from project norms

When NOT To Use

As an excuse to endlessly read without acting

Workflow

Start with CLAUDE.md when present.
Load only the shared references relevant to the task.
Path-scoped auto-load. Reference entries in .claude/manifest.json may declare an applyTo glob array. When the current task has a known set of files in scope (from the spec's change_manifest or from git diff --name-only HEAD):
- MCP-first: If mtk_resolve_references tool is available, call it with the list of touched files. It returns deterministic glob matches against the manifest's applyTo arrays. Use its output directly.
- Fallback: If the MCP tool is unavailable, manually test each touched file against the globs (bash case / fnmatch semantics).
- Load references whose globs match at least one touched file.
- Skip references whose globs match nothing — they're not relevant to this task.
- References without applyTo are always-on when needed (e.g. coding-guidelines, framework-patterns); load on demand per phase.
Read the exact file to be changed and 2-3 neighboring files that establish local patterns.
Separate trusted local standards from untrusted external inputs.
Before a new phase, summarize what matters now:
- current goal
- files in scope
- governing rules
- open risks
- which applyTo references activated and why
Refresh context when the scope or failure mode changes. If new files enter scope, re-run the path-scoped match and load any newly-applicable references.

Parallel Loading

Reference reads in load-context steps are independent — issue multiple Read calls in a single message, not sequentially. Same applies to independent Glob/Grep discovery and to reviewer agents fanning out on orthogonal axes. If Call B's input would mention Call A's output, force them sequential; otherwise batch them.

See docs/parallelism-patterns.md for canonical patterns (parallel ref load, Stage 2 reviewer fan-out, batch deferred-tool hydration).

Context Fatigue Signals

Track four lightweight signals during a session and flag fatigue early — refresh, prune, or hand off before output quality collapses. None of these require tooling; estimate from session state.

Signal	Weight	Read as
Token utilization	40%	Approaching the conversation's context limit (e.g., compaction warnings appearing). High = imminent fatigue.
Scope scatter	25%	Number of distinct directories or features touched this session. >3 unrelated areas = scope creep, recall degrades.
Re-read ratio	20%	How often the same file is re-loaded because earlier reads aged out. >2 re-reads of the same file = context evicted.
Error density	15%	Build/test failures, corrections from the engineer, or tool-call retries per phase. Rising density = signal-to-noise dropping.

Composite reading. If 2+ signals are elevated simultaneously:

Pause before the next phase.
Prune: drop references no longer relevant; release skills not in active use.
Re-summarize the active goal (3-5 lines) so the next phase anchors on a clean restatement, not on accumulated noise.
If pruning isn't enough, escalate to handoff — capture state, end the session, resume in a fresh context.

Honest reporting. These are heuristics, not measurements. When you report fatigue, name which signals are elevated and why — don't hide behind a composite score.

Context Budget Tracking

Track the cumulative context loaded in the session. Fewer, focused instructions beat many, diluted ones — every extra rule competes with the ones that actually matter most for the current task.

Budget guidelines:

CLAUDE.md: target under 200 lines (~50 instructions)
Rules files: each under 120 lines
Each skill loaded: 60-120 lines
Reference files: vary, load only relevant sections

When to check the budget:

After loading 3+ skills in a single session, pause and assess: are all still relevant?
If output quality drops or instructions are being ignored, context may be over-budget
Before loading a new reference, check if an earlier one can be released

Warning signals:

5+ skills loaded simultaneously — prune to the 2-3 most relevant
Full reference files loaded when only a section is needed
Same context loaded multiple times (after compaction recovery)

Context Footprint

After completing reference loading at the end of Phase 0 (and after any subsequent phase that loads new references), emit a one-block footprint report so the engineer can see the cost of what was loaded:

# Run wc -l on each loaded reference file, then format the output:
# Example output:
#
# Context footprint (Phase 0):
#   security-checklist.md                         78 lines  (~2k tokens)
#   testing-patterns.md                          112 lines  (~2k tokens)
#   dotnet/coding-guidelines.md                  195 lines  (~3k tokens)
#   ─────────────────────────────────────────────────────────────────
#   Total: 3 files, 385 lines (~5k tokens)
#   (actual load depends on path-scoped matching — unmatched refs not counted)

Token estimate: 1 line ≈ 13 tokens (median for reference docs at ~65 chars/line ÷ 5 chars/token). This is a proxy, not an exact count.

Omit the block if no references were loaded in that phase (e.g., a Bash-only phase that touched no reference files). Keep it skimmable — one line per file, one totals line. Engineers can skip past it if they already know their setup.

Rules

Read before writing.
Prefer targeted context over broad dumping.
Re-anchor on the local codebase pattern before introducing new structures.
If confidence drops, gather better context before guessing.
Track context budget: fewer, more relevant instructions beat more, diluted ones.
Respect applyTo globs: if a reference's globs don't match any touched file, do NOT load it as a "just in case" measure. That defeats the budget.
When in doubt about which globs match, use git diff --name-only HEAD as the authoritative list of touched files.

Model Routing

Not all tasks need the same model tier. Route work by complexity to optimize cost and quality:

Phase / Skill	Model	Rationale
Pre-commit linter	N/A (bash)	Deterministic — no model needed
Setup-bootstrap scan recipes	haiku	File discovery, grep — structured data collection
Setup-audit scan recipes	haiku	Same — structured data collection
Planning and task breakdown	sonnet	Judgment needed but scope is bounded
Incremental implementation	sonnet	Standard code generation
Test-driven development	sonnet	Standard test generation
Pre-commit AI review	sonnet	Fast review with bounded scope
Compliance review	opus	Security, financial state, audit trails — highest stakes
Security and hardening	opus	Security decisions cannot be shallow
Spec-drift detection	sonnet	Structured comparison, moderate judgment
Architecture review	sonnet	Pattern matching against known rules
Test review	sonnet	Assertion quality, coverage gaps
Brainstorming	opus	Creative exploration benefits from deeper reasoning

Agent frontmatter model: sets the model for subagents. Entry-point skills run on the user's selected model. When a skill spawns a reviewer agent, the agent's frontmatter controls its model.

Common Rationalizations

Shared table for all MTK skills. Individual skills reference this section instead of repeating their own tables. If you catch yourself thinking one of these, stop and re-read what the current skill actually requires.

Rationalization	Reality
"I'll just start coding and adjust later"	Early wrong assumptions produce the most expensive rework. Read before writing.
"More context is always better"	No. Irrelevant context crowds out the rules that actually matter.
"I already read a similar file in another project"	Local codebase patterns win over generic memory.
"This change is trivial, it obviously works"	Trivial changes cause production incidents. Verify anyway.
"I'll verify / test / document it later"	Later rarely happens. Do it now or it won't happen.
"I know where the bug / issue is without reproducing it"	You have a hunch, not evidence. Reproduce first.
"It's only one more file"	Hidden scope creep is how quick fixes become feature work. Escalate instead.
"Probably works / should work / the framework handles it"	Probably is not a control. Verify the actual behavior.
"The tests pass, so this is fine"	Passing tests do not clear architecture, security, or performance risks.
"I'll remember this for next time"	You won't — no persistent memory without explicit capture. Write it down.
"The approach is obvious — skip planning / approval / alternatives"	Obvious to whom? Planning and approval exist to catch the mis-framings that feel obvious.
"The spec is outdated; the implementation is right"	Then amend the spec and re-approve. Drift checks run against the current spec, not a hypothetical one.

Red Flags

Editing without reading the target file and neighbors
Repeating generic patterns that the local codebase does not use
Loading many files with no clear reason

Verification

Governing standards were loaded first
Local pattern files were read before editing or reviewing
Context matches the current phase and task scope
No more than 3 skills loaded simultaneously unless justified
Reference files loaded by section, not in full, when possible
Path-scoped references were matched against actual touched files, not loaded speculatively
When scope changed mid-session, path-scoped matches were re-run

context-engineering

Popularity

Invocation

Context Preview

SKILL.md

Similar Skills

Help us improve

Help us improve

Find plugins for your project

context-engineering

Popularity

Invocation

Context Preview

SKILL.md

Context Engineering

Active Stack

Overview

When To Use

When NOT To Use

Workflow

Parallel Loading

Context Fatigue Signals

Context Budget Tracking

Context Footprint

Rules

Model Routing

Common Rationalizations

Red Flags

Verification

Similar Skills

Help us improve

Context Engineering

Active Stack

Overview

When To Use

When NOT To Use

Workflow

Parallel Loading

Context Fatigue Signals

Context Budget Tracking

Context Footprint

Rules

Model Routing

Common Rationalizations

Red Flags

Verification