Skill

Context Engineering

Loads context hierarchically (rules → arch → source → errors → conversation) to maximize signal per token for AI coding agents.

developer-tools

npx claudepluginhub galando/temper --plugin temper

Popularity

Stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/temper:context-engineering

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

**Version:** 1.0.0

SKILL.md

126 lines · ~1.4k tokens

Similar Skills

context-engineering

41.2k

Optimizes AI agent context setup using rules files like CLAUDE.md, specs, source files, and hierarchy. Use for new sessions, degrading output, task switches, or project configuration.

agent-skills

context-engineering

Curates Claude Code agent context hierarchy: rules files, memory, specs, source, and live state. Use for new sessions, output drift, subagent fan-out, rules config.

6 tools

nexus-agents

context-engineering

Guides writing enforceable conventions, stack declarations, and context for HARNESS.md to help LLMs follow project standards and aid team onboarding.

1 file

ai-literacy-superpowers

Stats

LanguageShell

Stars12

MaintenanceExcellent

Last CommitMay 14, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

Context Engineering

Version: 1.0.0 Last Updated: 2026-05-12

Overview

Context engineering is the discipline of loading the right information, in the right order, at the right time. AI coding agents have finite context windows. Loading too much wastes tokens and dilutes attention. Loading too little causes hallucinations and missed dependencies.

This skill provides a hierarchical loading strategy that maximizes signal per token:

Priority 1: rules      → pack rules, stack conventions, guardrails
Priority 2: arch       → module boundaries, dependency graph, entry points
Priority 3: source     → specific files relevant to the current task
Priority 4: errors     → recent failures, test output, runtime errors
Priority 5: conversation → prior decisions, user intent, stakeholder context

When to Use

Start of every /temper stage (plan, design, build, review, check, fix)
Before making changes to unfamiliar code
When resuming a session from saved state
When context feels stale (agent is repeating itself or missing obvious facts)

Process

Step 1: Determine Task Scope

Before loading anything, answer:

What am I doing? (fixing a bug, adding a feature, reviewing code)
What files am I likely to touch? (from tasks.md or plan.md)
What do I already know? (from current context or build-state.json)

This takes 10 seconds and prevents loading 80% of files you won't use.

Step 2: Load Rules (always first)

Load in this order:

.claude/temper.config — enabled packs, stack, review settings
.claude/packs/{enabled-pack}/rules.md — only enabled packs
.claude/packs/stacks/{detected-stack}.md — stack-specific patterns (if exists)
.claude/CLAUDE.md — project conventions

Budget: ~200 lines total. If packs exceed this, load only the packs scoped to the current phase.

Step 3: Load Architecture (if touching unfamiliar code)

Start with the file you need to change
Trace dependencies: what does it import? What imports it?
Stop at 2 hops — do not traverse the entire dependency graph
If plan.md exists: read the file list section only (not the full plan)

Budget: ~400 lines. Use grep to find imports, not cat to read entire files.

Step 4: Load Source (only what's needed)

Read only files listed in tasks.md or plan.md
Read tests before implementation (TDD discipline)
Read error logs or test output if fixing a bug
Skip files you're not modifying and don't need to understand for the change

Budget: ~1000 lines. This is the bulk of your context. Be selective.

Step 5: Load Errors (if fixing or debugging)

Test output from the last run
check-context.json or review-context.json (if resuming from feedback loop)
Git diff (if reviewing changes)
Runtime error logs or stack traces

Budget: ~200 lines. Only relevant failures, not full test suites.

Step 6: Defer Everything Else

These are loaded ONLY when explicitly needed:

Full file contents of files not being modified
Historical git log beyond the last 5 commits
Documentation for frameworks you're not currently using
Design docs for features you're not currently building

Constraint: Under 2K Lines Per Task

Total context loaded per task should stay under 2000 lines. This includes rules, architecture, source, and errors. If you're approaching this limit:

Drop architecture — you probably already understand the module boundaries
Summarize source — read the function signatures, skip the bodies
Defer errors — only load the specific test failure you're fixing

Why 2K? Larger contexts reduce attention density. The agent spends tokens processing information instead of acting on it. Smaller, focused contexts produce better results.

Rationalizations

Rationalization	Why It's Wrong
"I need to read the whole codebase to understand the patterns"	No, you need to read 3-5 representative files. Patterns emerge quickly.
"More context means better decisions"	More context means more noise. The signal-to-noise ratio drops after 2K lines.
"I'll just load everything and filter later"	You can't "filter" context — it all consumes attention tokens whether you reference it or not.
"The file is short, I'll read it just in case"	10 "short" files = 500 lines you didn't need. Read on demand, not preemptively.
"I need the full git history for context"	The last 5 commits tell you what changed. Full history is archaeology, not engineering.

Red Flags

Watch for these signs that your context strategy is failing:

Agent repeats itself — too much context, not enough focus. Reduce.
Agent hallucinates APIs — not enough source context. Load the actual module.
Agent misses obvious dependencies — architecture context was skipped. Load the import graph.
Agent fixes symptoms, not root cause — error context is incomplete. Load the full stack trace.
Task takes > 10 turns — context is likely bloated. Compact and reload focused context.

Verification

After loading context, verify:

Can you name the 3-5 files you'll modify? If not, scope is unclear.
Can you describe the module boundary you're working within? If not, load architecture.
Do you know what tests exist for the code you're changing? If not, load test files.
Is your loaded context under 2000 lines? If not, trim before proceeding.