From pro-workflow
Teaches context engineering ops: Write to persist, Select relevant info, Compress tokens, Isolate to manage budgets and keep AI coding sessions efficient.
npx claudepluginhub rohitg00/pro-workflow --plugin pro-workflowThis skill uses the workspace's default tool permissions.
Four operations control everything about how context flows through an AI coding session. Master them and you control the quality of every response.
Optimizes Claude Code session context and token usage via compaction, MCP audits, prompt scoping, and subagent delegation. Use when sessions slow, context degrades, or budget depletes.
Suggests manual context compaction at logical intervals to preserve context through task phases rather than arbitrary auto-compaction.
Manages Claude Code context window usage: monitors statusline, degradation thresholds, compacts with structured handoffs (/create-handoff, /compact, /resume-handoff), and validates summaries. For long sessions nearing limits.
Share bugs, ideas, or general feedback.
Four operations control everything about how context flows through an AI coding session. Master them and you control the quality of every response.
Move information out of the context window into durable storage so it survives compaction and session boundaries.
Where to write:
| Target | When | Example |
|---|---|---|
| CLAUDE.md | Permanent project rules | "Always use pnpm, never npm" |
| NOTES.md / scratchpad | Working state for current task | Architecture decisions, open questions |
.claude/memory/ | Learnings and patterns | [LEARN] rules from corrections |
| External files | Data too large for context | Test plans, migration checklists |
Pattern — Scratchpad workflow:
1. Start complex task → create NOTES.md with goals and constraints
2. After research → write findings to NOTES.md
3. After compaction → NOTES.md survives, context does not
4. Resume → read NOTES.md to recover full state
Pull the right information into context at the right time. Precision matters more than volume.
Methods ranked by precision:
@file references — exact file injectiongrep / Glob — targeted pattern searchKey principle: Focused 300 tokens > unfocused 113K tokens.
A surgical grep result that returns the exact function signature beats dumping an entire module into context. Every irrelevant token dilutes attention.
Pattern — Progressive retrieval:
1. Start with file names (Glob)
2. Narrow to specific functions (Grep)
3. Read only the relevant lines (Read with offset+limit)
4. Never read entire large files when you need one function
Shrink context without losing the information that matters.
Compaction strategies:
| Strategy | How | When |
|---|---|---|
/compact with focus | /compact focus: auth module changes | Task boundaries |
| Microcompact | Ask Claude to summarize tool output inline | After large reads/searches |
| Head+tail | Read first 20 + last 20 lines of large output | Log analysis, test results |
| Tool result clearing | Subagent results auto-clear after reporting | Heavy exploration |
| Semantic selection | Summarize findings, discard raw data | Research phases |
Compaction triggers:
CLAUDE_AUTOCOMPACT_PCT_OVERRIDE=50)PostCompact hook — Re-inject critical context:
{
"type": "PostCompact",
"command": "cat .claude/critical-context.md"
}
Use this to ensure project rules, current task state, or architecture constraints survive every compaction.
Don't load everything into one context. Split work across independent execution spaces.
| Method | Isolation Level | Use When |
|---|---|---|
| Subagents | Forked context | Heavy exploration, test runs, doc generation |
Worktrees (claude -w) | Full repo copy | Parallel features, competing approaches |
/btw (built-in Claude Code) | Temporary overlay | Quick questions without entering conversation history |
| Agent teams | Independent sessions | Cross-layer changes, parallel reviews |
Fresh session (/resume) | Clean slate | Unrelated work, degraded context |
Pattern — Subagent delegation:
Main session: planning, coordination, commits
Subagent 1: explore auth module, report findings
Subagent 2: run test suite, report failures
Subagent 3: generate migration script
Main context stays clean. Subagents handle the volume.
Example baseline (calibrate with /context): ~200K total window, ~20K overhead (CLAUDE.md, tool definitions, MCP schemas). Plan around ~180K usable — actual budgets vary by model and configuration.
| Allocation | Budget | What Goes Here |
|---|---|---|
| Static context | 20-30K | CLAUDE.md, tool schemas, MCP definitions |
| Dynamic context | 150-180K | Code, conversation, tool results |
Put static context first. CLAUDE.md and tool definitions load before conversation. Keeping them stable maximizes prompt cache hits — saves cost and latency.
| Phase | Target Usage | Action If Over |
|---|---|---|
| Planning | < 20% | Keep plans concise, write to scratchpad |
| Implementation | < 50% | Compact between files, delegate reads |
| Testing | < 70% | Delegate test runs to subagents |
| Review | < 85% | Start fresh session if degraded |
| Situation | Action |
|---|---|
| Task boundary, want to keep learnings | /compact with focus |
| Context degraded, Claude repeating itself | /compact, then /resume if still bad |
| Starting unrelated work | /clear or new session |
| Heavy read/search operation | Delegate to subagent |
| Quick side question | /btw (doesn't pollute main context) |
| Exploring multiple approaches | Worktrees or agent teams |
## Context Engineering
Write to NOTES.md for working state that must survive compaction.
Select with precision — grep first, read specific lines, never dump whole files.
Compact at 50% or task boundaries. Set CLAUDE_AUTOCOMPACT_PCT_OVERRIDE=50.
Isolate heavy work to subagents. Main session stays for coordination and commits.