Skill

context-optimization

Manages session context, token budgeting, and strategic information loading for AI-assisted engineering sessions. Trigger: "optimize context", "token budget", "context management", "session context", "priming strategy".

From sovereign-architect

Install

Run in your terminal

npx claudepluginhub javimontano/mao-sovereign-architect

Tool Access

This skill is limited to using the following tools:

ReadGlobGrepBashAgent

Supporting Assets

View in Repository

evals/evals.json

examples/sample-output.md

prompts/use-case-prompts.md

references/body-of-knowledge.md

Skill Content

Similar Skills

cqrs-implementation

Implements CQRS patterns with Python templates for command/query separation, event-sourcing, and scalable read/write models. Use for optimizing queries or independent scaling.

backend-development

33.0k

architecture-patterns

1 file

Implements Clean Architecture, Hexagonal Architecture (ports/adapters), and Domain-Driven Design for backend services. For microservice design, monolith refactoring to bounded contexts, and dependency debugging.

backend-development

33.0k

api-design-principles

4 files

Provides REST and GraphQL API design principles including resource hierarchies, HTTP methods, versioning strategies, pagination, and filtering patterns for new APIs, reviews, or standards.

backend-development

33.0k

Stats

Stars0

Forks0

Last CommitMar 28, 2026

Actions

View Source View Plugin View on GitHub View README

Procedure

Step 1 — Audit Available Context

Estimate the total context window size and current utilization.

Inventory all loaded context: system prompts, CLAUDE.md, conversation history, tool results.

Identify the highest-value context for the current task (directly relevant files, references).

Identify low-value context consuming tokens (boilerplate, verbose logs, irrelevant history).

Calculate the remaining context budget for task execution.

Step 2 — Design the Loading Strategy

Lazy Loading: Load references only when needed, not all at session start.

Priority Queue: Rank context items by relevance to the current task.

Summarization: Replace verbose context with concise summaries when full text is not needed.

Chunking: Break large files into relevant sections; load only the needed chunk.

Index-First: Load indexes and catalogs first; deep-dive into specifics on demand.

Step 3 — Implement Token Budgeting

Allocate budget by category: system context (20%), task context (50%), working memory (30%).

Set per-file token limits: if a file exceeds budget, summarize or extract relevant sections.

Monitor context growth during multi-step tasks; prune completed step context.

Cache frequently-referenced information in compact form (tables, key-value pairs).

Use structured formats (tables, lists) over prose to convey the same information in fewer tokens.

Step 4 — Optimize for Session Continuity

Generate session state snapshots at key milestones for recovery.

Create compact session summaries that preserve critical decisions and findings.

Design handoff artifacts that allow a new session to resume without re-reading everything.

Track which context items have been loaded and which are pending.

Document context optimization decisions for session debugging.

Procedure

Step 1 — Audit Available Context

Estimate the total context window size and current utilization.

Inventory all loaded context: system prompts, CLAUDE.md, conversation history, tool results.

Identify the highest-value context for the current task (directly relevant files, references).

Identify low-value context consuming tokens (boilerplate, verbose logs, irrelevant history).

Calculate the remaining context budget for task execution.

Step 2 — Design the Loading Strategy

Lazy Loading: Load references only when needed, not all at session start.

Priority Queue: Rank context items by relevance to the current task.

Summarization: Replace verbose context with concise summaries when full text is not needed.

Chunking: Break large files into relevant sections; load only the needed chunk.

Index-First: Load indexes and catalogs first; deep-dive into specifics on demand.

Step 3 — Implement Token Budgeting

Allocate budget by category: system context (20%), task context (50%), working memory (30%).

Set per-file token limits: if a file exceeds budget, summarize or extract relevant sections.

Monitor context growth during multi-step tasks; prune completed step context.

Cache frequently-referenced information in compact form (tables, key-value pairs).

Use structured formats (tables, lists) over prose to convey the same information in fewer tokens.

Step 4 — Optimize for Session Continuity

Generate session state snapshots at key milestones for recovery.

Create compact session summaries that preserve critical decisions and findings.

Design handoff artifacts that allow a new session to resume without re-reading everything.

Track which context items have been loaded and which are pending.

Document context optimization decisions for session debugging.

context-optimization

context-optimization

Context Optimization

Guiding Principle

Procedure

Step 1 — Audit Available Context

Step 2 — Design the Loading Strategy

Step 3 — Implement Token Budgeting

Step 4 — Optimize for Session Continuity

Quality Criteria

Anti-Patterns

Context Optimization

Guiding Principle

Procedure

Step 1 — Audit Available Context

Step 2 — Design the Loading Strategy

Step 3 — Implement Token Budgeting

Step 4 — Optimize for Session Continuity

Quality Criteria

Anti-Patterns