From great_cto
Subagent that enforces the artifact summary contract: generates structured .summary.md files for pipeline artifacts and ensures downstream agents read summaries first, reducing token consumption by 30-50%.
How this agent operates — its isolation, permissions, and tool access model
Agent reference
great_cto:agents/-shared/artifact-summary-contractThe summary Claude sees when deciding whether to delegate to this agent
Every great_cto pipeline artifact has two files now: ``` docs/architecture/ARCH-billing.md ← full document (3-10k tokens) docs/architecture/ARCH-billing.summary.md ← ≤ 250 tokens, structured ``` When you write a primary artifact (`ARCH-*.md`, `PLAN-*.md`, `QA-*.md`, `SEC-*.md`, `TM-*.md`, `RELEASE-*.md`, `PERF-*.md`, `ADR-*.md`, `PHASE-*.md`), the `summary-enforce` PostToolUse hook ...
Every great_cto pipeline artifact has two files now:
docs/architecture/ARCH-billing.md ← full document (3-10k tokens)
docs/architecture/ARCH-billing.summary.md ← ≤ 250 tokens, structured
When you write a primary artifact (ARCH-*.md, PLAN-*.md, QA-*.md,
SEC-*.md, TM-*.md, RELEASE-*.md, PERF-*.md, ADR-*.md,
PHASE-*.md), the summary-enforce PostToolUse hook automatically
generates the paired .summary.md. You don't need to write it yourself.
To regenerate manually (e.g. after major edits):
node scripts/generate-summary.mjs docs/architecture/ARCH-billing.md --force
The summary follows a strict structured format per artifact type. Cost ≈ $0.0005 per generation via Haiku; free with heuristic fallback when no API key is configured.
Read the .summary.md first. Only Read the full document if the summary
is insufficient for your decision.
Example — code-reviewer checking architectural alignment of a senior-dev
PR:
GOOD:
1. Read ARCH-billing.summary.md (~250 tokens)
2. Read the actual PR diff
3. If summary mentions a risk relevant to the diff → Read ARCH-billing.md
BAD:
1. Read ARCH-billing.md (~5000 tokens)
2. Read the PR diff
3. Realize 4500 of those tokens were irrelevant
You MUST escalate from .summary.md to the full doc when:
In all other cases, the summary is enough.
Before this contract: a typical pipeline had each artifact re-read in full by 4-5 downstream agents → 15-50k wasted tokens per feature.
With this contract: artifacts are read in full once by their producer and the immediate consumer; everyone else gets the summary. Expected pipeline token reduction: 30-50%.
See docs/plans/PLAN-token-economy-2026-q2.md for the full token-economy
initiative.
npx claudepluginhub avelikiy/great_ctoManages SDLC documentation lifecycle: organizes draft directories, tracks versions/changes/metadata, handles status transitions, archives superseded docs, and maintains audit trails for compliance.
Autonomous agent that summarizes code, config, data, documentation, and markup files. Reads full content via Read tool, assesses size with file_metrics.py, extracts key passages, and outputs structured YAML-frontmatter summaries.
Analyzes repositories and knowledge bases to extract structured knowledge via RELIC evaluation, artifact discovery, and synthesis document generation for agent creation.