cc-token-saver
Claude Code keeps cutting you off? Not anymore.
Spend less, code longer, and see exactly where your tokens go — zero config.
How? Auto context management, real-time cost tracking, and cache-aware session control — all built into one plugin.
😤 The Problem: $200/mo and You Still Can't Get Work Done
Claude Code Max Plan ($200/mo). Should be enough. It's not.
5-hour rolling window rate limit. You're deep in a coding flow and it just stops. No timer. No ETA. Just wait.
Cache expiry. You come back from lunch. It's been over an hour. You send one prompt and 900K tokens are re-sent at full price. Cost? $9 in a single shot.
Invisible costs. There's no way to see how much you're spending in real time. You only find out after the rate limit hits.
All manual. Context size, cache expiry timing, SubTask delegation, session cleanup. Nobody can track all this while actually coding.
cc-token-saver handles all of it automatically. Install once. Done.
🚀 Installation
claude plugin marketplace add ww-w-ai/cc-token-saver
claude plugin install cc-token-saver
Works automatically after install. Zero config. Requires Claude Code v2.1.71+.
For live monitoring:
/setup-statusline install
🛡️ Feature 1: Token Guardian
Detects cache expiry and automatically blocks expensive re-sends.
Claude Code's prompt cache TTL is 1 hour. Step away for more than an hour and the cache expires. Your next message re-sends the entire context at full price. At 900K tokens, that's $9 in one shot.
Token Guardian tracks when the last response was received. If more than 3,590 seconds have passed (TTL minus a 10-second buffer), it blocks the prompt and shows a warning:
🚨 Cache expired (68m 23s idle)
The prompt cache has expired. Continuing will resend the full context.
Cost may increase significantly.
👉 /context — Check current context usage before deciding
👉 /clear → /continue — Reset, then restore previous context (recommended, cheapest)
👉 Re-send — Continue as-is (full re-cache cost incurred)
Just re-send the same prompt after the warning and it goes through. The warning only fires once per idle period, so it never nags. Warning messages display in 23 languages based on your OS locale.
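The guard logic itself is simple enough to sketch. Below is a minimal TypeScript illustration of how a prompt-submit hook could implement it; the state file path, field names, and message text are assumptions for illustration, not the plugin's actual source:

```typescript
// token-guardian-sketch.ts: illustrative only. The state file path, field
// names, and warning text are assumptions, not the plugin's actual source.
import { existsSync, readFileSync, writeFileSync } from "node:fs";

const STATE_FILE = "/tmp/cc-token-saver-state.json"; // hypothetical location
const TTL_SECONDS = 3600;      // Claude Code prompt-cache TTL: 1 hour
const BUFFER_SECONDS = 10;     // warn slightly before the cache actually dies
const THRESHOLD = TTL_SECONDS - BUFFER_SECONDS; // 3,590 s, as described above

interface GuardState {
  lastResponseAt: number; // Unix seconds of the last assistant response;
                          // a companion Stop hook would refresh this (not shown)
  warned: boolean;        // makes the warning fire once per idle period
}

const now = Math.floor(Date.now() / 1000);
const state: GuardState = existsSync(STATE_FILE)
  ? JSON.parse(readFileSync(STATE_FILE, "utf8"))
  : { lastResponseAt: now, warned: false };

const idle = now - state.lastResponseAt;

if (idle > THRESHOLD && !state.warned) {
  // Record that we warned, so re-sending the same prompt goes straight through.
  writeFileSync(STATE_FILE, JSON.stringify({ ...state, warned: true }));
  const m = Math.floor(idle / 60);
  const s = idle % 60;
  console.error(`🚨 Cache expired (${m}m ${s}s idle). Continuing will resend the full context.`);
  process.exit(2); // exit code 2 blocks the prompt in a UserPromptSubmit hook
}

process.exit(0); // cache still warm, or user already saw the warning: let it through
```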
Result: Expensive re-cache costs are prevented automatically. No effort required.
🧠 Feature 2: Smart Session Architecture
Install it and cost-optimized work patterns kick in automatically.
Most users do everything in the Main session. File reads, code generation, test runs. Every output piles into context and is re-sent with every message. The session bloats. Costs snowball.
Session Architect automatically injects a delegation strategy at session start.
|  | Main Session | SubTask |
|---|---|---|
| Role | Design, decisions, review | Implementation, code gen, multi-file edits |
| Cache tier | 1 hour (ephemeral_1h) | 5 min |
| Cache write cost | $10/MTok | $6.25/MTok |
| Context size | ~94K tokens avg | ~33K tokens avg |
SubTask cache writes are 37.5% cheaper than Main's, and SubTask contexts are much smaller. Delegating heavy work to SubTasks cuts costs dramatically.
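A back-of-the-envelope comparison using the averages above: a cold cache write of a ~94K-token Main context costs about 94K × $10/MTok ≈ $0.94, while a ~33K-token SubTask context at $6.25/MTok costs about $0.21, roughly 4.5× less per write.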
Result: Claude automatically works in a cost-efficient pattern. You don't have to think about it.
🪶 Concise Mode
Same content. Less padding. On by default.
The SessionStart hook also injects a response-style rule that applies in every session and with every model — no flags, no setup. Three things change:
- Preamble out — no "Let me check…", "I'll now…", restating your question, or recapping what the diff already shows
- Right format for the content — bullets for lists, prose for reasoning (tradeoffs, causation, rationale). Neither is forced
- Tighter expression — same point, fewer words. Clearer prose is shorter prose
Hard limit: never drop content, skip verification, or collapse nuance into a single sentence. Substance stays full; only the wrapper shrinks.
Install once, applies everywhere.
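Both the delegation strategy from Feature 2 and this response-style rule ride the same mechanism: a SessionStart hook whose stdout Claude Code adds to the new session's context. A minimal sketch, with the injected wording paraphrased rather than taken from the plugin:

```typescript
// session-architect-sketch.ts: illustrative only. The rule text below is a
// paraphrase of the behavior described above, not the plugin's actual prompt.
const injectedRules = `
<cc-token-saver>
Delegate implementation, code generation, and multi-file edits to SubTasks;
keep the Main session for design, decisions, and review.
Respond concisely: no preamble, the right format for the content, fewer words
for the same point. Never drop content, skip verification, or collapse nuance.
</cc-token-saver>`.trim();

// A SessionStart hook's stdout is added to the session context, so printing
// the rules is the whole injection.
console.log(injectedRules);
```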
🔄 Feature 3: /continue — Context Restoration
Replaces /compact. Zero LLM calls. Zero token cost.
/compact sends your entire context (~1M tokens) to the LLM to compress it into a summary roughly 3.3% of the original size (~33K tokens). If the cache has expired, that request alone triggers a full re-cache. Information loss is inevitable.
/continue takes a completely different approach. It preprocesses the previous session transcript and loads it directly. No LLM call. No cost. The original conversation is restored as-is.
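As a rough sketch of the idea, assuming the JSONL transcript format Claude Code writes under ~/.claude/projects/ (the record fields here are simplified and may not match the plugin's actual parsing):

```typescript
// continue-sketch.ts: a rough illustration of restoring context with no LLM.
// Assumes the JSONL transcript format Claude Code writes under
// ~/.claude/projects/; the record fields are simplified and may differ.
// Usage: node continue-sketch.js <session-transcript.jsonl>
import { readFileSync } from "node:fs";

function restoreTranscript(path: string): string {
  const lines = readFileSync(path, "utf8").split("\n").filter(Boolean);
  const turns: string[] = [];

  for (const line of lines) {
    let record: any;
    try { record = JSON.parse(line); } catch { continue; } // skip malformed rows

    // Keep only conversational turns; drop tool output and metadata records.
    if (record.type !== "user" && record.type !== "assistant") continue;
    const content = record.message?.content;
    const text = typeof content === "string"
      ? content
      : (content ?? [])
          .filter((block: any) => block.type === "text")
          .map((block: any) => block.text)
          .join("\n");
    if (text) turns.push(`${record.type}: ${text}`);
  }

  // Plain text, restored as-is: no LLM call, no compression, no summarization loss.
  return turns.join("\n\n");
}

console.log(restoreTranscript(process.argv[2] ?? ""));
```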