Skill

Claude Code Cost Optimization

Optimizes Claude Code costs: track tokens and USD with /cost, route models (Haiku/Sonnet/Opus), reduce via /compact/grep/sub-agents, maximize prompt caching.

Anthropic

Bash

ai-ml

developer-tools

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/claude-code-expert:cost-optimization

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Complete guide to managing costs, model routing, token usage, and caching.

SKILL.md

203 lines · ~1.3k tokens

Stats

LanguageTypeScript

Parent stars12

Parent forks1

MaintenanceExcellent

Last CommitMar 31, 2026

Actions

View Source View Plugin View on GitHub View README

Model	ID	Best For	Cost
Opus 4.6	`claude-opus-4-6`	Architecture, complex decisions	Highest
Sonnet 4.6	`claude-sonnet-4-6`	General development, implementation	Medium
Haiku 4.5	`claude-haiku-4-5-20251001`	Quick lookups, simple tasks	Lowest

Task	Approximate Cost
Simple question	$0.01 - $0.05
Code review (1 file)	$0.05 - $0.15
Feature implementation	$0.20 - $1.00
Complex refactoring	$0.50 - $2.00
Full project analysis	$1.00 - $5.00

Model	ID	Best For	Cost
Opus 4.6	`claude-opus-4-6`	Architecture, complex decisions	Highest
Sonnet 4.6	`claude-sonnet-4-6`	General development, implementation	Medium
Haiku 4.5	`claude-haiku-4-5-20251001`	Quick lookups, simple tasks	Lowest

Task	Approximate Cost
Simple question	$0.01 - $0.05
Code review (1 file)	$0.05 - $0.15
Feature implementation	$0.20 - $1.00
Complex refactoring	$0.50 - $2.00
Full project analysis	$1.00 - $5.00

Claude Code Cost Optimization

Popularity

Invocation

Context Preview

SKILL.md

Claude Code Cost Optimization

Popularity

Invocation

Context Preview

SKILL.md

Claude Code Cost Optimization

Cost Tracking

/cost Command

Model Selection & Routing

Available Models

Switching Models

CLI Model Override

Settings Configuration

Token Reduction Strategies

1. Use /compact Frequently

2. Targeted File Reads

3. Use Sub-Agents for Research

4. Grep Before Read

5. Background Tasks

6. Clear Between Unrelated Tasks

Prompt Caching

How It Works

Maximizing Cache Hits

API-Level Caching

Provider Cost Comparison

Anthropic Direct

AWS Bedrock

Google Vertex AI

Batch Processing (50% Savings)

Cost Estimation

Rule of Thumb

Factors Affecting Cost

Best Practices

Similar Skills

Claude Code Cost Optimization

Cost Tracking

/cost Command

Model Selection & Routing

Available Models

Switching Models

CLI Model Override

Settings Configuration

Token Reduction Strategies

1. Use /compact Frequently

2. Targeted File Reads

3. Use Sub-Agents for Research

4. Grep Before Read

5. Background Tasks

6. Clear Between Unrelated Tasks

Prompt Caching

How It Works

Maximizing Cache Hits

API-Level Caching

Provider Cost Comparison

Anthropic Direct

AWS Bedrock

Google Vertex AI

Batch Processing (50% Savings)

Cost Estimation

Rule of Thumb

Factors Affecting Cost

Best Practices

Similar Skills