Optimizes LLM API costs for Claude/GPT calls via task-complexity model routing, immutable budget tracking, narrow transient-error retries, and prompt caching. For batch tasks with budget limits.
npx claudepluginhub xu-xiang/everything-claude-code-zh
Patterns for controlling LLM API costs while preserving quality. Combines model routing, budget tracking, retry logic, and prompt caching into one reusable pipeline.
Automatically select a cheaper model for simple tasks, reserving the expensive model for complex ones.
MODEL_SONNET = "claude-sonnet-4-6"
MODEL_HAIKU = "claude-haiku-4-5-20251001"

_SONNET_TEXT_THRESHOLD = 10_000  # character-count threshold
_SONNET_ITEM_THRESHOLD = 30      # item-count threshold


def select_model(
    text_length: int,
    item_count: int,
    force_model: str | None = None,
) -> str:
    """Select a model based on task complexity."""
    if force_model is not None:
        return force_model
    if text_length >= _SONNET_TEXT_THRESHOLD or item_count >= _SONNET_ITEM_THRESHOLD:
        return MODEL_SONNET  # complex task
    return MODEL_HAIKU  # simple task (3-4x cheaper)
Track cumulative spend with frozen dataclasses. Every API call returns a new tracker; the original state is never mutated.
from dataclasses import dataclass


@dataclass(frozen=True, slots=True)
class CostRecord:
    model: str
    input_tokens: int
    output_tokens: int
    cost_usd: float


@dataclass(frozen=True, slots=True)
class CostTracker:
    budget_limit: float = 1.00
    records: tuple[CostRecord, ...] = ()

    def add(self, record: CostRecord) -> "CostTracker":
        """Return a new tracker with the record appended (never mutates self)."""
        return CostTracker(
            budget_limit=self.budget_limit,
            records=(*self.records, record),
        )

    @property
    def total_cost(self) -> float:
        return sum(r.cost_usd for r in self.records)

    @property
    def over_budget(self) -> bool:
        return self.total_cost > self.budget_limit
Retry only on transient errors. Fail fast on authentication or bad-request errors.
import time

from anthropic import (
    APIConnectionError,
    InternalServerError,
    RateLimitError,
)

_RETRYABLE_ERRORS = (APIConnectionError, RateLimitError, InternalServerError)
_MAX_RETRIES = 3


def call_with_retry(func, *, max_retries: int = _MAX_RETRIES):
    """Retry only on transient errors; re-raise everything else immediately."""
    for attempt in range(max_retries):
        try:
            return func()
        except _RETRYABLE_ERRORS:
            if attempt == max_retries - 1:
                raise
            time.sleep(2 ** attempt)  # exponential backoff
    # AuthenticationError, BadRequestError, etc. -> raised immediately
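The retry shape is independent of the Anthropic SDK: it works with any tuple of retryable exception types. In this sketch a hypothetical flaky function stands in for the API call, with `ConnectionError` playing the transient role and the backoff shortened so the demo runs quickly:

```python
import time


def call_with_retry(func, *, retryable=(ConnectionError,), max_retries=3):
    for attempt in range(max_retries):
        try:
            return func()
        except retryable:
            if attempt == max_retries - 1:
                raise
            time.sleep(0.01 * 2 ** attempt)  # exponential backoff (scaled down for the demo)


attempts = {"n": 0}


def flaky():
    """Fails twice with a transient error, then succeeds."""
    attempts["n"] += 1
    if attempts["n"] < 3:
        raise ConnectionError("transient")
    return "ok"


assert call_with_retry(flaky) == "ok"
assert attempts["n"] == 3  # two retries, then success
```

A non-retryable exception (say, `ValueError`) would propagate out of `call_with_retry` on the first attempt, which is exactly the fail-fast behavior wanted for authentication and bad-request errors.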
Cache long system prompts so they are not re-sent in full on every request.
messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": system_prompt,
                "cache_control": {"type": "ephemeral"},  # cache this block
            },
            {
                "type": "text",
                "text": user_input,  # variable part
            },
        ],
    }
]
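The pipeline below calls a `build_cached_messages` helper that is not shown in the source; a minimal sketch of one possible implementation, returning exactly the structure above:

```python
def build_cached_messages(system_prompt: str, user_input: str) -> list[dict]:
    """Build a single user message whose system-prompt block is marked for caching."""
    return [
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": system_prompt,
                    "cache_control": {"type": "ephemeral"},  # cached block
                },
                {"type": "text", "text": user_input},  # variable block
            ],
        }
    ]


msgs = build_cached_messages("You are a classifier.", "Label this text.")
assert msgs[0]["role"] == "user"
assert msgs[0]["content"][0]["cache_control"] == {"type": "ephemeral"}
```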
Combine all four techniques in a single pipeline function:
def process(text: str, config: Config, tracker: CostTracker) -> tuple[Result, CostTracker]:
    # 1. Route to a model
    model = select_model(len(text), estimated_items, config.force_model)
    # 2. Check the budget
    if tracker.over_budget:
        raise BudgetExceededError(tracker.total_cost, tracker.budget_limit)
    # 3. Call with retry and caching
    response = call_with_retry(lambda: client.messages.create(
        model=model,
        messages=build_cached_messages(system_prompt, text),
    ))
    # 4. Track the cost (immutable pattern)
    record = CostRecord(model=model, input_tokens=..., output_tokens=..., cost_usd=...)
    tracker = tracker.add(record)
    return parse_result(response), tracker
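Because the tracker is immutable, a batch driver threads it through each call and stops cleanly when the budget trips. This sketch uses a simplified tracker and a hypothetical `fake_process` standing in for the real pipeline function:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tracker:
    budget_limit: float = 0.01
    total_cost: float = 0.0

    def add(self, cost: float) -> "Tracker":
        return Tracker(self.budget_limit, self.total_cost + cost)

    @property
    def over_budget(self) -> bool:
        return self.total_cost > self.budget_limit


def fake_process(text: str, tracker: Tracker) -> tuple[str, Tracker]:
    """Stand-in for the pipeline: budget check, then a fixed-cost 'call'."""
    if tracker.over_budget:
        raise RuntimeError("budget exceeded")
    return text.upper(), tracker.add(0.004)


tracker = Tracker()
results = []
for item in ["a", "b", "c", "d"]:
    try:
        result, tracker = fake_process(item, tracker)
    except RuntimeError:
        break  # budget tripped; stop the batch
    results.append(result)

assert results == ["A", "B", "C"]  # fourth item found the budget exceeded
assert tracker.over_budget
```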
| Model | Input ($/1M tokens) | Output ($/1M tokens) | Relative cost |
|---|---|---|---|
| Haiku 4.5 | $0.80 | $4.00 | 1x |
| Sonnet 4.6 | $3.00 | $15.00 | ~4x |
| Opus 4.5 | $15.00 | $75.00 | ~19x |
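The table translates directly into a per-call cost estimate. This sketch (the `estimate_cost` helper is illustrative, not part of the source) uses the Haiku and Sonnet rows above:

```python
# USD per 1M tokens (input, output), from the pricing table above.
PRICES = {
    "claude-haiku-4-5-20251001": (0.80, 4.00),
    "claude-sonnet-4-6": (3.00, 15.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    in_rate, out_rate = PRICES[model]
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


# 10k input / 1k output tokens on each model:
haiku = estimate_cost("claude-haiku-4-5-20251001", 10_000, 1_000)
sonnet = estimate_cost("claude-sonnet-4-6", 10_000, 1_000)
assert round(haiku, 4) == 0.012
assert round(sonnet, 4) == 0.045  # ~3.75x Haiku, matching the ~4x column
```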