From llm-router
Track and report how much you've saved by routing tasks to cheaper models.
npx claudepluginhub ypollak2/llm-router --plugin llm-routerThis skill uses the workspace's default tool permissions.
Track and report how much you've saved by routing tasks to cheaper models.
Guides Next.js Cache Components and Partial Prerendering (PPR): 'use cache' directives, cacheLife(), cacheTag(), revalidateTag() for caching, invalidation, static/dynamic optimization. Auto-activates on cacheComponents: true.
Guides building MCP servers enabling LLMs to interact with external services via tools. Covers best practices, TypeScript/Node (MCP SDK), Python (FastMCP).
Share bugs, ideas, or general feedback.
Track and report how much you've saved by routing tasks to cheaper models.
llm_savings — Cross-session savings by period (today / week / month / all time)
llm_usage("all") — Full dashboard: subscription %, Codex status, savings, providers
llm_dashboard — Open web dashboard at localhost:7337
Send a weekly savings summary to your team channel:
llm_digest(period="week") — format digest, print only
llm_digest(period="week", send=True) — format + push to LLM_ROUTER_WEBHOOK_URL
Set LLM_ROUTER_WEBHOOK_URL=https://hooks.slack.com/... in your environment.
Auto-detects Slack, Discord, or generic JSON webhook from the URL.
The digest automatically flags when today's spend is > 2× the 7-day average. No config needed — it's always on.
llm_policy — Show active org/repo routing policy + last 10 audit decisions
Set policy in ~/.llm-router/org-policy.yaml:
block_models:
- "o3*" # never use o3 series (too expensive)
- "gpt-4o"
allow_models:
- "gemini*" # prefer Gemini (allow overrides block)
task_caps:
image: 2.00 # max $2/day on image generation
llm_benchmark — Per-task routing accuracy from your 👍/👎 feedback
llm_rate — Rate the last response to improve future routing
llm_quality_report — Full routing stats, classifier accuracy, savings metrics