Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By zerogpu
Offload cheap AI tasks (classification, summarization, extraction, follow-ups, small-model chat) to ZeroGPU small/nano models.
npx claudepluginhub zerogpu/zerogpu-router --plugin zerogpu-routerCut your AI costs. Route trivial tasks — summarize, classify, redact PII, extract JSON, generate follow-ups, short chat — to small/nano models instead of burning frontier-model tokens.

ZeroGPU Router is a smart task router for AI agents. It exposes task-specific tools — summarize, classify, redact PII, extract JSON, and more — via the Model Context Protocol (MCP), backed by small language models that run for a fraction of the cost of a frontier model.
Your agent keeps doing the heavy reasoning. The boring stuff gets routed to ZeroGPU.
savings_usd figure.https://mcp.zerogpu.ai/mcp. We run the routing layer.You need a ZeroGPU API key and project ID. Grab them at platform.zerogpu.ai.
Register the hosted MCP endpoint:
claude mcp add --transport http zerogpu \
https://mcp.zerogpu.ai/mcp \
--header "x-api-key: <your-api-key>" \
--header "x-project-id: <your-project-id>"
Install the routing skill so Claude knows when to use the tools — see agents/claude/.
Restart Claude Code and try:
summarize this paragraph: Renewable energy adoption is accelerating globally, driven by falling solar and wind costs.
Claude will call zerogpu_summarize and reply with a summary plus a savings line.
Register the hosted MCP endpoint in OpenClaw:
openclaw mcp set zerogpu '{
"url": "https://mcp.zerogpu.ai/mcp",
"transport": "streamable-http",
"headers": {
"x-api-key": "<your-api-key>",
"x-project-id": "<your-project-id>"
}
}'
Install the routing skill — full instructions in agents/openclaw/.
Then ask your agent the same prompt above. The agent will call zerogpu_summarize instead of answering with the host model.
Sign in at platform.zerogpu.ai to:
The hosted Router at https://mcp.zerogpu.ai/mcp is the one your agent talks to. The dashboard at platform.zerogpu.ai is where you see what it did.
ZeroGPU Router exposes eleven task-specific routes:
| Route | Workload | Model |
|---|---|---|
zerogpu_classify_iab | IAB topic classification | zlm-v1-iab-classify-edge |
zerogpu_summarize | TL;DRs, abstracts, meeting note summaries | t5-small |
zerogpu_classify_zero_shot | Classify against a flat label list | deberta-v3-small |
zerogpu_extract_entities | Extract people, places, companies, dates, custom entities | gliner2-base-v1 |
zerogpu_extract_json | Pull structured fields into grouped JSON | gliner2-base-v1 |
zerogpu_classify_structured | Multi-axis schema classification | gliner2-base-v1 |
zerogpu_redact_pii | Mask emails, phones, names, addresses, other PII | gliner-multi-pii-v1 |
zerogpu_extract_pii | Extract PII grouped by category | gliner-multi-pii-v1 |
zerogpu_generate_followups | Generate follow-up questions from a passage | zlm-v1-followup-questions-edge |
zerogpu_chat | Short small-model chat replies | LFM2.5-1.2B-Instruct / -Thinking |
zerogpu_health | Verify ZeroGPU backend health | — |
Every route returns { <task fields>, model, usage, savings }.
| Package | Role |
|---|---|
| agents/claude/ | Claude Code marketplace plugin + routing skill |
| agents/openclaw/ | OpenClaw plugin (zerogpu-router) + routing skill + MCP registration JSON |
Share bugs, ideas, or general feedback.
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Natural language processing and text analysis
Smart LLM routing with Claude subscription monitoring, complexity-first model selection, and 20+ AI providers
A real-time directory of AI models that allows your AI agent to advise and pick the ideal LLM for the user's task.
Faithful information summarization with fidelity preservation, structured output, and anti-hallucination methodology. Provides skills for file, URL, and image summarization; agents for autonomous summarization tasks; and hooks for validating agent output structure.
Train task-specific small language models (SLMs) using the Distil Labs CLI and platform
Agent Skills for Together AI platform — inference, training, embeddings, audio, video, images, function calling, and infrastructure
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claim