By adibirzu
Multi-LLM gateway plugin for Claude Code and Codex workflows — phase-based orchestration, 8 agents, 11 commands, checkpoint discipline
Ask a question to a specific LLM model (e.g., /llm-ask oca/gpt5 explain this code)
Share or retrieve working context between Claude and Codex sessions
Query multiple LLMs in parallel for diverse perspectives
Open the MultiLLM dashboard showing sessions, token usage, costs, and backend status. Use when the user asks about LLM usage, costs, dashboard, or wants to see model statistics.
Discover available models from all LLM backends
Use this agent when the user needs a major architectural decision reviewed by multiple AI models simultaneously. It queries Claude Haiku, GPT-4o-mini, DeepSeek, and local Llama in parallel and synthesizes a consensus answer. Invoke for: "get multiple opinions", "council review", "compare LLMs on this", or any architectural decision that benefits from diverse perspectives.
Use this agent for thorough code review combining multiple LLM perspectives. Unlike security-reviewer (security-only), this covers correctness, design, performance, and maintainability. Invoke when: code has been written or modified, a PR needs review, or the user asks for feedback on implementation quality. Automatically invoked by work-orchestrator during QA phase.
Comprehensive reference agent for all cross-LLM collaboration patterns. Use this when you need to understand how to route work between models, share context across sessions, use shared memory, or leverage the full MultiLLM agent roster. This agent documents all available tools, agents, commands, and orchestration patterns.
Use this agent to compress large files, logs, or documents into a short summary using a FREE local Ollama model — preserving tokens in the main conversation. Invoke when: files are large (>200 lines), when exploring logs, or when the user says "summarize this file cheaply" or "save tokens".
Use this agent to do a security code review of any code, config, or infrastructure change. It asks GPT-4o (via OpenRouter) for a second opinion so you get a perspective from a different model family than Claude. Invoke when: code changes touch auth, crypto, network rules, IAM, secrets, or anything security-sensitive. Also invoke explicitly with "security-reviewer: review this".
Open the MultiLLM dashboard showing sessions, token usage, costs, and backend status. Use when the user asks about LLM usage, costs, dashboard, or wants to see model statistics.
Route work through the local MultiLLM gateway and decide when to ask other LLMs or helper agents for support. Use when Codex should leverage Claude, OCA, GPT, local models, or MultiLLM specialist agents for second opinions, architecture review, security review, context handoff, dashboard checks, or multi-device session consolidation.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Claude Code plugin marketplace by adibirzu.
| Plugin | Description | Version |
|---|---|---|
| prod-ready | Pre-production security audit, dependency hardening, CI/CD validation, and Docker readiness checks | 1.0.0 |
| rlm | Recursive Language Model v3 — dual-mode execution, git-aware incremental analysis, memory persistence | 3.0.0 |
| multillm | Multi-LLM gateway with 16+ backends, model discovery, session tracking, usage dashboard, and cross-LLM memory | 0.5.1 |
Add this marketplace to Claude Code:
/plugin marketplace add adibirzu/adibirzu-plugins
Then install individual plugins:
/plugin install prod-ready@adibirzu-plugins
/plugin install rlm@adibirzu-plugins
/plugin install multillm@adibirzu-plugins
MIT
npx claudepluginhub adibirzu/adibirzu-plugins --plugin multillmTenancy-agnostic Oracle Cloud Infrastructure (OCI) administration for Claude Code — safety-first skills for IAM, Security & Compliance, Observability & Database, Autonomous Database, Networking & Compute, Cost & Usage (FinOps), Log Analytics (OCL queries), Resource Manager (Terraform stacks), Data Safe, and Events & Functions (serverless). Plus a project lifecycle orchestrator and a Stage 0 solution-design front-end, all grounded in official Oracle docs (Open Knowledge Format) and routed against the upstream oracle/skills collection. Work by friendly context name instead of raw OCIDs; every mutation is preflighted, redacted, and confirmation-gated.
Recursive Language Model (RLM) v3 — dual-mode execution, git-aware incremental analysis, memory persistence, token-aware processing, FINAL protocol, adaptive budgets
Route Claude Code to 19 LLM backends (local, cloud, OCI GenAI, CLI agents incl. Claude Code/Codex/Gemini/Antigravity) through one gateway. Cost prediction, budgets, quota-aware failover, model fusion (panel+judge), log-driven routing, council/2nd-opinion, shared memory, and a real-time dashboard — all local, tenancy-agnostic. Bring your own keys/env.
Pre-production security audit, dependency hardening, CI/CD validation, and Docker readiness checks for Claude Code
Intelligent delegation framework for routing tasks to external LLM services while retaining strategic oversight
Flagship+ skill pack for OpenRouter - 30 skills for multi-model routing, fallbacks, and LLM gateway mastery
Multi-LLM integration for second opinions and task delegation
Run any model with an Anthropic- or OpenAI-compatible API (e.g. DeepSeek, GLM, Kimi, Qwen, MiniMax) — even your Codex subscription — as real Claude Code workflows, agent-team teammates, or one-shot subagents, driven exactly like native ones. Your main session's own auth is untouched (OAuth subscription or API key, either works); API-key providers bill the provider key via apiKeyHelper, while a Codex subscription bills through a local OAuth daemon — each worker receives its credential on demand, never through its env or argv. Requires the `cc-fleet` binary on PATH, installed separately.
When calling LLM APIs from Python code. When connecting to llamafile or local LLM servers. When switching between OpenAI/Anthropic/local providers. When implementing retry/fallback logic for LLM calls. When code imports litellm or uses completion() patterns.
Fuse the Claude Code model with OpenAI Codex and agy: query all three in parallel, then Claude judges, synthesizes, and acts.