Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Run inference-time scaling for LLMs by generating multiple candidates and selecting the best using voting, scoring, or search algorithms (Self-Consistency, Best-of-N, Beam Search, Particle Filtering) from JSONL, CSV, or TXT files or interactive prompts, with guided setup and budget tuning
npx claudepluginhub red-hat-ai-innovation-team/its_hub --plugin its-hubUse when the user wants to run inference-time scaling on multiple prompts from a file (JSONL, CSV, or TXT). Applies to batch processing, evaluation runs, or dataset-level scaling.
Guides users through inference-time scaling with its_hub, including algorithm selection (Self-Consistency, Best-of-N, Beam Search, Particle Filtering), budget tuning, reward model setup, tool-calling integration, interpreting results, and troubleshooting. Use when the user is working with its_hub, asking about scaling algorithms, debugging scaling issues, or tuning inference quality.
Use when the user wants to run inference-time scaling on a prompt — detect environment, execute scaling, and present results. For algorithm selection, budget tuning, reward models, and troubleshooting, consult the inference-scaling-guide skill.
Use when the user wants to set up inference-time scaling for the first time, or when its_hub is not yet installed/configured in the current environment.
Share bugs, ideas, or general feedback.
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Orchestrate multiple open-weight LLMs via Fireworks AI to deliberate on queries using Karpathy's LLM Council concept. Models respond individually, rank each other's responses, then a Chairman synthesizes the best answer. Powered entirely by fast, affordable open-weight models on Fireworks.
A real-time directory of AI models that allows your AI agent to advise and pick the ideal LLM for the user's task.
Professional AI/ML Engineering toolkit: Prompt engineering, LLM integration, RAG systems, AI safety with 12 expert plugins
Smart LLM routing with Claude subscription monitoring, complexity-first model selection, and 20+ AI providers
Evaluate and compare ML model performance metrics
OpenRouter SDK plugin - unified interface for 500+ LLM models with intelligent routing, cost optimization, and framework integrations (Vercel AI SDK, LangChain, OpenAI SDK, PydanticAI)
LLM post-training — unified interface for SFT, OSFT, LoRA fine-tuning, and GRPO reinforcement learning
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claim