Help us improve
Share bugs, ideas, or general feedback.
From its-hub
Guides users through first-time setup of its_hub for inference-time scaling, including environment detection, installation, and configuration of model endpoint, scaling algorithm, and budget.
npx claudepluginhub red-hat-ai-innovation-team/its_hub --plugin its-hubHow this skill is triggered — by the user, by Claude, or both
Slash command
/its-hub:setup-guideThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are helping the user set up inference-time scaling.
Detects the inference-scaling environment, runs inference-time scaling on a prompt, and presents results with vote counts or scores.
Generates validated, runnable implementation plans for ML pipelines, architecture designs, and multi-step projects grounded in official framework documentation.
Automates Ollama installation, hardware-based model selection, GPU setup, and client integration (Python/Node.js/REST) for local LLM inference on macOS/Linux/Docker.
Share bugs, ideas, or general feedback.
You are helping the user set up inference-time scaling.
"${CLAUDE_PLUGIN_ROOT}/scripts/its_detect.sh"
Also check for API keys in the environment:
echo "openai_key=${OPENAI_API_KEY:+found}" "anthropic_key=${ANTHROPIC_API_KEY:+found}" "endpoint=${OPENAI_BASE_URL:-none}"
If library=missing:
installer=uv: run uv pip install "its_hub[lm]"installer=pip: run pip install "its_hub[lm]"installer=none: tell the user they need Python 3.11+ and pip/uv installed firstIf an API key was detected, offer a one-question fast path:
"I detected your OpenAI API key. I can set up with these defaults:
- Model:
gpt-4o-mini- Algorithm:
self-consistency- Budget:
8Accept these defaults, or would you like to customize?"
If the user accepts, skip to Step 5 using:
endpoint: https://api.openai.com/v1 (or OPENAI_BASE_URL if set)model: gpt-4o-minialgorithm: self-consistencybudget: 8If no API key was detected, or the user wants to customize, proceed to Step 4.
Ask these questions one at a time:
http://localhost:8000/v1 for vLLM, https://api.openai.com/v1 for OpenAIgpt-4o, Qwen/Qwen2.5-32B-Instructinference-scaling-guide skill for guidance if the user is unsure.
inference-scaling-guide skill for guidance if the user is unsure.Self-consistency:
\boxed{...} for math). If unsure, skip — default voting works for most cases."tool_name, tool_args, or tool_hierarchicalBest-of-N:
API keys are read from environment variables — never store them in the config file.
If no API key was detected in Step 1, tell the user to set the appropriate environment variable:
"Set your API key as an environment variable before running scaling:
export OPENAI_API_KEY="sk-..." # OpenAI or OpenAI-compatible endpoints export ANTHROPIC_API_KEY="sk-ant-..." # Anthropic endpointsFor local vLLM endpoints that don't require authentication, no API key is needed."
Write the config to .its-hub/config.json:
{
"models": {
"default": {
"endpoint": "<endpoint>",
"model": "<model_name>"
}
},
"algorithm": "<algorithm>",
"budget": "<budget>",
"algorithm_config": {}
}
Add .its-hub/ to .gitignore if not already present.
Confirm the config file was written, then report success:
"Setup complete! To run scaling, use the
inference-scalingskill.API keys are read from environment variables, not the config file. Make sure the appropriate variable is set in your shell:
export OPENAI_API_KEY="sk-..." # OpenAI or OpenAI-compatible endpoints (including vLLM with auth) export ANTHROPIC_API_KEY="sk-ant-..." # Anthropic endpointsLocal endpoints (e.g., vLLM without auth) don't need an API key."
If this skill is invoked again and a config already exists, ask: "You already have a configuration. Do you want to update it or add another model?"
If adding a model: collect endpoint and model name. Add a new entry to the models dict using the model name as key.