From its-hub
Runs inference-time scaling on multiple prompts from JSONL, CSV, or TXT files. Useful for batch processing, evaluation runs, or dataset-level scaling.
npx claudepluginhub red-hat-ai-innovation-team/its_hub --plugin its-hub
Slash command: /its-hub:batch-scaling
Run inference-time scaling on multiple prompts from a file.
Supported input formats:
JSONL: each record has a prompt or messages field
CSV: a prompt column
Detect the environment first:
"${CLAUDE_PLUGIN_ROOT}/scripts/its_detect.sh"
If config=missing, tell the user to run the setup-guide skill first.
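Before calling the batch script, it can help to confirm that the input file has the fields described above. The following is a minimal sketch, not part of the plugin: `check_jsonl` and `check_csv` are hypothetical helper names, and skipping blank lines is an assumption about how the script treats them.

```python
import csv
import io
import json


def check_jsonl(text):
    """Return (line_number, message) pairs for records that look unusable."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # assumption: blank lines are ignored
        try:
            record = json.loads(line)
        except json.JSONDecodeError as exc:
            problems.append((number, f"invalid JSON: {exc.msg}"))
            continue
        # Each record needs a prompt or a messages field.
        has_field = isinstance(record, dict) and (
            "prompt" in record or "messages" in record
        )
        if not has_field:
            problems.append((number, "missing prompt or messages field"))
    return problems


def check_csv(text):
    """Return True when the CSV header contains a prompt column."""
    reader = csv.DictReader(io.StringIO(text))
    return "prompt" in (reader.fieldnames or [])
```

Running `check_jsonl` on the file contents returns an empty list when every record carries one of the two fields, so a non-empty result can be shown to the user before any scaling budget is spent.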
Call the batch scaling script with the input file and any overrides:
"${CLAUDE_PLUGIN_ROOT}/scripts/its_batch_scale.sh" [--algorithm ALG] [--budget N] [--model KEY] [--output FILE] <input-file>
The script loads config once and processes all prompts in a single process. Default output path is results/<input_name>_scaled.jsonl.
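The invocation and default output path described above can be sketched as follows. This is an illustration under stated assumptions: `build_command` and `default_output_path` are hypothetical helpers, and `<input_name>` is assumed to mean the input file name without its directory or extension.

```python
import os
from pathlib import PurePosixPath


def build_command(input_file, algorithm=None, budget=None, model=None,
                  output=None, plugin_root=None):
    """Assemble the argument list for its_batch_scale.sh."""
    if plugin_root is None:
        plugin_root = os.environ.get("CLAUDE_PLUGIN_ROOT", "")
    command = [f"{plugin_root}/scripts/its_batch_scale.sh"]
    overrides = (
        ("--algorithm", algorithm),
        ("--budget", budget),
        ("--model", model),
        ("--output", output),
    )
    # Only pass the overrides the user actually supplied.
    for flag, value in overrides:
        if value is not None:
            command += [flag, str(value)]
    command.append(input_file)  # the input file comes last
    return command


def default_output_path(input_file):
    """Mirror results/<input_name>_scaled.jsonl (stem-based, by assumption)."""
    stem = PurePosixPath(input_file).stem
    return str(PurePosixPath("results") / f"{stem}_scaled.jsonl")
```

The list returned by `build_command` can be passed directly to `subprocess.run`, which avoids shell quoting issues with file names that contain spaces.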
The script outputs a JSON summary with total, succeeded, failed, failures, and output_file.
Report: "N/M prompts completed successfully. K failed. Results written to <output_file>"
If there were failures, list the line numbers and error messages.
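The reporting step above can be sketched as a small formatter over the JSON summary. The top-level keys come from the summary description; the shape of each `failures` entry is not specified here, so the `line` and `error` keys below are assumptions, as is the helper name `format_report`.

```python
import json


def format_report(summary_json):
    """Turn the script's JSON summary into the user-facing report text."""
    summary = json.loads(summary_json)
    lines = [
        f"{summary['succeeded']}/{summary['total']} prompts completed "
        f"successfully. {summary['failed']} failed. "
        f"Results written to {summary['output_file']}"
    ]
    # Assumed shape: each failure is an object with "line" and "error" keys.
    for failure in summary.get("failures", []):
        lines.append(f"  line {failure['line']}: {failure['error']}")
    return "\n".join(lines)
```

When `failures` is empty or absent, the report is the single summary line.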