Skill

inference-scaling

Detects the inference-time scaling environment and executes scaling on a prompt, then presents the selected response and configuration metadata.

ai-ml

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command: /its-hub:inference-scaling

User invocable

Model invocable

Inline context

Default effort

Uses dynamic context injection — preprocesses shell commands at runtime

Tool Access

This skill is limited to the following tools:

Bash(${CLAUDE_PLUGIN_ROOT}/scripts/its_scale.sh:*)

Bash(${CLAUDE_PLUGIN_ROOT}/scripts/its_detect.sh:*)

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Execute scaling on a prompt. For algorithm guidance, budget tuning, and troubleshooting, consult the `inference-scaling-guide` skill.

SKILL.md

44 lines · ~398 tokens

Stats

Language: Python

Stars: 36

Forks: 18

Maintenance: Excellent

Last Commit: Jul 16, 2026

Run Inference-Time Scaling

Execute scaling on a prompt. For algorithm guidance, budget tuning, and troubleshooting, consult the inference-scaling-guide skill.

Step 1: Check Environment

"${CLAUDE_PLUGIN_ROOT}/scripts/its_detect.sh"

If not ready

If the output reports `library=missing` or `config=missing`, invoke the setup-guide skill.

If ready (`library=installed`, `config=found`)

Proceed to Step 2.
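The readiness check above can be sketched in Python. This is a minimal illustration, not the plugin's own logic: it assumes the detection script prints whitespace-separated `key=value` tokens such as `library=installed config=found`, which is inferred only from the status strings quoted above. The function names `parse_status` and `next_step` are hypothetical.

```python
def parse_status(output: str) -> dict[str, str]:
    """Parse whitespace-separated key=value tokens into a dict.

    Assumption: the detection output looks like
    'library=installed config=found' (spaces or newlines between tokens).
    """
    status = {}
    for token in output.split():
        key, sep, value = token.partition("=")
        if sep:  # ignore tokens that are not key=value pairs
            status[key] = value
    return status


def next_step(output: str) -> str:
    """Return which skill or step to go to, given the detection output."""
    status = parse_status(output)
    ready = (
        status.get("library") == "installed"
        and status.get("config") == "found"
    )
    # Anything other than the two expected values is treated as not ready.
    return "execute-scaling" if ready else "setup-guide"


print(next_step("library=installed config=found"))  # execute-scaling
print(next_step("library=missing config=found"))    # setup-guide
```

Treating any unexpected or missing value as "not ready" keeps the sketch conservative: an empty or malformed report routes to setup rather than to execution.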

Step 2: Execute Scaling

Run the scaling script with the user's prompt and any overrides:

"${CLAUDE_PLUGIN_ROOT}/scripts/its_scale.sh" --metadata $ARGUMENTS

If the user provides a file path (e.g., "scale all prompts in data/eval.jsonl"), invoke the batch-scaling skill instead.
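The routing rule above (single prompt versus file of prompts) could be approximated as follows. This is only a sketch under stated assumptions: the skill itself leaves the decision to the model, and the extension list and the function name `choose_skill` are hypothetical, chosen to match the `data/eval.jsonl` example.

```python
import re

# Hypothetical heuristic: a token ending in a common data-file extension
# is treated as a file path. The extension list is an assumption.
_FILE_PATTERN = re.compile(r"\S+\.(?:jsonl|json|csv|txt)\b", re.IGNORECASE)


def choose_skill(request: str) -> str:
    """Pick the batch skill when the request appears to name a data file."""
    if _FILE_PATTERN.search(request):
        return "batch-scaling"
    return "inference-scaling"


print(choose_skill("scale all prompts in data/eval.jsonl"))  # batch-scaling
print(choose_skill("What is 17 * 23?"))                      # inference-scaling
```

A pattern match like this misses paths without a recognized extension, so it is best read as an illustration of the rule rather than a replacement for judgment.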

Step 3: Present Results

- Selected response: show the winning response prominently.
- Metadata (if available):
  - Self-consistency: show vote counts ("Selected by majority vote — 5/8 responses agreed")
  - Best-of-N: show scores ("Selected as highest scoring — score: 0.92 out of 8 candidates")
- Configuration used: algorithm, budget, model (briefly).
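To make the two metadata summaries concrete, here is a generic sketch of how a vote tally and a best-score selection can be computed. It is not the scaling library's implementation, and the input shapes (a list of answer strings, a list of `(response, score)` pairs) are assumptions for illustration only; the actual metadata format returned by the script is not shown in this document.

```python
from collections import Counter


def majority_vote(answers: list[str]) -> tuple[str, str]:
    """Self-consistency: pick the most frequent answer and report the tally.

    Assumes a non-empty list; ties go to the answer seen first.
    """
    counts = Counter(answers)
    winner, votes = counts.most_common(1)[0]
    return winner, f"{votes}/{len(answers)} responses agreed"


def best_of_n(candidates: list[tuple[str, float]]) -> tuple[str, str]:
    """Best-of-N: pick the (response, score) pair with the highest score."""
    response, score = max(candidates, key=lambda pair: pair[1])
    return response, f"score: {score:.2f} out of {len(candidates)} candidates"


answers = ["42", "42", "41", "42", "40", "42", "39", "42"]
print(majority_vote(answers))  # ('42', '5/8 responses agreed')

scored = [("draft A", 0.35), ("draft B", 0.92), ("draft C", 0.50)]
print(best_of_n(scored))       # ('draft B', 'score: 0.92 out of 3 candidates')
```

The returned summary strings mirror the fragments quoted in the list above, so they can be dropped into the presentation text directly.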

If scaling fails, consult the inference-scaling-guide skill for troubleshooting.
