From togetherai-skills
Deploys custom Dockerized inference workers on Together AI GPUs using Sprocket SDK and Jig CLI. Submits async queue jobs and polls results for container-level control beyond standard endpoints.
`npx claudepluginhub togethercomputer/skills`
Use Dedicated Container Inference when the user needs a custom runtime, not just managed model hosting.
Prefer a different skill when one fits better:

- `together-dedicated-endpoints` for standard model hosting without custom containers
- `together-gpu-clusters` for full cluster ownership and orchestration control
- `together-chat-completions`, `together-images`, or `together-video` when a serverless product already covers the task

Core building blocks:

- `pyproject.toml` for image, runtime, autoscaling, and mounts
- The Together Python SDK (`together>=2.0.0`). If the user is on an older version, they must upgrade first: `uv pip install --upgrade "together>=2.0.0"`
- `pyproject.toml` as the source of truth for deployment behavior
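Since `pyproject.toml` is the source of truth for image, runtime, autoscaling, and mounts, a deployment section might look like the fragment below. The table and key names are assumptions for illustration only; the document does not show the actual schema.

```toml
# Hypothetical layout — table and key names are assumptions,
# not the documented schema.
[tool.sprocket]
image = "ghcr.io/example/inference-worker:latest"  # container image to deploy
runtime = "python3.11"                             # runtime inside the container

[tool.sprocket.autoscaling]
min_replicas = 0   # scale to zero when idle
max_replicas = 4   # cap GPU spend

[[tool.sprocket.mounts]]
source = "models/"       # local path to sync
target = "/srv/models"   # mount point in the container
```

Keeping this in `pyproject.toml` means the deployment config is versioned alongside the worker code itself.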