Marketplace

sparkrun

AI-assisted inference on NVIDIA DGX Spark - run, manage, and stop LLM workloads

npx claudepluginhub spark-arena/sparkrun

1 Plugin

sparkrun v0.0.4 (spark-arena)

Stats

Plugins: 1
Stars: 236
Updated: May 31, 2026

One command to rule them all

Launch, manage, and stop LLM inference workloads on one or more NVIDIA DGX Spark systems — no Slurm, no Kubernetes, no fuss.

Install

uvx sparkrun setup

One command — installs sparkrun, then launches the guided setup wizard to create a cluster, configure SSH mesh, detect ConnectX-7 NICs, set up sudoers, and enable earlyoom.

Quick Start

# Run an inference workload
sparkrun run qwen3-1.7b-vllm

# Multi-node tensor parallelism (TP maps to node count on DGX Spark)
sparkrun run qwen3-1.7b-vllm --tp 2

# Re-attach to logs, stop a workload, check status
sparkrun logs qwen3-1.7b-vllm
sparkrun stop qwen3-1.7b-vllm
sparkrun status

Ctrl+C detaches from logs — it never kills your inference job. Your model keeps serving.

See the full CLI reference for all commands and options.
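
As a rough illustration of how the Quick Start commands fit together, the sketch below wraps them in a preview helper. The names `spark`, `preview_session`, `DRY_RUN`, `RECIPE`, and `TP` are hypothetical and exist only in this example; they are not part of sparkrun. It uses only the subcommands and the `--tp` option shown above. Whether `sparkrun run` returns immediately or stays attached to logs is not stated here, so the helper defaults to printing commands rather than executing them.

```shell
#!/usr/bin/env bash
# Sketch only: preview the Quick Start command sequence.
# RECIPE, TP, DRY_RUN, spark and preview_session are hypothetical names for this example.

RECIPE="${RECIPE:-qwen3-1.7b-vllm}"  # recipe name taken from the Quick Start
TP="${TP:-2}"                        # tensor-parallel size; per the README this maps to node count
DRY_RUN="${DRY_RUN:-1}"              # 1 = print commands only (default), anything else = execute

# Print the sparkrun command when DRY_RUN=1, otherwise run it.
spark() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "sparkrun $*"
  else
    sparkrun "$@"
  fi
}

# Launch, check status, follow logs, then stop, in the order the Quick Start lists them.
preview_session() {
  spark run "$RECIPE" --tp "$TP"
  spark status
  spark logs "$RECIPE"
  spark stop "$RECIPE"
}

preview_session
```

Leaving `DRY_RUN` at its default makes the script safe to run on a machine without sparkrun installed, since it only echoes the commands it would issue.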

Highlights

Multi-runtime — vLLM, SGLang, llama.cpp out of the box
Multi-node tensor parallelism — --tp 2 = 2 hosts, automatic InfiniBand/RDMA detection
VRAM estimation — know if your model fits before you launch (sparkrun show <recipe>)
Git-based recipe registries — we publish official recipes, community recipes, and benchmarked recipes via Spark Arena, plus you can add your own registries.
Guided setup wizard — cluster creation, SSH mesh, CX7 auto-detection, sudoers, earlyoom
Model & container distribution — syncs models and images to cluster nodes over SSH automatically
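
To give a feel for what a "does it fit" check involves, here is a back-of-the-envelope sketch. It is not sparkrun's estimator, whose method is not described here: it counts model weights only (parameters times bytes per parameter) and ignores KV cache, activations, and runtime overhead. The function names and the 128 GB per-node budget in the usage lines are assumptions for illustration.

```shell
#!/usr/bin/env bash
# Sketch only: weights-only memory estimate, NOT sparkrun's actual estimation logic.
# estimate_weights_gb and fits are hypothetical helpers for this example.

# estimate_weights_gb PARAMS_BILLION BYTES_PER_PARAM
# Prints the approximate weight footprint in GB (1e9 params * bytes/param = GB).
estimate_weights_gb() {
  awk -v p="$1" -v b="$2" 'BEGIN { printf "%.1f\n", p * b }'
}

# fits NEED_GB BUDGET_GB
# Exit status 0 if NEED_GB <= BUDGET_GB, non-zero otherwise.
fits() {
  awk -v n="$1" -v m="$2" 'BEGIN { exit !(n <= m) }'
}

# Usage: a 1.7B-parameter model at 2 bytes per parameter (16-bit weights),
# checked against an assumed 128 GB memory budget per node.
need="$(estimate_weights_gb 1.7 2)"
if fits "$need" 128; then
  echo "weights ~${need} GB: within the assumed budget"
else
  echo "weights ~${need} GB: exceeds the assumed budget"
fi
```

For an authoritative figure, use `sparkrun show <recipe>` as noted in the list above; this arithmetic only indicates the order of magnitude.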

Spark Arena

Spark Arena is the community hub for DGX Spark recipe benchmarks — browse benchmark results, then run them directly with sparkrun.

Official Recipes

Official Recipes are maintained by the Spark Arena team and hosted on GitHub. They are tested and optimized for NVIDIA DGX Spark systems.

Community Recipes

Community Recipes are contributed by the community and hosted on GitHub.

License

Apache License 2.0 — see LICENSE for details.
