Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By JiusiServe
Diagnose NVIDIA LongVidio Sparse Attention (LVSA) failures: identify silent dense fallback, out-of-memory at long sequences, missing MP4 outputs in Docker, quality regressions from training references, and environment variable misconfigurations.
npx claudepluginhub jiusiserve/longvideosparseattention --plugin lvsa-troubleshootingShare bugs, ideas, or general feedback.
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Install LVSA and generate your first long video. Picks SDPA vs FlashInfer, sets the right LVSA_REFERENCE_LATENT_FRAMES per model, verifies the sparse path engaged.
Agent-ready playbooks for LLM serving benchmarks, capacity planning, torch-profiler triage, pipeline analysis, compute simulation, SGLang/vLLM SOTA Humanize loops, human code review, production incident triage, and model PR-history dossiers.
Skills for finding, comparing, running, and prompting AI models on Replicate
Agent Skills for Together AI platform — inference, training, embeddings, audio, video, images, function calling, and infrastructure
Claude Code skill pack for Runway (18 skills)
Video generation at scale. Generate videos, images, and audio with Runway's API — batch ad campaigns, product videos, multishot stories, and creative iteration. Supports seedance2, gen4.5, veo3, Nano, Banana Pro, and more.
Configure and run the LVSA vllm-omni serving plugin: env vars, geometry overrides, hooks for Wan vs HunyuanVideo, multi-GPU Ulysses.
Add LVSA support for a new video diffusion model by implementing the ModelAdapter ABC and wiring it into examples/.
Install LVSA and generate your first long video. Picks SDPA vs FlashInfer, sets the right LVSA_REFERENCE_LATENT_FRAMES per model, verifies the sparse path engaged.
Tune LVSA's sparsity_scale, window_size, n_first_frames, and reference_latent_frames for a target model and quality/speed trade-off.
Reproduce the headline paper numbers (SotA grid, latency scaling) using the bundled benchmarks/ scripts, VQeval, and VBench-Long.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claim