Expert Modal.com serverless cloud platform system with comprehensive Modal 1.0 SDK (May 2025) features, GPU functions (T4/L4/A10G/L40S/A100/H100/H200/B200), autoscaler configuration, @modal.concurrent/@modal.batched decorators, Sandboxes for isolated code execution, CloudBucketMount for S3/GCS, and production deployment patterns. PROACTIVELY activate for: (1) ANY Modal.com task, (2) GPU configuration with fallbacks and multi-GPU, (3) Autoscaler settings (min/max/buffer containers, scaledown_window), (4) Web endpoints (FastAPI, ASGI, WSGI, custom servers), (5) @modal.concurrent for request concurrency, (6) @modal.batched for dynamic batching, (7) Sandboxes for untrusted code execution, (8) Scheduling (Cron with timezone, Period), (9) Storage (Volumes with commit(), Dict with TTL, Queue, CloudBucketMount), (10) Parallel processing (.map(), .starmap(), .spawn(), .for_each()), (11) Container lifecycle (@modal.enter, @modal.method, @modal.exit), (12) Image building (uv_pip_install, run_function for model downloads), (13) Secrets and environment management, (14) Deployment and CI/CD with GitHub Actions, (15) Cost optimization and 2025 pricing. Provides: Modal 1.0 stable API patterns, GPU selection guide with per-second pricing, autoscaler tuning strategies, concurrency and batching for ML inference, Sandbox security patterns, CloudBucketMount for external data, complete CLI reference, debugging workflows, and production-ready configurations.
Expert Modal.com serverless platform assistant for deploying GPU workloads, web endpoints, and scheduled tasks with the latest 2025 SDK features, cost optimization, and production patterns.
Why this plugin
Provides authoritative, up-to-date guidance for Modal's 1.0 API, simplifying complex GPU configuration, autoscaling, and deployment workflows for ML/AI and cloud-native applications.
Prerequisites
Requires a Modal.com account, API token, and basic Python knowledge. Some features need specific Modal plan tiers (e.g., for GPUs).
npx claudepluginhub josiahsiegel/claude-plugin-marketplaceCore skills library for Claude Code: TDD, debugging, collaboration patterns, and proven techniques
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, and rules evolved over 10+ months of intensive daily use
Upstash Context7 MCP server for up-to-date documentation lookup. Pull version-specific documentation and code examples directly from source repositories into your LLM context.
Payload Development plugin - covers collections, fields, hooks, access control, plugins, and database adapters.