Plugin

sagemaker-ai

Name: sagemaker-ai
Author: awslabs

Automate SageMaker AI/ML workflows: prepare and validate datasets, fine-tune LLMs via SFT/DPO/RLVR on serverless jobs, evaluate with LLM-as-a-Judge, deploy models to endpoints/Bedrock, and diagnose/manage HyperPod clusters using generated notebooks, scripts, and AWS tools.

npx claudepluginhub awslabs/agent-plugins --plugin sagemaker-ai

Component Overview

Skills

MCP Servers

Component Details

Skills (12)

dataset-evaluation

/dataset-evaluation

Validates dataset formatting and quality for SageMaker model fine-tuning (SFT, DPO, or RLVR). Use when the user says "is my dataset okay", "evaluate my data", "check my training data", "I have my own data", or before starting any fine-tuning job. Detects file format, checks schema compliance against the selected model and technique, and reports whether the data is ready for training or evaluation.
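As a rough illustration of the kind of schema check described above, the following is a minimal sketch only, not the skill's actual implementation. It assumes OpenAI chat-style JSONL, where each line is an object with a `messages` list of `role`/`content` pairs. The function name `validate_chat_jsonl` and the set of allowed roles are hypothetical.

```python
import json

# Hypothetical schema: OpenAI chat-style records, one JSON object per line.
# A tuple is used so that unhashable role values cannot raise a TypeError.
ALLOWED_ROLES = ("system", "user", "assistant")


def validate_chat_jsonl(lines):
    """Return a list of (line_number, message) problems; empty means no problems found."""
    problems = []
    for number, raw in enumerate(lines, start=1):
        if not raw.strip():
            continue  # ignore blank lines
        try:
            record = json.loads(raw)
        except json.JSONDecodeError as exc:
            problems.append((number, f"invalid JSON: {exc.msg}"))
            continue
        messages = record.get("messages") if isinstance(record, dict) else None
        if not isinstance(messages, list) or not messages:
            problems.append((number, "missing or empty 'messages' list"))
            continue
        for index, message in enumerate(messages):
            if not isinstance(message, dict):
                problems.append((number, f"message {index} is not an object"))
            elif message.get("role") not in ALLOWED_ROLES:
                problems.append((number, f"message {index} has unknown role"))
            elif not isinstance(message.get("content"), str):
                problems.append((number, f"message {index} has non-string content"))
    return problems
```

An empty result only means the records match this assumed shape. The real skill also checks compliance against the selected model and technique, which this sketch does not attempt.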

dataset-transformation

/dataset-transformation

Generates a Jupyter notebook that transforms datasets between ML schemas for model training or evaluation. Use when the user says "transform", "convert", "reformat", "change the format", or when a dataset's schema needs to change to match the target format — always use this skill for format changes rather than writing inline transformation code. Supports OpenAI chat, SageMaker SFT/DPO/RLVR, HuggingFace preference, Bedrock Nova, VERL, and custom JSONL formats from local files or S3.
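To make the idea of a schema-to-schema transformation concrete, here is a small sketch under stated assumptions. The source records are assumed to be OpenAI chat-style, and the target `prompt`/`completion` field names are hypothetical placeholders rather than the documented SageMaker SFT schema. The skill itself generates a notebook for this work; the helper names below are illustrative only.

```python
import json


def chat_to_prompt_completion(record):
    """Map one chat-style record to a hypothetical {'prompt', 'completion'} record."""
    messages = record["messages"]
    if not messages or messages[-1]["role"] != "assistant":
        raise ValueError("expected the final message to come from the assistant")
    # Everything before the final assistant turn becomes the prompt text.
    prompt = "\n".join(f"{m['role']}: {m['content']}" for m in messages[:-1])
    return {"prompt": prompt, "completion": messages[-1]["content"]}


def transform_jsonl(lines):
    """Transform an iterable of JSONL strings, skipping blank lines."""
    return [
        json.dumps(chat_to_prompt_completion(json.loads(line)))
        for line in lines
        if line.strip()
    ]
```

Keeping the per-record mapping separate from the file loop makes it easy to swap in a different target schema without touching the I/O code.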

directory-management

/directory-management

Manages project directory setup and artifact organization. Use when starting a new project, resuming an existing one, or when a PLAN.md needs to be associated with a project directory. Creates the project folder structure (specs/, scripts/, notebooks/) and resolves project naming.

finetuning-setup

/finetuning-setup

Selects a base model and fine-tuning technique (SFT, DPO, or RLVR) for the user's use case by querying SageMaker Hub. Use when the user asks which model or technique to use, wants to start fine-tuning, or mentions a model name or family (e.g., "Llama", "Mistral") — always activate even for known model names because the exact Hub model ID must be resolved. Queries available models, validates technique compatibility, and confirms selections.

finetuning

/finetuning

Generates a Jupyter notebook that fine-tunes a base model using SageMaker serverless training jobs. Use when the user says "start training", "fine-tune my model", "I'm ready to train", or when the plan reaches the finetuning step. Supports SFT, DPO, and RLVR trainers, including RLVR Lambda reward function creation.

hyperpod-issue-report

/hyperpod-issue-report

Generates comprehensive issue reports from HyperPod clusters (EKS and Slurm) by collecting diagnostic logs and configurations for troubleshooting and AWS Support cases. Use when users need to collect diagnostics from HyperPod cluster nodes, generate issue reports for AWS Support, investigate node failures or performance problems, document cluster state, or create diagnostic snapshots. Triggers on requests involving issue reports, diagnostic collection, support case preparation, or cluster troubleshooting that requires gathering logs and system information from multiple nodes.

hyperpod-ssm

/hyperpod-ssm

Provides remote command execution and file transfer on SageMaker HyperPod cluster nodes via AWS Systems Manager (SSM). This is the primary interface for accessing HyperPod nodes, because direct SSH is not available. Use when any skill, workflow, or user request needs to execute commands on cluster nodes, upload files to nodes, read or download files from nodes, run diagnostics, install packages, or perform any operation requiring shell access to HyperPod instances. Other HyperPod skills depend on this skill for all node-level operations.

hyperpod-version-checker

/hyperpod-version-checker

Checks and compares software component versions on SageMaker HyperPod cluster nodes: NVIDIA drivers, CUDA toolkit, cuDNN, NCCL, EFA, AWS OFI NCCL, GDRCopy, MPI, Neuron SDK (Trainium/Inferentia), Python, and PyTorch. Use when checking component versions, verifying CUDA/driver compatibility, detecting version mismatches across nodes, planning upgrades, documenting cluster configuration, or troubleshooting version-related issues on HyperPod. Triggers on requests about versions, compatibility, component checks, or upgrade planning for HyperPod clusters.
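Detecting version mismatches across nodes can be sketched as a simple grouping step once per-node versions have been collected. The example below is an illustration only and assumes the versions are already available as a dictionary. The function name `find_version_mismatches`, the node names, and the version strings are all hypothetical, and how the skill actually gathers or compares versions is not described here.

```python
from collections import defaultdict


def find_version_mismatches(node_versions):
    """Return {component: {version: [nodes]}} for components whose version differs across nodes."""
    by_component = defaultdict(lambda: defaultdict(list))
    # Collect every component name reported by at least one node.
    components = {c for versions in node_versions.values() for c in versions}
    for node, versions in sorted(node_versions.items()):
        for component in components:
            # A node that did not report a component is grouped under "missing".
            by_component[component][versions.get(component, "missing")].append(node)
    # Keep only components where more than one distinct version was seen.
    return {
        component: dict(groups)
        for component, groups in by_component.items()
        if len(groups) > 1
    }


# Hypothetical input: node names and versions are illustrative values.
report = {
    "node-a": {"cuda": "12.4", "nccl": "2.21.5"},
    "node-b": {"cuda": "12.4", "nccl": "2.20.3"},
    "node-c": {"cuda": "12.4"},
}
mismatches = find_version_mismatches(report)
```

Here `mismatches` contains only the `nccl` entry, because every node reports the same `cuda` version while `nccl` differs on one node and is absent on another.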

model-deployment

/model-deployment

Generates a Jupyter notebook that deploys fine-tuned models from SageMaker Serverless Model Customization to SageMaker endpoints or Bedrock. Use when the user says "deploy my model", "create an endpoint", "make it available", or asks about deployment options. Identifies the correct deployment pathway (Nova vs OSS), generates deployment code, and handles endpoint configuration.

model-evaluation

/model-evaluation

Generates a Jupyter notebook that evaluates a fine-tuned SageMaker model using LLM-as-a-Judge. Use when the user says "evaluate my model", "how did my model perform", "compare models", or after a training job completes. Supports built-in and custom evaluation metrics, evaluation dataset setup, and judge model selection.

planning

/planning

Discovers user intent and generates a structured, step-by-step customization plan that orchestrates other skills. Always activate at the start of every conversation, when all tasks in a plan are completed, or when the user asks to modify the current plan. Handles intent discovery, plan generation, plan iteration, and mid-execution plan alterations. When in doubt, use this skill.

use-case-specification

/use-case-specification

Creates a reusable use case specification file that defines the business problem, stakeholders, and measurable success criteria for model customization, as recommended by the AWS Responsible AI Lens. Use as the default first step in any model customization plan. Skip only if the user explicitly declines or already has a use case specification to reuse. Captures problem statement, primary users, and LLM-as-a-Judge success tenets.

MCP Servers (1)

Connects to external services

aws-mcp

External

README

Agent Plugins for AWS

Read this in other languages: 日本語

Important: Generative AI can make mistakes. You should consider reviewing all output and costs generated by your chosen AI model and agentic coding assistant. See AWS Responsible AI Policy.

Agent Plugins for AWS equip AI coding agents with the skills to help you architect, deploy, and operate on AWS. Agent plugins are currently supported by Claude Code and Cursor.

AI coding agents are increasingly used in software development, helping developers write, review, and deploy code more efficiently. Agent skills and the broader agent plugin packaging model are emerging as best practices for steering coding agents toward reliable outcomes without bloating model context. Instead of repeatedly pasting long AWS guidance into prompts, developers can now encode that guidance as reusable, versioned capabilities that agents invoke when relevant. This improves determinism, reduces context overhead, and makes agent behavior easier to standardize across teams.

Agent plugins act as containers that package different types of expertise artifacts together. A single agent plugin can include:

Agent skills – Structured workflows and best-practice playbooks that guide AI through complex tasks like deployment, code review, or architecture planning. Agent skills encode domain expertise as step-by-step processes.
MCP servers – Connections to external services, data sources, and APIs. MCP servers give your assistant access to live documentation, pricing data, and other resources at runtime. Learn more about MCP servers for AWS.
Hooks – Automation and guardrails that run on developer actions. Hooks can validate changes, enforce standards, or trigger workflows automatically.
References – Documentation, configuration defaults, and knowledge that the agent skill can consult. References make agent skills smarter without bloating the prompt.

As new types of expertise artifacts emerge in this space, they can be packaged into agent plugins, making the evolution transparent to developers.

Best practices

To maximize the benefits of plugin-assisted development while maintaining security and code quality, follow these essential guidelines:

Always review generated code before deployment (for example, against your constraints for security, cost, and resilience).
Use plugins as accelerators, not replacements for developer judgment and expertise.
Keep plugins updated to benefit from the latest AWS best practices.
Follow the principle of least privilege when configuring AWS credentials.
Run security scanning tools on generated infrastructure code.

Plugins

Plugin	Description	Status
amazon-location-service	Add maps, geocoding, routing, places search, and geospatial features to applications with Amazon Location Service	Available
aws-amplify	Build full-stack apps with AWS Amplify Gen 2 using guided workflows for auth, data, storage, and functions	Available
aws-serverless	Build serverless applications with Lambda, API Gateway, EventBridge, Step Functions, and durable functions	Available
databases-on-aws	Database guidance for the AWS database portfolio — schema design, queries, migrations, and multi-tenant patterns	Some Services Available (Aurora DSQL)
deploy-on-aws	Deploy applications to AWS with architecture recommendations, cost estimates, and IaC deployment	Available
migration-to-aws	Migrate GCP infrastructure to AWS with resource discovery, architecture mapping, cost analysis, and execution planning	Available



Stats

Version: 1.0.0
Parent Repo Stars: 461
Parent Repo Forks: 57
Installs: 1
Maintenance: Good
License: Apache-2.0
Added: Apr 1, 2026


Available In

claude-plugins-official (17,685)
agent-plugins-for-aws (630)
ccode-personal-plugins

Safety Signals

Caution

External network access

Connects to servers outside your machine
