From sagemaker-ai
Generates Jupyter notebook for fine-tuning LLMs on SageMaker serverless jobs. Supports SFT, DPO, RLVR trainers with Lambda reward functions.
npx claudepluginhub awslabs/agent-plugins --plugin sagemaker-aiThis skill uses the workspace's default tool permissions.
Before starting this workflow, verify:
Generates design tokens/docs from CSS/Tailwind/styled-components codebases, audits visual consistency across 10 dimensions, detects AI slop in UI.
Records polished WebM UI demo videos of web apps using Playwright with cursor overlay, natural pacing, and three-phase scripting. Activates for demo, walkthrough, screen recording, or tutorial requests.
Delivers idiomatic Kotlin patterns for null safety, immutability, sealed classes, coroutines, Flows, extensions, DSL builders, and Gradle DSL. Use when writing, reviewing, refactoring, or designing Kotlin code.
Before starting this workflow, verify:
A use_case_spec.md file exists
use-case-specification skill first, then resumeA fine-tuning technique (SFT, DPO, or RLVR) and base model have already been selected
finetuning-setup skill to collect what's missing, then resumeA base model name available on SageMakerHub has been identified
finetuning-setup skill to get itfinetuning-setup retrieves, as it may differ from other commonly used names for the same model[title]_finetuning.ipynb
[title] = snake_case name derived from use caseRead the example notebook matching the finetuning strategy:
references/sft_example.mdreferences/dpo_example.mdreferences/rlvr_example.md[title]_finetuning.ipynbIn the 'Setup & Credentials' cell, populate:
BASE_MODEL
MODEL_PACKAGE_GROUP_NAME
use_case_spec.md if needed)[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}customer-support-chatbot-v1Save notebook
references/rlvr_reward_function.md section "Helping Users Create Lambda Functions"CUSTOM_REWARD_FUNCTION in the Notebook with the ARN of the reward function (either given directly by the user, or from the function generation code as evaluator.arn).ACCEPT_EULA to True in the notebook after reviewing the licenseA Jupyter notebook has now been generated which will help you finetune your model. You are free to run it now. Please let me know once the training is complete.CRITICAL:
rlvr_reward_function.md - Lambda reward function creation guide (RLVR only)templates/rlvr_reward_function_source_template.py - Lambda reward function source template for open-weights models (RLVR only)templates/nova_rlvr_reward_function_source_template.py - Lambda reward function source template for Nova 2.0 Lite (RLVR only)sft_example.md - Complete notebook template for Supervised Fine-Tuningdpo_example.md - Complete notebook template for Direct Preference Optimizationrlvr_example.md - Complete notebook template for Reinforcement Learning from Verifiable Rewards