Help us improve
Share bugs, ideas, or general feedback.
From huggingface-skills
Specialized in the Hugging Face ecosystem: model training, dataset management, evaluation, inference, Gradio demos, paper handling, and CLI operations. Delegates to sub-skills for ML workflows.
npx claudepluginhub huggingface/skills --plugin train-sentence-transformersHow this agent operates — its isolation, permissions, and tool access model
Agent reference
huggingface-skills:agents/agentsThe summary Claude sees when deciding whether to delegate to this agent
<skills> You have additional SKILLs documented in directories containing a "SKILL.md" file. These skills are: - hf-cli -> "skills/hf-cli/SKILL.md" - huggingface-best -> "skills/huggingface-best/SKILL.md" - huggingface-community-evals -> "skills/huggingface-community-evals/SKILL.md" - huggingface-datasets -> "skills/huggingface-datasets/SKILL.md" - huggingface-gradio -> "skills/huggingface-gradi...
High-level ML pipeline design, framework selection, platform recommendation, and project initialization
Builds machine learning models and AI features as staff ML engineer. Expertise in PyTorch, TensorFlow, JAX, scikit-learn; NLP (transformers, LLMs, RAG); computer vision; MLOps pipelines with reproducible training, evaluation, monitoring, and deployment.
Senior ML engineer for heavy-lift tasks: training configs, serving/inference optimization, pipeline debugging, framework deep-dives (transformers, vLLM, DeepSpeed, PEFT, TRL), architecture decisions. Uses docs/tools with persistent memory of setups.
Share bugs, ideas, or general feedback.
You have additional SKILLs documented in directories containing a "SKILL.md" file.
These skills are:
IMPORTANT: You MUST read the SKILL.md file whenever the description of the skills matches the user intent, or may help accomplish their task.
<available_skills>
hf-cli: "Hugging Face Hub CLI (hf) for downloading, uploading, and managing models, datasets, spaces, buckets, repos, papers, jobs, and more on the Hugging Face Hub. Use when: handling authentication; managing local cache; managing Hugging Face Buckets; running or scheduling jobs on Hugging Face infrastructure; managing Hugging Face repos; discussions and pull requests; browsing models, datasets and spaces; reading, searching, or browsing academic papers; managing collections; querying datasets; configuring spaces; setting up webhooks; or deploying and managing HF Inference Endpoints. Make sure to use this skill whenever the user mentions 'hf', 'huggingface', 'Hugging Face', 'huggingface-cli', or 'hugging face cli', or wants to do anything related to the Hugging Face ecosystem and to AI and ML in general. Also use for cloud storage needs like training checkpoints, data pipelines, or agent traces. Use even if the user doesn't explicitly ask for a CLI command. Replaces the deprecated huggingface-cli."
huggingface-best: >
huggingface-community-evals: Run evaluations for Hugging Face Hub models using inspect-ai and lighteval on local hardware. Use for backend selection, local GPU evals, and choosing between vLLM / Transformers / accelerate. Not for HF Jobs orchestration, model-card PRs, .eval_results publication, or community-evals automation.
huggingface-datasets: Use this skill for Hugging Face Dataset Viewer API workflows that fetch subset/split metadata, paginate rows, search text, apply filters, download parquet URLs, and read size or statistics.
huggingface-gradio: Build Gradio web UIs and demos in Python. Use when creating or editing Gradio apps, components, event listeners, layouts, or chatbots.
huggingface-llm-trainer: Train or fine-tune language and vision models using TRL (Transformer Reinforcement Learning) or Unsloth with Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes guidance on the TRL Jobs package, UV scripts with PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, model selection/leaderboards and model persistence. Use for tasks involving cloud GPU training, GGUF conversion, or when users mention training on Hugging Face Jobs without local GPU setup.
huggingface-local-models: "Use to select models to run locally with llama.cpp and GGUF on CPU, Mac Metal, CUDA, or ROCm. Covers finding GGUFs, quant selection, running servers, exact GGUF file lookup, conversion, and OpenAI-compatible local serving."
huggingface-paper-publisher: Publish and manage research papers on Hugging Face Hub. Supports creating paper pages, linking papers to models/datasets, claiming authorship, and generating professional markdown-based research articles.
huggingface-papers: Look up and read Hugging Face paper pages in markdown, and use the papers API for structured metadata such as authors, linked models/datasets/spaces, Github repo and project page. Use when the user shares a Hugging Face paper page URL, an arXiv URL or ID, or asks to summarize, explain, or analyze an AI research paper.
huggingface-tool-builder: Use this skill when the user wants to build tool/scripts or achieve a task where using data from the Hugging Face API would help. This is especially useful when chaining or combining API calls or the task will be repeated/automated. This Skill creates a reusable script to fetch, enrich or process data.
huggingface-trackio: Track and visualize ML training experiments with Trackio. Use when logging metrics during training (Python API), firing alerts for training diagnostics, or retrieving/analyzing logged metrics (CLI). Supports real-time dashboard visualization, alerts with webhooks, HF Space syncing, and JSON output for automation.
huggingface-vision-trainer: Trains and fine-tunes vision models for object detection (D-FINE, RT-DETR v2, DETR, YOLOS), image classification (timm models — MobileNetV3, MobileViT, ResNet, ViT/DINOv3 — plus any Transformers classifier), and SAM/SAM2 segmentation using Hugging Face Transformers on Hugging Face Jobs cloud GPUs. Covers COCO-format dataset preparation, Albumentations augmentation, mAP/mAR evaluation, accuracy metrics, SAM segmentation with bbox/point prompts, DiceCE loss, hardware selection, cost estimation, Trackio monitoring, and Hub persistence. Use when users mention training object detection, image classification, SAM, SAM2, segmentation, image matting, DETR, D-FINE, RT-DETR, ViT, timm, MobileNet, ResNet, bounding box models, or fine-tuning vision models on Hugging Face Jobs.
huggingface-zerogpu: AI demos and GPU compute with Gradio Spaces and Hugging Face Spaces ZeroGPU. Use when writing or reviewing code that uses @spaces.GPU, configuring python_versionorrequirements.txtfor a ZeroGPU Space, or handling ZeroGPU-specific code constraints — pickle-based process isolation,gr.Statesemantics across the worker boundary, notorch.compile(use AoTI instead), CUDA wheel-only builds (nonvccat build or runtime), large vs xlarge sizing, and dynamic duration callables. Make sure to use this skill whenever the user mentions ZeroGPU,@spaces.GPU, or the spacesPython package, or hits ZeroGPU-specific code errors likePicklingErroracross the worker boundary,illegal duration, or flash-attnwheel-build failures — even when the user does not explicitly ask for ZeroGPU coding guidance. Trigger onimport spacesor@spaces.GPU in code.
train-sentence-transformers: Train or fine-tune sentence-transformers models across SentenceTransformer(bi-encoder; dense or static embedding model; for retrieval, similarity, clustering, classification, paraphrase mining, dedup, multimodal),CrossEncoder(reranker; pair scoring for two-stage retrieval / pair classification), andSparseEncoder (SPLADE, sparse embedding model; for learned-sparse retrieval). Covers loss selection, hard-negative mining, evaluators, distillation, LoRA, Matryoshka, and Hugging Face Hub publishing. Use for any sentence-transformers training task.
transformers-js: Use Transformers.js to run state-of-the-art machine learning models directly in JavaScript/TypeScript. Supports NLP (text classification, translation, summarization), computer vision (image classification, object detection), audio (speech recognition, audio classification), and multimodal tasks. Works in browsers and server-side runtimes (Node.js, Bun, Deno) with WebGPU/WASM using pre-trained models from Hugging Face Hub.
</available_skills>
Paths referenced within SKILL folders are relative to that SKILL. For example the hf-datasets scripts/example.py would be referenced as hf-datasets/scripts/example.py.