Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
npx claudepluginhub p988744/nlp-skills --plugin nlp-skillsUse this agent when the user needs help configuring data sources for training. This agent advises on data collection strategies and generates data_source.yaml configuration. Examples: <example> Context: User asks about getting training data user: "資料從哪裡來?" assistant: "[Uses Task tool to launch data-source-advisor agent to explore data source options]" <commentary> User is asking about data sourcing. Launch data-source-advisor to discuss options and create configuration. </commentary> </example> <example> Context: User mentions specific data source user: "我想從 PostgreSQL 資料庫拿標註資料" assistant: "[Uses Task tool to launch data-source-advisor agent to configure database connection]" <commentary> User has a specific data source in mind. Launch data-source-advisor to help configure the connection properly. </commentary> </example> <example> Context: User needs to generate synthetic data user: "資料不夠,可以用 GPT 生成嗎?" assistant: "[Uses Task tool to launch data-source-advisor agent to set up LLM data generation]" <commentary> User wants to use LLM for data augmentation. Launch data-source-advisor to configure synthetic data generation. </commentary> </example> <example> Context: User mentions web scraping user: "我想爬取金融新聞來訓練" assistant: "[Uses Task tool to launch data-source-advisor agent to configure web scraping]" <commentary> User wants to scrape web data. Launch data-source-advisor to set up crawling configuration. </commentary> </example>
Use this agent when the user expresses vague or unclear training needs. This agent proactively asks questions to clarify goals, pain points, and constraints. Examples: <example> Context: User mentions wanting to train a model without specific details user: "我想訓練一個模型" assistant: "[Uses Task tool to launch goal-clarifier agent to explore the user's needs]" <commentary> The user's request is vague - no task type, domain, or goal specified. Launch goal-clarifier to guide them through clarification. </commentary> </example> <example> Context: User mentions NLP task but lacks specifics user: "幫我做情感分析" assistant: "[Uses Task tool to launch goal-clarifier agent to understand the specific requirements]" <commentary> Sentiment analysis is broad - need to clarify: entity-level vs document-level, domain, labels, data availability. Launch goal-clarifier. </commentary> </example> <example> Context: User asks about training approach user: "fine-tune 還是 prompt engineering 比較好?" assistant: "[Uses Task tool to launch goal-clarifier agent to understand the context before recommending]" <commentary> The right approach depends on task, data, and constraints. Launch goal-clarifier to understand context before recommending. </commentary> </example>
Use this agent when the user reports performance issues or asks for help improving model quality. This agent diagnoses problems and recommends solutions. Examples: <example> Context: User reports low performance metric user: "F1 只有 72%,目標是 80%" assistant: "[Uses Task tool to launch problem-diagnoser agent to analyze the issue]" <commentary> User has a specific performance gap. Launch problem-diagnoser to analyze results and recommend improvements. </commentary> </example> <example> Context: User mentions a specific class performing poorly user: "中立類別的準確率很低" assistant: "[Uses Task tool to launch problem-diagnoser agent to investigate class imbalance]" <commentary> Single class underperforming suggests class imbalance or data quality issue. Launch problem-diagnoser to investigate. </commentary> </example> <example> Context: User suspects overfitting user: "訓練 loss 很低但測試效果不好" assistant: "[Uses Task tool to launch problem-diagnoser agent to check for overfitting]" <commentary> Classic overfitting symptom. Launch problem-diagnoser to verify and suggest remedies. </commentary> </example> <example> Context: User asks how to improve user: "怎麼提高模型效能?" assistant: "[Uses Task tool to launch problem-diagnoser agent to analyze current state and suggest improvements]" <commentary> General improvement request. Launch problem-diagnoser to analyze current performance and identify opportunities. </commentary> </example>
Use this agent after training or evaluation completes to analyze results, compare versions, and provide decision recommendations. Examples: <example> Context: Training just completed user: "訓練完成了,結果怎麼樣?" assistant: "[Uses Task tool to launch result-analyzer agent to analyze the training results]" <commentary> User wants to understand training results. Launch result-analyzer to provide comprehensive analysis. </commentary> </example> <example> Context: User wants to compare versions user: "v1 和 v2 哪個比較好?" assistant: "[Uses Task tool to launch result-analyzer agent to compare version performance]" <commentary> User wants version comparison. Launch result-analyzer to analyze both versions and recommend. </commentary> </example> <example> Context: Evaluation completed user: "評估報告出來了" assistant: "[Uses Task tool to launch result-analyzer agent to interpret the evaluation report]" <commentary> Evaluation just completed. Launch result-analyzer to interpret results and provide insights. </commentary> </example> <example> Context: User needs to decide next steps user: "該繼續訓練還是部署?" assistant: "[Uses Task tool to launch result-analyzer agent to analyze current state and recommend]" <commentary> User at decision point. Launch result-analyzer to assess readiness and recommend action. </commentary> </example>
This skill should be used when the user asks to "configure data source", "data from database", "fetch data from API", "scrape web data", "generate training data with LLM", "regenerate data", "data pipeline", "where does data come from", or needs to set up reproducible data collection. Provides data source configuration, reproducibility tracking, and data regeneration capabilities.
This skill should be used when the user asks to "execute plan", "run the plan", "start executing", "continue plan execution", or has a plan file ready to execute. Manages phased implementation with batch execution and checkpoint reviews.
LLM fine-tuning 教練式引導工作流程 v2。 核心功能:主動探索使用者痛點、引導明確目標、多任務管理、資料來源追蹤、完整版本 lineage。 支援:LoRA/QLoRA/DoRA 微調、SFT/ORPO/DPO 對齊、資料準備、Benchmark 評估、HuggingFace 部署。 特色:教練式引導、可重現的資料管線、多任務版本追蹤。 觸發詞:「訓練模型」「fine-tune」「微調」「LoRA」「建立新任務」「改善模型」「優化準確率」「資料管線」「任務管理」
This skill should be used when the user asks to "train a model", "fine-tune", "build NLP model", "create training task", "optimize model performance", "improve accuracy", "what model should I use", or expresses vague training needs like "I want to do sentiment analysis" or "help me with NER". Provides coaching-style guidance to clarify goals, diagnose pain points, and recommend optimal training approaches.
This skill should be used when the user asks "what is LoRA", "compare models", "which model is best for Chinese", "SFT vs DPO", "how to handle overfitting", "class imbalance solution", "model architecture", "training method comparison", or needs reference information about LLM fine-tuning. Provides structured knowledge base for models, methods, architectures, and troubleshooting.
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
LLM post-training — unified interface for SFT, OSFT, LoRA fine-tuning, and GRPO reinforcement learning
Train task-specific small language models (SLMs) using the Distil Labs CLI and platform
Transfer learning adaptation
Agent Skills for AI/ML tasks including dataset creation, model training, evaluation, and research paper publishing on Hugging Face Hub
Skills for fine-tuning language models with the Tinker API — research, debugging, and more.
ML engineering plugin: Give your AI coding agent ML engineering superpowers.
No model invocation
Executes directly as bash, bypassing the AI model
No model invocation
Executes directly as bash, bypassing the AI model
Share bugs, ideas, or general feedback.
Claude Code plugin for NLP tasks - LLM fine-tuning with coaching-style guidance workflow.
# Add nlp-skills marketplace
/plugin marketplace add p988744/nlp-skills
# Install
/plugin install nlp-skills
claude --plugin-dir /path/to/nlp-skills
| Skill | Triggers | Description |
|---|---|---|
| llm-coach | "train model", "fine-tune", "optimize" | Coaching guidance entry point |
| llm-knowledge | "what is LoRA", "compare models" | Knowledge base |
| task-manager | "list tasks", "compare versions" | Multi-task management |
| data-pipeline | "data source", "where does data come from" | Data pipeline configuration |
| writing-plans | "write a plan", "create training plan" | Plan-based task tracking |
| executing-plans | "execute plan", "run the plan" | Batch execution with checkpoints |
| finetune-llm | "fine-tune LLM", "training workflow" | Overview skill |
| Command | Description |
|---|---|
/nlp-skills:coach | Start coaching dialogue |
/nlp-skills:tasks | List all task status |
/nlp-skills:new-task | Create new task |
/nlp-skills:data-source | Configure data sources |
/nlp-skills:generate | Generate project structure |
/nlp-skills:write-plan | Write detailed execution plan |
/nlp-skills:execute-plan | Execute plan with checkpoint reviews |
/nlp-skills:evaluate | Run evaluation analysis |
/nlp-skills:deploy | Deploy model |
| Agent | Trigger | Function |
|---|---|---|
| goal-clarifier | Vague requirements | Proactively clarify goals |
| data-source-advisor | Data source questions | Help configure data pipelines |
| problem-diagnoser | Performance issues | Auto-diagnose and recommend fixes |
| result-analyzer | Post-training/evaluation | Analyze results, decision support |
# Coaching guidance
"I want to train a model"
# Direct creation
/nlp-skills:new-task entity-sentiment
# List tasks
/nlp-skills:tasks
1. Start coaching → Clarify goals, pain points, resources
2. Configure data source → Set up DB, API, scraping, LLM generation
3. Generate project → Create scripts, configs, docs
4. Prepare data → Run data generation scripts
5. Train model → Execute training scripts
6. Evaluate performance → Analyze results, compare versions
7. Deploy → HuggingFace, Ollama
Each task is a fully independent, self-contained project:
{task-name}/
├── task.yaml # Task definition
├── data_source.yaml # Data source config (reproducible)
├── plans/ # Execution plans (plan-based tracking)
│ └── YYYY-MM-DD-goal.md
├── versions/ # Version tracking (full lineage)
│ ├── v1/
│ │ ├── config.yaml
│ │ ├── data_snapshot.json
│ │ ├── results.json
│ │ └── lineage.yaml
│ └── v2/
├── data/
├── scripts/
├── configs/
├── models/
└── benchmarks/
Core feature of v0.3 - reproducible data pipelines:
# data_source.yaml
sources:
- type: database
connection: postgresql://...
query: "SELECT text, label FROM annotations"
- type: api
endpoint: https://api.example.com/data
- type: web_scrape
urls: ["https://..."]
keywords: ["finance", "stock"]
- type: llm_generated
model: gpt-4o
count: 500
Reduce web searches with built-in 2025-2026 knowledge:
| Category | Content |
|---|---|
| Architecture | Dense vs MoE, MLA |
| Base Models | Qwen3, DeepSeek-V3/R1, Llama 3.3 |
| Training Methods | SFT, LoRA, QLoRA, ORPO, DPO |
| Task Types | Sentiment Analysis, NER, Relation Extraction |
| Troubleshooting | Overfitting, Class Imbalance, Low Accuracy |
claude --plugin-dir .
claude --debug --plugin-dir .
./scripts/validate-plugin.sh