Plugin

unsloth-buddy

Name: unsloth-buddy
Author: tyh-labs

Fine-tune LLMs end-to-end on NVIDIA GPUs with Unsloth or Apple Silicon with mlx-tune. Interview for requirements, prepare data, set up environment, run LoRA training (SFT/DPO/GRPO/vision), evaluate models, generate demos, and export for deployment.

Python

Hugging Face

ai-ml

npx claudepluginhub tyh-labs/unsloth-buddy --plugin unsloth-buddy

Component Overview

Agents

Component Details

Agents (1)

unsloth-buddy — Agent Definition

/AGENTS

> This file is read by [OpenClaw](https://github.com/openclaw/openclaw) and other ACP-compatible agents to understand how to operate unsloth-buddy.

README

unsloth-buddy

/unsloth-buddy I have 500 customer support Q&As and want to fine-tune a summarization model. I only have a MacBook Air.

English | 简体中文 | 繁體中文

What is this?

A fine-tuning agent that talks like a colleague. Describe your goal, and it asks the right questions, finds or formats your data, picks the right technique and model, trains on your hardware, validates the result, and packages it for deployment.

Runs on NVIDIA GPUs via Unsloth, natively on Apple Silicon via mlx-tune, and on free cloud GPUs via colab-mcp. Part of the Gaslamp AI development platform — docs.

One sentence, one fine-tuned model.

You: Fine-tune a small model on my customer support FAQ. I have a CSV file.

[Phase 0] Creating project: customer_faq_sft_2026_03_17/
[Phase 1] Requirements interview...
           Method: SFT   Model: Qwen2.5-0.5B   Deploy: Ollama
[Phase 2] Data strategy...
           Loading 1,200 rows from faq.csv → reformatting as chat messages
           Saved to data/train.jsonl (validated: messages column ✓)
[Phase 3] Environment: Apple M4 24GB, mlx-tune 0.4.3, Python 3.12
           Ready for training
[Phase 4] Training... 200 steps
           Final loss: 1.42 → saved to outputs/adapters/
[Phase 5] Evaluation (base vs fine-tuned, greedy decoding):
           Q: How do I reset my password?
           [Base]    I can help with that. Which password?
           [Fine-tuned]  Go to the login page → click "Forgot password" → check your email.
[Phase 6] Export → outputs/model-q4_k_m.gguf
           Run: ollama create my-faq-bot -f Modelfile && ollama run my-faq-bot

One conversation, seven phases, one deployable model — and a shareable demo page.

Quick Start

This skill includes sub-skills and utility scripts — install the full repository, not a single file.

Claude Code (recommended)

/plugin marketplace add TYH-labs/unsloth-buddy
/plugin install unsloth-buddy@TYH-labs/unsloth-buddy

Then describe what you want to fine-tune. The skill activates automatically.

Gemini CLI

gemini extensions install https://github.com/TYH-labs/unsloth-buddy --consent

Any agent supporting the Agent Skills standard

git clone https://github.com/TYH-labs/unsloth-buddy.git .agents/skills/unsloth-buddy

How is it different?

Most tools assume you already know what to do. This one doesn't.

View full README on GitHub

Similar Plugins

itsmostafa-llm-engineering-skills

Specialized skills for LLM engineering tasks including model training, evaluation, fine-tuning, and deployment optimization.

3mo

v1.0.0

Stats

Version1.0.0

Stars203

Forks12

MaintenanceExcellent

AddedMar 28, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Available In

unsloth-buddy222

Safety Signals

Caution

Uses power tools

Uses Bash, Write, or Edit tools

Help us improve

Share bugs, ideas, or general feedback.

Back to Plugins

unsloth-buddy

/unsloth-buddy I have 500 customer support Q&As and want to fine-tune a summarization model. I only have a MacBook Air.

English | 简体中文 | 繁體中文

What is this?

Runs on NVIDIA GPUs via Unsloth, natively on Apple Silicon via mlx-tune, and on free cloud GPUs via colab-mcp. Part of the Gaslamp AI development platform — docs.

One sentence, one fine-tuned model.

You: Fine-tune a small model on my customer support FAQ. I have a CSV file.

[Phase 0] Creating project: customer_faq_sft_2026_03_17/
[Phase 1] Requirements interview...
           Method: SFT   Model: Qwen2.5-0.5B   Deploy: Ollama
[Phase 2] Data strategy...
           Loading 1,200 rows from faq.csv → reformatting as chat messages
           Saved to data/train.jsonl (validated: messages column ✓)
[Phase 3] Environment: Apple M4 24GB, mlx-tune 0.4.3, Python 3.12
           Ready for training
[Phase 4] Training... 200 steps
           Final loss: 1.42 → saved to outputs/adapters/
[Phase 5] Evaluation (base vs fine-tuned, greedy decoding):
           Q: How do I reset my password?
           [Base]    I can help with that. Which password?
           [Fine-tuned]  Go to the login page → click "Forgot password" → check your email.
[Phase 6] Export → outputs/model-q4_k_m.gguf
           Run: ollama create my-faq-bot -f Modelfile && ollama run my-faq-bot

One conversation, seven phases, one deployable model — and a shareable demo page.

Quick Start

This skill includes sub-skills and utility scripts — install the full repository, not a single file.

Claude Code (recommended)

/plugin marketplace add TYH-labs/unsloth-buddy
/plugin install unsloth-buddy@TYH-labs/unsloth-buddy

Then describe what you want to fine-tune. The skill activates automatically.

Gemini CLI

gemini extensions install https://github.com/TYH-labs/unsloth-buddy --consent

Any agent supporting the Agent Skills standard

git clone https://github.com/TYH-labs/unsloth-buddy.git .agents/skills/unsloth-buddy

How is it different?

Most tools assume you already know what to do. This one doesn't.

unsloth-buddy

Component Overview

Component Details

Agents (1)

README

unsloth-buddy

What is this?

One sentence, one fine-tuned model.

Quick Start

How is it different?

Similar Plugins

itsmostafa-llm-engineering-skills

Help us improve

Help us improve

unsloth-buddy

Component Overview

Component Details

Agents (1)

README

unsloth-buddy

What is this?

One sentence, one fine-tuned model.

Quick Start

How is it different?

Similar Plugins

itsmostafa-llm-engineering-skills

Help us improve

tinker

transfer-learning-adapter

superml

huggingface-skills

caveman