Skill

delegate-to-ollama

Delegate token-heavy coding work to a local LLM (Ollama / LM Studio) via tunaLlama. Use this when the user asks for code generation, file review, refactoring, or any task where the output would be long. Saves tokens by running heavy generation locally while you maintain oversight.

npx claudepluginhub hang-in/tunallama --plugin tunaLlama

Invocation

Slash command: /tunaLlama:delegate-to-ollama

Stats

Language: Python
Parent stars: 20
Maintenance: Excellent
Last commit: May 10, 2026

When to use tunaLlama tools

You have access to tuna_* MCP tools backed by a local LLM. Use them when:

The user asks for code generation and you have clear requirements. Use tuna_generate_code instead of generating the code yourself.

The user asks to review or analyze a file. Use tuna_review_file (passing the path) instead of reading the file first. The file content then stays out of your context, which is where most of the token savings come from.

The user asks for refactoring or test writing with a defined scope. Use tuna_refactor_code or tuna_write_tests.

The user asks a question about multiple files. Use tuna_analyze_files so file contents bypass your context.
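The routing above can be sketched as a small lookup table. This is only an illustration: the tool names come from the text, but the `Task` enum and `pick_tool` helper are hypothetical and not part of tunaLlama.

```python
from enum import Enum


class Task(Enum):
    """Hypothetical task categories, one per case described above."""
    GENERATE = "generate"
    REVIEW_FILE = "review_file"
    REFACTOR = "refactor"
    WRITE_TESTS = "write_tests"
    ANALYZE_FILES = "analyze_files"


# Tool names are taken from the skill text; the mapping itself is illustrative.
TOOL_FOR_TASK = {
    Task.GENERATE: "tuna_generate_code",
    Task.REVIEW_FILE: "tuna_review_file",
    Task.REFACTOR: "tuna_refactor_code",
    Task.WRITE_TESTS: "tuna_write_tests",
    Task.ANALYZE_FILES: "tuna_analyze_files",
}


def pick_tool(task: Task) -> str:
    """Return the name of the tuna_* tool suggested for a task category."""
    return TOOL_FOR_TASK[task]
```

For example, `pick_tool(Task.REVIEW_FILE)` returns the string `"tuna_review_file"`. The arguments each tool accepts are not specified here and are left out of the sketch.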

When NOT to delegate

Tasks requiring deep judgment about architecture or design. Keep these yourself.

Short snippets (under roughly 10 lines), where the delegation overhead exceeds the savings.

Tasks that depend on recent conversation context the local LLM does not have.

Anything safety-critical, or anything that hinges on interpreting the user's intent.
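These exclusions can be read as a simple gate applied before any delegation. The sketch below assumes a hypothetical `Request` record with hand-set flags. In practice these are judgment calls rather than booleans, and the 10-line threshold only approximates the "~10 lines" figure above.

```python
from dataclasses import dataclass

# Approximates the "< ~10 lines" rule of thumb; not a value defined by tunaLlama.
MIN_DELEGATE_LINES = 10


@dataclass
class Request:
    """Hypothetical summary of a user request, filled in by the caller."""
    expected_lines: int
    needs_design_judgment: bool = False
    needs_conversation_context: bool = False
    safety_critical: bool = False
    intent_ambiguous: bool = False


def should_delegate(req: Request) -> bool:
    """Return True only if none of the 'do not delegate' conditions apply."""
    if (
        req.needs_design_judgment
        or req.needs_conversation_context
        or req.safety_critical
        or req.intent_ambiguous
    ):
        return False
    # Short outputs are cheaper to write directly than to delegate.
    return req.expected_lines >= MIN_DELEGATE_LINES
```

With these definitions, a 200-line generation task with no flags set is delegated, while a 5-line snippet or any safety-critical request is not.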

Standard pattern: delegate then verify

Decompose the user's request into clear instructions for the local LLM.
Call the appropriate tuna_* tool.
Review the returned output. Catch obvious problems.
If the output looks wrong, call tuna_fix_code with the error description.
Present the verified result to the user.
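The five steps above amount to a generate, check, fix loop. The following is a minimal sketch under stated assumptions: `call_tool`, `fix_tool`, and `check` are hypothetical callables supplied by the caller (standing in for a tuna_* tool, tuna_fix_code, and your own review), and the `max_fixes` cap is an added safeguard that the skill text does not specify.

```python
from typing import Callable, Optional


def delegate_then_verify(
    instructions: str,
    call_tool: Callable[[str], str],
    fix_tool: Callable[[str, str], str],
    check: Callable[[str], Optional[str]],
    max_fixes: int = 2,
) -> str:
    """Delegate, review the output, and request fixes until it passes.

    check() returns None when the output looks fine, or a short error
    description otherwise. All three callables are placeholders; the
    real tuna_* tool signatures are not assumed here.
    """
    output = call_tool(instructions)          # step 2: call the tool
    for _ in range(max_fixes):
        problem = check(output)               # step 3: review the output
        if problem is None:
            return output                     # step 5: verified result
        output = fix_tool(output, problem)    # step 4: ask for a fix
    # Review the result of the last fix attempt before giving up.
    if check(output) is not None:
        raise RuntimeError("output still fails review after fixes")
    return output
```

Injecting the tool calls as plain functions keeps the loop testable with stubs and avoids committing to any particular MCP client.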

Recall

Before starting non-trivial work in a familiar codebase, consider calling tuna_recall with keywords from the current request. Past delegations on the same codebase often surface useful prior decisions. Korean queries also work, because the backend uses Kiwi morpheme indexing.
