By emasoft
MCP server that offloads bounded LLM tasks from Claude Code to cheaper local (LM Studio, Ollama, vLLM, llama.cpp) or remote (OpenRouter) models. Profile-based configuration with ensemble mode.
npx claudepluginhub emasoft/emasoft-plugins --plugin llm-externalizer

This plugin requires configuration values that are prompted for when the plugin is enabled. Sensitive values are stored in your system keychain.
openrouter_api_key
OpenRouter API key for remote and ensemble modes (https://openrouter.ai/keys). Stored in the system keychain. If left blank, the plugin falls back to $OPENROUTER_API_KEY from your shell environment, so leave it blank to keep an existing shell-based setup.
${user_config.openrouter_api_key}

Benchmark OpenRouter programming-category models against a TypeScript classification task. Filters candidates by cost and capability, scores each against 71 fixture functions plus 3 literal keywords, and writes a markdown comparison report. Use this to pick the cheapest model that still passes the real workload.
Interactively pick a new 3-model OpenRouter ensemble for the active profile. Runs the benchmark (or reuses a cached one), presents a menu per slot (first / second / third), shows the new ensemble's cost vs the last accepted snapshot, and on confirmation atomically updates ~/.llm-externalizer/settings.yaml. Never touches the profile's mode (local/remote/remote-ensemble) — that stays as the user configured it.
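The atomic update of ~/.llm-externalizer/settings.yaml can be sketched with the standard write-temp-then-rename pattern (a minimal illustration, not the plugin's actual code):

```python
import os
import tempfile

def atomic_write(path: str, text: str) -> None:
    """Write `text` to `path` atomically: write a temp file in the same
    directory, fsync it, then rename it over the target. os.replace is
    atomic on POSIX, so readers never observe a half-written settings file."""
    directory = os.path.dirname(path) or "."
    fd, tmp = tempfile.mkstemp(dir=directory, prefix=".settings-")
    try:
        with os.fdopen(fd, "w") as f:
            f.write(text)
            f.flush()
            os.fsync(f.fileno())
        os.replace(tmp, path)  # atomic rename over the old file
    except BaseException:
        os.unlink(tmp)
        raise
```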
Inspect LLM Externalizer profile configuration. Read-only: model and profile changes are user-only, made via manual YAML editing.
Check LLM Externalizer health, active profile, model, auth status, and context window
Aggregate unfixed findings across every report in `./reports/llm-externalizer/` and fix each via a fresh serial-fixer subagent (sonnet/opus menu). Optional `@merged-report.md` scopes the loop to one report.
Fix findings in ONE existing per-file scan report. Picks sonnet or opus via a menu, dispatches a single parallel-fixer subagent, and returns its `.fixer.`-tagged summary path. For whole-folder audits use `/llm-externalizer:llm-externalizer-scan-and-fix`.
Install the LLM Externalizer multi-tier Claude Code statusline (model + context bar + MCP tokens/cost + OpenRouter credits + 5h/7d limits, with width-aware tiering and per-section error isolation).
Predict cost, time, and cap-skipped counts for a fieldset against the registered files. Honors --budget-usd as a hard gate. Phase 3 of the pipeline.
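The --budget-usd hard gate can be illustrated with a minimal greedy sketch (the pricing model and skip policy here are assumptions for illustration, not the plugin's real estimator):

```python
def plan_under_budget(file_tokens, price_per_mtok, budget_usd):
    """Sketch of a hard budget gate: walk files in registration order,
    accumulate per-file cost (tokens / 1M * price), and cap-skip every
    file that would push the running total past budget_usd.
    Returns (planned_files, skipped_files, total_cost_usd)."""
    planned, skipped, total = [], [], 0.0
    for name, tokens in file_tokens:
        cost = tokens / 1_000_000 * price_per_mtok
        if total + cost <= budget_usd:
            planned.append(name)
            total += cost
        else:
            skipped.append(name)  # cap-skipped: would exceed the hard gate
    return planned, skipped, round(total, 6)
```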
Dump every result row of a mass-scouting job to JSONL or CSV under reports/mass_scouting/. Useful for follow-up analysis in pandas, jq, etc.
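A follow-up analysis over an exported JSONL file might look like this stdlib-only sketch (the `bucket` field name is assumed from the classifier's bucket list; real export rows may differ):

```python
import json
from collections import Counter

def bucket_counts(jsonl_text: str) -> Counter:
    """Tally result rows per classifier bucket from a JSONL export.
    Blank lines are skipped; rows without a 'bucket' field count as
    'unknown', mirroring the classifier's fallback bucket."""
    rows = (json.loads(line) for line in jsonl_text.splitlines() if line.strip())
    return Counter(row.get("bucket", "unknown") for row in rows)
```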
Print one file row from the mass-scouting registry by short_id. Optionally include the result row for a specific job_id.
Run the cheap script-only file classifier across registered files. Assigns each file a bucket (binary / sourcecode / documentation / config / log / rules_to_eval / has_frontmatter / unknown). Phase 2 of the pipeline.
Register a folder (or explicit file list) into the mass-scouting SQLite registry. Phase 1 of the mass-scouting pipeline.
Cross-job federated search across multiple mass-scouting jobs. Same query semantics as mass-scout-search; results are tagged with the originating job_id and merged by bm25 rank.
Per-job search across mass-scouting results. Three modes auto-routed: regex (for trivial queries — emails, urls, ipv4, etc.), FTS5 keyword search, and structured JSON1 path filters. Phase 5 of the pipeline.
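The auto-routing between the three modes can be sketched as follows (the detection heuristics are illustrative assumptions, not the plugin's actual routing rules):

```python
import re

# Detectors for the trivial patterns named above; illustrative only.
TRIVIAL = {
    "email": re.compile(r"^[\w.+-]+@[\w-]+\.[\w.-]+$"),
    "ipv4": re.compile(r"^(\d{1,3}\.){3}\d{1,3}$"),
    "url": re.compile(r"^https?://\S+$"),
}

def route_query(query: str) -> str:
    """Pick a search mode for a query: JSON1 path filters for structured
    '$.'-prefixed queries, the regex fast path for trivial literal
    patterns, and FTS5 keyword search for everything else."""
    if query.startswith("$."):
        return "json1"
    if any(pat.match(query) for pat in TRIVIAL.values()):
        return "regex"
    return "fts5"
```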
Run the LLM scout end-to-end on every eligible file. Compiles the fieldset to a JSON Schema, fans calls out via the worker pool, repairs + validates each response, persists to SQLite, writes a markdown report. Phase 4 of the pipeline.
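The repair step before validation can be illustrated with a minimal JSON repair pass (a sketch of two common LLM-output fixes, not the plugin's actual repair logic):

```python
import json
import re

def repair_llm_json(raw: str):
    """Repair common defects in LLM-produced JSON before schema
    validation: strip surrounding markdown code fences and remove
    trailing commas before closing brackets, then parse."""
    text = raw.strip()
    text = re.sub(r"^```(?:json)?\s*|\s*```$", "", text)
    text = re.sub(r",\s*([}\]])", r"\1", text)  # drop trailing commas
    return json.loads(text)
```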
Scan a codebase, aggregate findings into one canonical bug list, then fix each bug serially with a sonnet- or opus-model serial-fixer subagent. Use when fixes mutate shared state or bug order matters.
Two-stage codebase audit. LLM Externalizer scan produces one report per file; parallel sonnet- or opus-model fixer subagents (≤15 concurrent) verify and fix each finding. Orchestrator never reads scan or fixer content — only report paths.
Scan a codebase (same language as the input files) for an existing implementation of a described feature. FFD-batched ensemble calls, exhaustive per-file output. Works for PR duplicate-check and greenfield audits.
Opus-model variant. Verify and fix ONE LLM Externalizer per-file bug report. Input is a single absolute path to a report `.md`. Validates findings, applies minimal fixes only to REAL bugs, runs linters, writes a `.fixer.`-tagged summary, returns the summary path. Dispatched in parallel by `llm-externalizer-scan-and-fix` when the user picks "opus" on the model-menu prompt.
Sonnet-model variant. Verify and fix ONE LLM Externalizer per-file bug report. Input is a single absolute path to a report `.md`. Validates findings, applies minimal fixes only to REAL bugs, runs linters, writes a `.fixer.`-tagged summary, returns the summary path. Dispatched in parallel by `llm-externalizer-scan-and-fix` when the user picks "sonnet" on the model-menu prompt.
Use for a fast code review from the LLM Externalizer ensemble without loading scan output into the main context. Accepts a file/folder/glob and returns only report paths. Trigger with "review this file", "llm-ext review", "audit these files", "scan for bugs".
Opus-model variant. Fix exactly ONE bug from a markdown bug list produced by llm-externalizer-fix-found-bugs. Reads the bug-file absolute path, picks the highest-severity unfixed entry, applies a minimal surgical fix, updates the bug file with a ` — FIXED` marker plus a short post-mortem, returns a single-line summary. Dispatched per-bug when the user picks "opus" on the model-menu prompt.
Sonnet-model variant. Fix exactly ONE bug from a markdown bug list produced by llm-externalizer-fix-found-bugs. Reads the bug-file absolute path, picks the highest-severity unfixed entry, applies a minimal surgical fix, updates the bug file with a ` — FIXED` marker plus a short post-mortem, returns a single-line summary. Dispatched per-bug when the user picks "sonnet" on the model-menu prompt.
Use when inspecting LLM Externalizer profile config or explaining the manual-edit policy. Trigger with "show LLM profile", "which model is active", "edit settings.yaml".
Use when scanning a project for free using the Nemotron model (no cost, lower quality). Trigger with "free scan", "free-scan", "scan for free", "quick scan", "cheap scan", "scan without cost", "nemotron scan".
Use when extracting the SAME structured metadata from many files with a cheap LLM. Trigger with "mass scout", "scan many files for X", "extract structured data from a folder", "classify all my files", "audit thousands of files", "run a fieldset over a codebase", "audit my plugin", "PR review all changed files", "security-scan this repo".
Use when asking for OpenRouter model details — supported params, pricing, latency, uptime, quantization. Trigger with "openrouter model info", "or-model-info", "what params does X support", "show pricing for", "check model support".
Use when scanning an entire project or codebase for bugs, security issues, or code quality problems. Trigger with "scan project", "audit codebase", "scan codebase", "full scan", "run project scan", "check whole project", "scan all files".
Use when offloading file analysis to external LLMs. Trigger with "analyze files", "scan folder", "check imports", "compare files", "batch check".
When calling LLM APIs from Python code. When connecting to llamafile or local LLM servers. When switching between OpenAI/Anthropic/local providers. When implementing retry/fallback logic for LLM calls. When code imports litellm or uses completion() patterns.
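The retry/fallback pattern this skill targets can be sketched provider-agnostically (the provider callables are placeholders, not a real litellm or OpenAI client):

```python
import time

def complete_with_fallback(prompt, providers, max_retries=2, base_sleep=0.0):
    """Try each provider callable in order; retry transient failures with
    exponential backoff, then fall through to the next provider. Raises
    only after every provider has exhausted its retries."""
    last_err = None
    for call in providers:
        for attempt in range(max_retries + 1):
            try:
                return call(prompt)
            except Exception as err:
                last_err = err
                time.sleep(base_sleep * (2 ** attempt))  # backoff between retries
    raise RuntimeError("all providers failed") from last_err
```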