Search everything...

Stats

Actions

Available In

humanize

Name: humanize
Author: bbuf

By BBuf

Run an iterative development loop where Claude implements plans and an independent Codex instance reviews each step, with support for Gemini research, automated plan generation, GPU kernel optimization, and git integration.

Publisher marketplacehumanize@KernelPilot · marketplace and plugin share one repository (bbuf/kernel-pilot)

npx claudepluginhub bbuf/kernel-pilot --plugin humanize

Popularity

Stars

Top 5%

109

Med: 0·Avg: 514

Copy clicks

Med: 0·Avg: 1

What's Inside

Slash Commands5

Cancel RLCR Loop

/cancel-rlcr-loop

Cancel active RLCR loop

Generate Idea Draft from Loose Input

/gen-idea

Generate a repo-grounded idea draft via directed-swarm exploration

Generate Plan from Draft

/gen-plan

Generate implementation plan from draft document

Refine Annotated Plan

/refine-plan

Refine an annotated implementation plan and generate a QA ledger

Start RLCR Loop

/start-rlcr-loop

Start iterative loop with Codex review

Agents4

bitlesson-selector

/bitlesson-selector

Selects required BitLesson entries for a specific sub-task. Use before execution for every task or sub-task.

draft-relevance-checker

/draft-relevance-checker

Checks if a draft document is relevant to the current repository. Use when validating draft content for gen-plan command.

plan-compliance-checker

/plan-compliance-checker

Checks plan relevance and compliance before RLCR loop. Use when validating plan files for start-rlcr-loop command.

plan-understanding-quiz

/plan-understanding-quiz

Analyzes a plan and generates multiple-choice technical comprehension questions to verify user understanding before RLCR loop. Use when validating user readiness for start-rlcr-loop command.

Skills8

ask-codex

/ask-codex

Consult Codex as an independent expert. Sends a question or task to codex exec and returns the response.

ask-gemini

/ask-gemini

Consult Gemini as an independent expert with deep web research. Sends a question or task to Gemini CLI and returns a research-backed response.

humanize-gen-plan

/humanize-gen-plan

Generate a structured implementation plan from a draft document. Validates input, checks relevance, analyzes for issues, and generates a complete plan.md with acceptance criteria.

humanize-kernel-agent-loop

/humanize-kernel-agent-loop

Run an autonomous Humanize Kernel Agent Loop for GPU kernel optimization: plan/refine K/R/W into task-acceptance pairs, create a clean standalone repo, research with kernel-knowledge, iterate with benchmark/profile evidence, autotune across the workload distribution, emit kernels/dispatcher/tuning decisions, maintain ledgers, and start RLCR.

humanize-refine-plan

/humanize-refine-plan

Refine an annotated implementation plan into a comment-free plan and a QA ledger while preserving the gen-plan schema.

Hooks1

Event Hooks

Bash

File writes

7 hooks across 4 events

The plugin manifest points to a different repository than the source indexed by ClaudePluginHub.

Stats

Version1.17.0

LanguagePython

Stars109

Forks13

MaintenanceExcellent

LicenseMIT

Last CommitMay 18, 2026

AddedMay 18, 2026

Actions

View on GitHub View README Plugin Marketplace JSON Homepage

Own this plugin?

Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).

Available In

KernelPilot137

Safety Signals

Caution

Executes bash commands

Hook triggers when Bash tool is used

Modifies files

Hook triggers on file write and edit operations

README

KernelPilot

An autonomous Humanize-powered GPU kernel optimization loop with a local PR-driven CUDA knowledge base, Nsight Compute report skills, and clean standalone benchmark repos.

KernelPilot is for serious CUDA kernel tuning runs where the important facts are easy to lose: which upstream PR inspired a candidate, which shape regressed, what Nsight Compute actually said, which evidence changed the next edit, and whether the candidate belongs in a framework repo or a clean experiment.

The project packages three cooperating skills:

Skill	Role
`humanize-kernel-agent-loop`	Turns kernel definition `K`, reference `R`, and workload distribution `W` into task-acceptance pairs, a standalone optimization repo, autonomous research/iteration/autotuning, correctness tests, benchmarks, ledgers, dispatcher, tuning decisions, and review-gated iteration.
`kernel-knowledge`	A local PR-diff-first CUDA kernel evidence corpus. It routes by architecture, repo, topic, technique, profile symptom, operator, and DSL, then opens PR diffs, source snapshots, wiki pages, docs, and blogs as needed.
`ncu-report`	Converts Nsight Compute reports into a reproducible profile digest: metrics, source counters, PM sampling, PTX/SASS hotspots, bottleneck diagnosis, and exactly one next kernel edit.

Together they make an optimization loop that can work from a simple request:

[$humanize-kernel-agent-loop] Optimize SGLang's GEMM path for M=64, N=2048, K=2048, fp16, bias=true, and beat the current SGLang baseline by at least 10%.

The loop decides how to plan, when to query knowledge, what to profile, how to record lineage, how to scan the workload distribution, and when to ask the Humanize review gate whether another round is needed. The human should specify the target when it is ambiguous; the loop owns the rest.

Why Use It

PR-diff-grounded prior art. The knowledge base is organized around real merged kernel PRs, with review diffs and source snapshots materialized under knowledge/evidence/pull-bundles/.
Standalone by default. Candidate kernels do not pollute SGLang, vLLM, PyTorch, or other large framework repos. The loop creates an isolated repo with bindings, tests, benchmarks, ledgers, lineage, and profile artifacts. The standalone repo is where implementation artifacts, provenance, and measurements live.
Evidence-driven profiling. The loop decides when ncu-report is worth running, then uses it to move from vague labels like "memory-bound" toward measured bottlenecks and one concrete next edit.
Knowledge-backed edits. The agent can read PRs, wiki pages, official docs, blog/code notes, and profiler examples when they help explain a benchmark result, profile symptom, regression, plateau, or next edit.
Review-gated iteration. Humanize RLCR keeps the loop from declaring victory too early; default loop budget is 84 iterations unless configured otherwise.
Shape-aware tuning. The loop treats benchmark cases as a workload distribution, builds a performance map, and emits dispatcher/tuning decisions when different regimes need different kernels or configurations.

Kernel Agent Loop

flowchart LR
    K[Kernel definition K] --> P[Plan P = task and AC pairs]
    R[Correctness reference R] --> P
    W[Workload distribution W] --> P
    P --> S[Clean standalone repo]

    subgraph R0[Stage 1: Research]
        KW[kernel-knowledge / KernelWiki]
        B[Baseline and repo inspection]
        RD[Research digest and recipes]
        KW --> RD
        B --> RD
    end

    subgraph I0[Stage 2: Iterate]
        T[Writer executes task t_i]
        E[Inspect, edit, compile, test, benchmark, profile]
        V{Reviewer checks evidence vs ac_i}
        T --> E --> V
        V -->|blocked feedback| T
    end

    subgraph A0[Stage 3: Autotune]
        PM[Performance map over W]
        D[Shape-aware dispatcher]
        TD[Tuning decisions]
        PM --> D --> TD
    end

View full README on GitHub

humanize

Popularity

What's Inside

Confidence

README

KernelPilot

Why Use It

Kernel Agent Loop

Similar Plugins

autoresearch

kernel-opt-agent

autoresearch-agent

autoresearch-ai-plugin

claude-evolve

andrej-karpathy-skills

More by BBuf

ai-infra-auto-driven-skills

KernelPilot

Why Use It

Kernel Agent Loop

More by BBuf

ai-infra-auto-driven-skills

Popularity

Health & Quality

Similar Plugins

autoresearch

kernel-opt-agent

autoresearch-agent

autoresearch-ai-plugin

claude-evolve

andrej-karpathy-skills