From eval-runner
Provision a new eval-runner workspace on disk. Use when the user wants to start a new evaluation project — scaffolds evals/, datasets/, results/, and docs/ directories, personalises CLAUDE.md, and (by default) creates a GitHub repo.
npx claudepluginhub danielrosehill/claude-eval-runner-plugin
Slash command: /eval-runner:new-workspace
Creates a new workspace for evaluation work. The plugin's commands
(/eval-runner:setup-eval, /eval-runner:create-eval, /eval-runner:run-eval,
/eval-runner:publish-eval, /eval-runner:publish-dataset,
/eval-runner:document-eval) operate against this scaffold.
$ARGUMENTS is parsed as:
- … My-Eval-Project). Required.
- … ~/repos/github/my-repos.
- --private (optional): create the GitHub repo as private. Default: public.
- --local-only (optional): skip GitHub repo creation.

<workspace>/
├── CLAUDE.md # workspace-level context for Claude
├── README.md
├── evals/ # one subfolder per eval
│ └── .gitkeep
├── datasets/ # eval datasets (raw + processed)
│ └── .gitkeep
├── results/ # run outputs, scored, with metadata
│ └── .gitkeep
├── docs/ # rationale, findings, writeups
│ └── .gitkeep
└── .gitignore
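
As a rough illustration of the layout above, the following Python sketch creates the same directories and placeholder files under a given root. The helper name `scaffold_workspace` and the empty file contents are assumptions made for this example; the plugin itself copies a `template/` directory rather than generating files this way.

```python
from pathlib import Path

# Directory names taken from the tree above; each gets a .gitkeep placeholder.
SUBDIRS = ("evals", "datasets", "results", "docs")
# Top-level files from the tree above; created empty here (an assumption).
TOP_LEVEL_FILES = ("CLAUDE.md", "README.md", ".gitignore")


def scaffold_workspace(root: Path) -> Path:
    """Create the workspace layout under `root` (hypothetical helper)."""
    # Refuse to overwrite an existing workspace directory.
    root.mkdir(parents=True, exist_ok=False)
    for name in SUBDIRS:
        sub = root / name
        sub.mkdir()
        (sub / ".gitkeep").touch()
    for name in TOP_LEVEL_FILES:
        (root / name).touch()
    return root
```

Calling `scaffold_workspace(Path("My-Eval-Project"))` would produce the four subfolders with their `.gitkeep` files plus the three top-level files, and raises `FileExistsError` if the directory is already present.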
1. Copy template/ from the plugin into <parent>/<name>/.
2. Personalise CLAUDE.md: inject workspace name, date, and a short preamble describing what is being evaluated (ask the user in one line if not obvious).
3. git init, initial commit.
4. Unless --local-only, gh repo create (public by default, private with --private) and push.
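
The steps above can be sketched in Python as follows. This is a minimal sketch, not the plugin's implementation: the `--parent` flag name and its default, the `{{NAME}}`-style placeholders, the commit message, and all helper names are hypothetical, since the source does not show how the parent directory is passed or how `CLAUDE.md` is templated. The `git` and `gh` commands are only assembled as argument lists here, not executed.

```python
import argparse
import shlex
from datetime import date


def parse_arguments(raw: str) -> argparse.Namespace:
    """Parse a $ARGUMENTS-style string. `--parent` is a hypothetical flag name."""
    parser = argparse.ArgumentParser(prog="new-workspace")
    parser.add_argument("name")  # workspace name, required
    parser.add_argument("--parent", default="~/repos/github/my-repos")  # assumed default
    parser.add_argument("--private", action="store_true")
    parser.add_argument("--local-only", action="store_true")
    return parser.parse_args(shlex.split(raw))


def personalise(template: str, name: str, preamble: str, today: date) -> str:
    """Fill hypothetical {{NAME}}, {{DATE}}, {{PREAMBLE}} placeholders in CLAUDE.md."""
    return (
        template.replace("{{NAME}}", name)
        .replace("{{DATE}}", today.isoformat())
        .replace("{{PREAMBLE}}", preamble)
    )


def plan_commands(args: argparse.Namespace) -> list[list[str]]:
    """Return the git/gh commands to run inside the new workspace, in order."""
    commands = [
        ["git", "init"],
        ["git", "add", "."],
        ["git", "commit", "-m", "Initial commit"],  # message is an assumption
    ]
    if not args.local_only:
        # Public unless --private was given, matching the documented default.
        visibility = "--private" if args.private else "--public"
        commands.append(
            ["gh", "repo", "create", args.name, visibility,
             "--source=.", "--remote=origin", "--push"]
        )
    return commands
```

For example, `plan_commands(parse_arguments("My-Eval-Project --local-only"))` yields only the three `git` commands, while omitting `--local-only` appends a `gh repo create` call with `--public` (or `--private` when that flag is passed).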