Help us improve
Share bugs, ideas, or general feedback.
From Harness Data
UK AISI Inspect AI eval conventions — the dataset/solver/scorer three-piece shape, Docker-sandboxed task isolation, the 200+ pre-built inspect_evals starter tasks, and wrapping Claude Code/Codex/Gemini CLI as the agent under test. Use when authoring or running Inspect AI evals, sandbox-isolated agentic evals, or adding a custom eval task.
npx claudepluginhub camilool8/harness-engineering-templates --plugin harness-dataHow this skill is triggered — by the user, by Claude, or both
Slash command
/harness-data:addon-inspect-aiThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
- **Dataset → solver → scorer.** Three-piece eval shape. Datasets are
Provides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.
Guides Payload CMS config (payload.config.ts), collections, fields, hooks, access control, APIs. Debugs validation errors, security, relationships, queries, transactions, hook behavior.
Share bugs, ideas, or general feedback.
github.com/UKGovernmentBEIS/inspect_evals.
Use them as starter tasks; do not re-invent.