From agentv-dev
Lints AgentV eval YAML files for structural issues, schema compliance, and quality problems using a Python script, followed by LLM semantic review for factual accuracy, coverage, and consistency.
npx claudepluginhub entityprocess/agentv --plugin agentv-devThis skill uses the workspace's default tool permissions.
Lint and review AgentV eval YAML files for structural issues, schema compliance, and quality problems. Runs deterministic checks via script, then applies LLM judgment for semantic issues the script cannot catch.
Writes, edits, reviews, and validates AgentV EVAL.yaml files for agent skill evaluations. Adds test cases, configures graders, converts from evals.json or chat transcripts.
Reviews AI plugin PRs: structural linting with Python script, eval YAML checks, skill quality audits via checklist. Outputs inline PR comments.
Runs evaluation pipelines on Claude Code skills to test triggering accuracy, workflow correctness, and output quality. Spawns sub-agents for parallel execution and generates JSON reports.
Share bugs, ideas, or general feedback.
Lint and review AgentV eval YAML files for structural issues, schema compliance, and quality problems. Runs deterministic checks via script, then applies LLM judgment for semantic issues the script cannot catch.
Execute scripts/lint_eval.py against the target eval files:
python scripts/lint_eval.py <path-to-evals-dir-or-file> --json
The script checks:
.eval.yaml extensiondescription field presentid, input, and at least one of criteria/expected_output/assertionstype: file use leading /assertions blocks present (flags tests relying solely on expected_output)expected_output prose detection (flags "The agent should..." patterns)input)Report the script findings grouped by severity (error > warning > info). For each finding, include the file path and a concrete fix.
The script catches structural issues but cannot assess:
Read the relevant SKILL.md files and cross-check against the eval content for these issues.
scripts/lint_eval.py — Deterministic eval linter (Python 3.11+, stdlib only)