Executes test suites across JS/TS, PHP, Go stacks via npm/pnpm/yarn/phpunit/go test; diagnoses failures, auto-fixes issues, generates coverage reports for unit/integration/E2E/mutation/contract tiers.
Install: npx claudepluginhub parhumm/jaan-to --plugin jaan-to

This skill uses the workspace's default tool permissions.
> Execute tests across stacks, diagnose failures, auto-fix simple issues, and generate coverage reports.
- $JAAN_CONTEXT_DIR/tech.md - Tech stack context (CRITICAL -- determines test runners, package manager, framework); sections: #current-stack, #frameworks, #constraints
- $JAAN_CONTEXT_DIR/config.md - Project configuration
- $JAAN_TEMPLATES_DIR/jaan-to-qa-test-run.template.md - Output template
- $JAAN_LEARN_DIR/jaan-to-qa-test-run.learn.md - Past lessons (loaded in Pre-Execution)
- ${CLAUDE_PLUGIN_ROOT}/docs/extending/language-protocol.md - Language resolution protocol

Test Source: $ARGUMENTS
Accepts 1-2 arguments:

- Test source: a test directory path or the output of /jaan-to:qa-test-generate
- Tier filter: --unit, --integration, --e2e, --mutation, --contract, or --all (default: --all)

Tier flags: --mutation runs mutation testing (StrykerJS/Infection/go-mutesting). --contract delegates API contract validation to /jaan-to:qa-contract-validate. --all includes all tiers, including mutation and contract.
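A minimal sketch of how the two arguments could be separated, assuming $ARGUMENTS arrives as a whitespace-separated string (variable names are illustrative):

```bash
# Illustrative only: split $ARGUMENTS into a test source and a tier filter.
tier="--all"        # default when no tier flag is passed
test_source=""
for arg in $ARGUMENTS; do
  case "$arg" in
    --unit|--integration|--e2e|--mutation|--contract|--all) tier="$arg" ;;
    *) test_source="$arg" ;;   # directory path or qa-test-generate output
  esac
done
```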
IMPORTANT: The input above is your starting point. Determine mode and proceed accordingly.
MANDATORY — Read and execute ALL steps in: ${CLAUDE_PLUGIN_ROOT}/docs/extending/pre-execution-protocol.md
Skill name: qa-test-run
Execute: Step 0 (Init Guard) → A (Load Lessons) → B (Resolve Template) → C (Offer Template Seeding)
Read and apply language protocol: ${CLAUDE_PLUGIN_ROOT}/docs/extending/language-protocol.md
Override field for this skill: language_qa-test-run
Language exception: Test execution output (command output, error messages, stack traces) is NOT affected by this setting and remains in the project's language.
ultrathink
Use extended reasoning for:
If qa-test-generate output provided:
If test directory provided:
If no input: Use AskUserQuestion:
Read $JAAN_CONTEXT_DIR/tech.md for framework detection.
| tech.md value | Test Runner | E2E Runner | Coverage Tool | Package Manager | Test Command Prefix |
|---|---|---|---|---|---|
| Node.js / TypeScript | Vitest / Jest | Playwright / Cypress | @vitest/coverage-v8 / istanbul | pnpm / npm / yarn | npx / pnpm exec |
| PHP | PHPUnit / Pest | Laravel Dusk / Codeception | PHPUnit coverage (Xdebug/PCOV) | composer | vendor/bin/ |
| Go | go test (stdlib) | Rod / Chromedp | go test -cover (built-in) | go mod | go test |
Fallback: If tech.md missing → detect from lockfiles (package-lock.json, composer.lock, go.sum) + config files (vitest.config.*, phpunit.xml, *_test.go) → AskUserQuestion if ambiguous.
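A sketch of that fallback, assuming detection runs from the project root (pnpm/yarn lockfiles added for completeness; stack labels are illustrative):

```bash
# Fallback stack detection from lockfiles when tech.md is missing.
stack=""
if [ -f package-lock.json ] || [ -f pnpm-lock.yaml ] || [ -f yarn.lock ]; then
  stack="node"
elif [ -f composer.lock ]; then
  stack="php"
elif [ -f go.sum ]; then
  stack="go"
fi

# Cross-check with test-framework config files; if nothing matches, ask the user.
ls vitest.config.* >/dev/null 2>&1 && runner="vitest"
[ -f phpunit.xml ] && runner="phpunit"
find . -name '*_test.go' -not -path './vendor/*' 2>/dev/null | grep -q . && runner="go test"
[ -z "$stack" ] && echo "Stack ambiguous -- fall back to AskUserQuestion"
```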
Scan per detected stack:
Node.js/TypeScript:
- Unit: *.test.{ts,tsx}, *.spec.{ts,tsx} in test/unit/ or __tests__/
- Integration: *.integration.test.*, *.int.test.* in test/integration/
- E2E: *.spec.{ts,tsx} in test/e2e/, e2e/, or tests/

PHP:
- Unit: *Test.php in tests/Unit/
- Integration: *Test.php in tests/Feature/
- E2E: *Test.php in tests/Browser/

Go:
- Unit: *_test.go in package directories (no build tags)
- Integration: *_test.go with //go:build integration tag
- E2E: *_test.go with //go:build e2e tag

Present file counts per tier.
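A sketch of per-tier file counting for the Node.js layout above (directory names as listed; adjust globs per stack):

```bash
# Count Node.js/TypeScript test files per tier; missing directories simply yield 0.
unit_count=$(find test/unit __tests__ -type f \
  \( -name '*.test.ts' -o -name '*.test.tsx' -o -name '*.spec.ts' -o -name '*.spec.tsx' \) 2>/dev/null | wc -l)
integration_count=$(find test/integration -type f \
  \( -name '*.integration.test.*' -o -name '*.int.test.*' \) 2>/dev/null | wc -l)
e2e_count=$(find test/e2e e2e tests -type f \
  \( -name '*.spec.ts' -o -name '*.spec.tsx' \) 2>/dev/null | wc -l)
echo "Unit: $unit_count  Integration: $integration_count  E2E: $e2e_count"
```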
Run stack-aware health checks before execution:
| Check | Node.js | PHP | Go |
|---|---|---|---|
| Dependencies installed | node_modules/ exists | vendor/ exists | go.sum exists |
| ORM client generated | Prisma: npx prisma generate | Eloquent: migrations run | sqlc: sqlc generate |
| Test env file | .env.test or .env.testing | .env.testing | _test build tag |
| E2E server | webServer config in playwright | artisan serve | custom server |
Reference: See ${CLAUDE_PLUGIN_ROOT}/docs/extending/qa-test-run-reference.md, section "Health Check Matrix", for per-stack detailed checks and fix commands.
If any check fails → report issue and offer auto-fix (Step 6).
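A minimal sketch of the dependency and env-file checks from the table above (the fix suggestions in the messages are illustrative; see the reference doc for the full matrix):

```bash
# Stack-aware pre-flight checks; any failure is reported and offered for auto-fix (Step 6).
case "$stack" in
  node) [ -d node_modules ] || echo "FAIL: node_modules/ missing -- run the detected package manager's install" ;;
  php)  [ -d vendor ]       || echo "FAIL: vendor/ missing -- run composer install" ;;
  go)   [ -f go.sum ]       || echo "FAIL: go.sum missing -- run go mod tidy" ;;
esac

# Test environment file (Node/PHP); Go relies on build tags instead.
if [ "$stack" != "go" ] && [ ! -f .env.test ] && [ ! -f .env.testing ]; then
  echo "FAIL: no test env file (.env.test / .env.testing) found"
fi
```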
Construct commands per tier based on detected stack and framework:
Node.js/TypeScript:
- Unit: npx vitest run --workspace=unit --reporter=json
- Integration: npx vitest run --workspace=integration --reporter=json
- E2E: npx playwright test --reporter=json

PHP:
- Unit: vendor/bin/phpunit --testsuite=unit --log-junit=results.xml
- Integration: vendor/bin/phpunit --testsuite=feature --log-junit=results.xml
- E2E: vendor/bin/phpunit --testsuite=browser --log-junit=results.xml

Go:
- Unit: go test ./... -json -cover
- Integration: go test ./... -json -cover -tags=integration
- E2E: go test ./... -json -tags=e2e

Reference: See ${CLAUDE_PLUGIN_ROOT}/docs/extending/qa-test-run-reference.md, section "Multi-Stack Test Commands", for full command tables with flags, environment variables, and framework-specific options.
Present execution plan with estimated test counts per tier.
Show complete analysis and execution plan:
TEST EXECUTION PLAN
-------------------------------------------------------------
Stack: {detected_stack}
Test Runner: {detected_runner}
Coverage Tool: {detected_coverage}
Package Manager: {detected_pm}
Health Checks: {pass_count}/{total} passed
{list_of_checks_with_status}
Tests Found:
Unit: {count} files ({scenario_count} tests)
Integration: {count} files ({scenario_count} tests)
E2E: {count} files ({scenario_count} tests)
Execution Order: unit → integration → E2E
Tier Filter: {all|unit|integration|e2e}
{health_check_issues_if_any}
Use AskUserQuestion:
Do NOT proceed to Phase 2 without explicit approval.
Before running tests, attempt to fix common issues (stack-aware):
Node.js:
- npx prisma generate
- Create .env.test from .env.example

PHP:
- composer dump-autoload
- Create .env.testing from .env.example
- php artisan migrate --env=testing

Go:
- go generate ./...
- go mod tidy

For environment values (DB URLs, API keys), use AskUserQuestion to get actual values from the user -- never guess or use placeholders.
Reference: See ${CLAUDE_PLUGIN_ROOT}/docs/extending/qa-test-run-reference.md, section "Auto-Fix Procedures", for per-stack step-by-step procedures.
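A sketch of the auto-fixes above, applied only when the corresponding health check failed (commands taken from the lists above; the copy source is assumed to be .env.example):

```bash
# Apply stack-specific auto-fixes before running tests.
case "$stack" in
  node)
    npx prisma generate                          # only if the project uses Prisma
    [ -f .env.test ] || cp .env.example .env.test
    ;;
  php)
    composer dump-autoload
    [ -f .env.testing ] || cp .env.example .env.testing
    php artisan migrate --env=testing
    ;;
  go)
    go generate ./...
    go mod tidy
    ;;
esac
# DB URLs, API keys, etc. come from AskUserQuestion answers -- never guessed.
```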
Run tests in order: unit → integration → E2E
Apply per-stack parallel execution and coverage tool selection for speed:
Parallel Execution (add flags to commands from Step 5):
| Stack | Runner | Parallel Flag | Speedup |
|---|---|---|---|
| JS/TS | Vitest | pool: 'threads' + --no-isolate (stateless tests) | ~14% faster than forks (see reference doc) |
| JS/TS | Playwright | fullyParallel: true, workers: 4, --shard=X/Y in CI | 40min → 5-7min (4 shards) |
| PHP | ParaTest | vendor/bin/paratest -p8 --runner WrapperRunner | 5x (20min → 4min) |
| Go | go test | t.Parallel() + -parallel 128 -p 16 | 3x on I/O-bound tests |
| Python | pytest-xdist | -n auto --dist loadscope | 5x with 8 cores |
Reference: See ${CLAUDE_PLUGIN_ROOT}/docs/extending/qa-test-run-reference.md, section "Parallel Execution Strategy", for per-stack config examples and trade-offs.
Coverage Tool Selection (prefer fast providers in CI):
| Stack | Fast Provider | Overhead | Slow Provider | Overhead |
|---|---|---|---|---|
| JS/TS | V8 (@vitest/coverage-v8) | ~10% | Istanbul | ~300% |
| PHP | PCOV | ~34% (1.3x) | Xdebug 3 | ~280% (3.8x) |
| Go | go test -cover (native) | <1-5% | N/A | N/A |
Rule: Use fast provider in CI; use slow provider locally only when branch/path coverage or debugging is needed.
E2E Auth Caching (Playwright): Use storageState to authenticate once and reuse across tests. Avoids redundant login UI flows per test.
Fail-Fast: Set maxFailures in Playwright config. If >80% of a tier fails, halt tier and diagnose before continuing (likely setup issue).
For each tier:
Important: Use --reporter=json (Vitest), --reporter=json (Playwright), --log-junit (PHPUnit), or -json (Go) for machine-parseable output. Never rely on text output parsing.
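A sketch of summarizing one tier from a Jest-style JSON report (the schema Vitest's --reporter=json emits) and applying the >80% fail-fast rule; assumes the report was saved to results.json via an output-file flag, and the top-level count fields follow that schema:

```bash
# Summarize one tier from a Jest/Vitest-style JSON report and apply fail-fast.
total=$(jq '.numTotalTests' results.json)
failed=$(jq '.numFailedTests' results.json)
passed=$(jq '.numPassedTests' results.json)
echo "Tier result: $passed/$total passed, $failed failed"

if [ "$total" -gt 0 ] && [ $(( failed * 100 / total )) -gt 80 ]; then
  echo "More than 80% of the tier failed -- halting tier; likely a setup issue, diagnose first."
fi
```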
When --contract tier is selected (explicitly or via --all):
- Locate the contract: specs/openapi.yaml, specs/openapi.json, or check $JAAN_OUTPUTS_DIR/backend/api-contract/
- Delegate to /jaan-to:qa-contract-validate "{contract_path}" — do NOT execute Spectral/oasdiff/Prism/Schemathesis directly (those tools are not in this skill's allowed-tools)

Contract validation tools (Spectral, oasdiff, Prism, Schemathesis) are exclusively owned by qa-contract-validate. This skill delegates rather than duplicates.
Categorize each failure into generic categories:
| Category | Auto-Fix | Examples |
|---|---|---|
| Import/Module resolution | Yes | Missing modules, wrong paths, autoload issues |
| ORM/DB client generation | Yes | Prisma not generated, Eloquent not migrated, sqlc stale |
| Environment configuration | Yes (ask value) | Missing env vars, wrong DB URLs |
| Assertion failures | No (manual) | Business logic mismatches |
| Timeout/Async errors | Suggest fix | Slow operations, missing await, goroutine leaks |
| Database/State errors | Suggest fix | Missing migrations, seed data, connection refused |
| Mock/Fixture errors | Suggest fix | Stale snapshots, missing handlers, mock mismatches |
Reference: See ${CLAUDE_PLUGIN_ROOT}/docs/extending/qa-test-run-reference.md, section "Error Pattern Detection", for per-stack regex patterns to identify each category.
For auto-fixable categories: apply fix and mark for re-run. For manual categories: collect diagnostic info for the report.
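A sketch of pattern-based categorization on captured runner output; the grep patterns and file name here are illustrative examples, not the full per-stack set from the reference doc:

```bash
# Rough failure categorization from captured output (illustrative patterns only).
out="test-output.log"
if grep -qE 'Cannot find module|Class .* not found|no required module provides' "$out"; then
  category="import/module resolution"        # auto-fixable
elif grep -qiE 'prisma generate|did not initialize' "$out"; then
  category="ORM/DB client generation"        # auto-fixable
elif grep -qE 'ECONNREFUSED|connection refused|Missing required env' "$out"; then
  category="environment/database"            # auto-fixable (ask user for values)
else
  category="assertion/other"                 # manual: collect diagnostics for the report
fi
```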
After auto-fixes, selectively re-run only failed tests:
Node.js (Vitest): npx vitest run --reporter=json {failed_test_files}
Node.js (Playwright): npx playwright test --reporter=json {failed_spec_files}
PHP (PHPUnit): vendor/bin/phpunit --filter="{FailedTestName}"
Go: go test -json -run "TestName" ./package/...
Reference: See ${CLAUDE_PLUGIN_ROOT}/docs/extending/qa-test-run-reference.md, section "Selective Re-Run Commands", for per-framework re-run flags and options.
Track which tests were fixed by auto-fix vs still failing.
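A sketch of extracting failed Vitest test files from the JSON report and re-running only those; assumes the Jest-style testResults[].name / status fields and a results.json output file:

```bash
# Re-run only the files that failed in the previous Vitest run.
failed_files=$(jq -r '.testResults[] | select(.status == "failed") | .name' results.json)
if [ -n "$failed_files" ]; then
  # shellcheck disable=SC2086  # word-splitting the file list is intentional here
  npx vitest run --reporter=json $failed_files
fi
```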
Parse coverage output per stack:
Node.js: Istanbul/v8 JSON from coverage/coverage-summary.json
PHP: PHPUnit Clover XML from coverage.xml or build/logs/clover.xml
Go: go test -coverprofile=coverage.out → parse with go tool cover
Extract:
Reference: See ${CLAUDE_PLUGIN_ROOT}/docs/extending/qa-test-run-reference.md, section "Coverage Parsing Rules", for per-stack parsing patterns and output formats.
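A sketch of pulling line/branch percentages from the Node.js summary and the Go total (assumes the json-summary reporter is enabled for Node.js; Clover XML parsing for PHP is left to the reference doc):

```bash
# Node.js: Istanbul/V8 json-summary format.
line_pct=$(jq '.total.lines.pct' coverage/coverage-summary.json)
branch_pct=$(jq '.total.branches.pct' coverage/coverage-summary.json)

# Go: take the "total:" row from the function-level coverage report.
go tool cover -func=coverage.out | awk '$1 == "total:" { print "Go total coverage:", $NF }'
```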
If --mutation tier selected OR mutation tool config detected (stryker.config.*, infection.json5, etc.):
Parse mutation score from mutation tool outputs only (never conflate with code coverage):
- StrykerJS: reports/mutation/mutation.json -> mutationScore
- Infection: infection-log.json -> stats.msi
- go-mutesting: killed/total ratio (NOT go test -cover)
- mutmut: results CLI output -> parse survived/killed/total counts (NOT .mutmut-cache SQLite)

If mutation tool not available for stack: report mutation_score: null (JSON null, NOT "N/A") and exclude from quality-gate weighting. Parsers treat null as "not measured", 0 as "measured zero".
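A sketch following the tool-to-field mapping above, emitting JSON null when no mutation report exists (jq paths per the mapping; exact report locations may vary per project config):

```bash
# Mutation score from mutation-tool reports only -- never from code coverage.
if [ -f reports/mutation/mutation.json ]; then
  mutation_score=$(jq '.mutationScore // null' reports/mutation/mutation.json)   # StrykerJS
elif [ -f infection-log.json ]; then
  mutation_score=$(jq '.stats.msi // null' infection-log.json)                   # Infection
else
  mutation_score=null   # not measured; parsers must not read this as zero
fi
echo "{\"mutation_score\": $mutation_score}"
```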
Add mutation results to output report:
Track RED-GREEN cycle count during test execution:
Compile results into structured report:
source "${CLAUDE_PLUGIN_ROOT}/scripts/lib/id-generator.sh"
SUBDOMAIN_DIR="$JAAN_OUTPUTS_DIR/qa/test-run"
mkdir -p "$SUBDOMAIN_DIR"
NEXT_ID=$(generate_next_id "$SUBDOMAIN_DIR")
Generate slug from test directory or project name (lowercase-kebab-case, max 50 chars).
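A sketch of slug generation from the project or test-directory name (lowercase-kebab-case, trimmed to 50 characters; $PROJECT_NAME is a placeholder for whichever name is used):

```bash
# Derive a lowercase-kebab-case slug, max 50 characters.
slug=$(printf '%s' "$PROJECT_NAME" \
  | tr '[:upper:]' '[:lower:]' \
  | sed -E 's/[^a-z0-9]+/-/g; s/^-+//; s/-+$//' \
  | cut -c1-50)
```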
Test execution for {project_name}: {total_tests} tests across {tier_count} tiers.
{pass_count} passed, {fail_count} failed, {skip_count} skipped.
{auto_fix_count} failures auto-fixed. Coverage: {line_pct}% line, {branch_pct}% branch.
OUTPUT PREVIEW
-------------------------------------------------------------
ID: {NEXT_ID}
Folder: $JAAN_OUTPUTS_DIR/qa/test-run/{NEXT_ID}-{slug}/
Results:
Unit: {pass}/{total} passed ({coverage}% coverage)
Integration: {pass}/{total} passed ({coverage}% coverage)
E2E: {pass}/{total} passed
Auto-Fixes Applied: {count}
Remaining Failures: {count}
Use AskUserQuestion:
If approved:
OUTPUT_FOLDER="$JAAN_OUTPUTS_DIR/qa/test-run/${NEXT_ID}-${slug}"
mkdir -p "$OUTPUT_FOLDER"
Path: $OUTPUT_FOLDER/${NEXT_ID}-${slug}.md
Use template from: $JAAN_TEMPLATES_DIR/jaan-to-qa-test-run.template.md
Fill sections:
source "${CLAUDE_PLUGIN_ROOT}/scripts/lib/index-updater.sh"
add_to_index \
"$SUBDOMAIN_DIR/README.md" \
"$NEXT_ID" \
"${NEXT_ID}-${slug}" \
"{Project Name} Test Execution" \
"{Executive Summary}"
TEST EXECUTION COMPLETE
-------------------------------------------------------------
ID: {NEXT_ID}
Folder: $JAAN_OUTPUTS_DIR/qa/test-run/{NEXT_ID}-{slug}/
Index: Updated $JAAN_OUTPUTS_DIR/qa/test-run/README.md
Results:
Total: {total_tests} tests
Passed: {pass_count}
Failed: {fail_count}
Skipped: {skip_count}
Auto-Fixed: {auto_fix_count}
Coverage:
Line: {line_pct}%
Branch: {branch_pct}%
Test execution complete!
Next Steps:
- Review failure diagnostics in the report
- Fix remaining assertion failures manually
- Re-run with /jaan-to:qa-test-run {output-path} --all after fixes
- Use /jaan-to:qa-test-generate to generate tests for uncovered areas
- See the report for detailed coverage gaps and suggested improvements
Use AskUserQuestion:
If "Learn from this": Run /jaan-to:learn-add qa-test-run "{feedback}"
Reference: See ${CLAUDE_PLUGIN_ROOT}/docs/extending/qa-test-run-reference.md, section "Key Execution Rules", for test execution best practices, tier ordering rationale, and anti-patterns to avoid.
- tech.md detection
- $JAAN_OUTPUTS_DIR path
- $JAAN_OUTPUTS_DIR/qa/test-run/{id}-{slug}/