SLASH COMMAND ONLY - Do NOT invoke automatically. Only runs via /test-gh-skills command. Executes Python test orchestrator for 80 tests across 6 skill groups.
Runs comprehensive test suite for 80 gh CLI search tests across 6 skill groups using parallel execution. Only triggers when user explicitly runs `/test-gh-skills` command or requests "run the full test suite".
/plugin marketplace add aaddrick/gh-cli-search
/plugin install gh-cli-search@helpful-tools-marketplace
This skill inherits all available tools. When active, it can use any tool Claude has access to.
This skill should NEVER be invoked automatically.
ONLY invoke this skill when:
/test-gh-skills slash command
DO NOT invoke when:
Why: Running the full test suite makes 80 Claude API calls and takes ~8 minutes if the tests run one at a time (~2-3 minutes with the default 4 workers). This is expensive and should only happen on explicit user request via the slash command.
Execute comprehensive test suite for all gh CLI search skills using a Python test orchestrator. The script runs 80 tests in parallel (default: 4 workers), validates responses, and generates detailed reports at multiple levels. After test execution completes, the test-reviewer agent automatically analyzes results and creates REVIEWER-NOTES.md.
Core Principle: Fast, efficient test execution with minimal overhead. Each test runs in a fresh Claude session with only the user request (no test criteria leaked).
Performance: ~6 seconds per test, ~2-3 minutes total for 80 tests with parallel execution (4 workers).
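As a rough check on these figures, here is a back-of-the-envelope estimate. It assumes tests are spread evenly over the workers and ignores startup overhead; the function name `estimate_wall_clock_seconds` is illustrative and not part of the orchestrator.

```python
import math


def estimate_wall_clock_seconds(num_tests: int, seconds_per_test: float, workers: int) -> float:
    """Estimate total run time when tests are spread evenly over workers."""
    if workers < 1:
        raise ValueError("workers must be at least 1")
    # Each worker handles at most ceil(num_tests / workers) tests in sequence.
    tests_per_worker = math.ceil(num_tests / workers)
    return tests_per_worker * seconds_per_test


parallel = estimate_wall_clock_seconds(80, 6.0, 4)    # 20 tests per worker
sequential = estimate_wall_clock_seconds(80, 6.0, 1)  # all 80 tests on one worker
print(f"parallel: {parallel / 60:.1f} min, sequential: {sequential / 60:.1f} min")
# prints "parallel: 2.0 min, sequential: 8.0 min"
```

Under these assumptions, 4 workers give about 2 minutes, which sits at the low end of the ~2-3 minute figure once real-world overhead is added.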
ONLY use this skill when user explicitly invokes:
/test-gh-skills slash command
NEVER use automatically for:
./testing/scripts/run-single-test.sh directly)
python3 testing/scripts/run-all-tests.py [--workers N] [--no-review]
├─> Parse all scenario files in testing/scenarios/
├─> For each test group (6 groups total):
│   ├─> Submit tests to parallel worker pool (default: 4 workers)
│   ├─> Each worker executes: ./testing/scripts/run-single-test.sh "<user-request>"
│   │   └─> claude -p "<user-request>" --allowedTools "Read,Skill"
│   ├─> Extract command from response
│   ├─> Validate against expected criteria
│   └─> Write test report
├─> Generate group reports (per scenario file)
├─> Generate master report (all tests)
└─> Automatically invoke test-reviewer agent (headless)
    ├─> Read master report
    ├─> Analyze group reports
    ├─> Sample individual test failures (3-5 examples)
    ├─> Identify failure patterns
    ├─> Perform root cause analysis
    └─> Create REVIEWER-NOTES.md with recommendations
Total: 80 tests across 6 groups (parallel execution with 4 workers) + automatic review
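The parallel step in the flow above could be sketched roughly as follows. This is a minimal illustration, not the orchestrator's actual code: it assumes a thread pool, and the names `build_claude_command`, `run_group`, and `fake_runner` are hypothetical. The stand-in runner returns a canned string so the sketch runs without the `claude` CLI installed.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List


def build_claude_command(user_request: str) -> List[str]:
    """Build the argv for one fresh headless session, mirroring the flow above."""
    return ["claude", "-p", user_request, "--allowedTools", "Read,Skill"]


def run_group(
    requests: List[str],
    run_one: Callable[[str], str],
    workers: int = 4,
) -> Dict[str, str]:
    """Run every request through run_one on a pool of worker threads."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves input order, so responses line up with requests.
        responses = list(pool.map(run_one, requests))
    return dict(zip(requests, responses))


def fake_runner(user_request: str) -> str:
    """Hypothetical stand-in for invoking run-single-test.sh."""
    return f"response to: {user_request}"


results = run_group(
    ["find my open issues", "search code for TODO"],
    fake_runner,
    workers=2,
)
for request, response in results.items():
    print(request, "->", response)
```

A real runner would presumably pass the argv from `build_claude_command` (or the wrapper script) to `subprocess.run` and capture its output. Only the user request is passed in, which matches the principle that no test criteria reach the session under test.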
run-all-tests.py:
testing/scenarios/
run-single-test.sh (configurable workers)
./testing/reports/yyyy-mm-dd_{COUNT}/
--workers N, --no-review
run-single-test.sh:
Execute the Python test orchestrator:
python3 testing/scripts/run-all-tests.py
Options:
--workers N - Number of parallel workers (default: 4)
--no-review - Skip automatic test-reviewer agent execution
Examples:
# Run with default settings (4 workers, with review)
python3 testing/scripts/run-all-tests.py
# Run with 8 parallel workers
python3 testing/scripts/run-all-tests.py --workers 8
# Run without automatic review
python3 testing/scripts/run-all-tests.py --no-review
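For readers curious how the two options above might be parsed, here is a minimal `argparse` sketch. It assumes the orchestrator uses `argparse`, which the source does not confirm; `build_parser` and the help strings are illustrative.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Define the two documented options with their documented defaults."""
    parser = argparse.ArgumentParser(description="Run the gh CLI search skill tests")
    parser.add_argument("--workers", type=int, default=4,
                        help="number of parallel workers (default: 4)")
    parser.add_argument("--no-review", action="store_true",
                        help="skip the automatic test-reviewer agent")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch is self-contained.
args = build_parser().parse_args(["--workers", "8", "--no-review"])
defaults = build_parser().parse_args([])
print(args.workers, args.no_review)          # prints "8 True"
print(defaults.workers, defaults.no_review)  # prints "4 False"
```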
Expected output:
GH CLI Search Skills - Test Suite Execution
============================================================
Start time: 2025-11-15 09:00:00
Parallel workers: 4
Report directory: /home/aaddrick/source/gh-cli-search/testing/reports/2025-11-15_5
Processing gh-cli-setup-tests...
Found 10 tests
Test 1: Installation Check Command... PASS
Test 2: Authentication Command... PASS
...
Group complete: 10/10 passed
Processing gh-search-code-tests...
...
============================================================
TEST SUITE COMPLETE
============================================================
Total Tests: 80
Passed: 59 (73.8%)
Failed: 21
Execution Time: 180.5 seconds (3.0 minutes)
Average: 2.3 seconds per test
Report location: ./testing/reports/2025-11-15_5/REPORT.md
============================================================
RUNNING TEST REVIEWER AGENT
============================================================
Analyzing results in: /home/aaddrick/source/gh-cli-search/testing/reports/2025-11-15_5
✓ Test reviewer completed successfully
Reviewer notes: ./testing/reports/2025-11-15_5/REVIEWER-NOTES.md
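The summary figures in the sample output can be reproduced with simple arithmetic. The sketch below assumes the reported average is total wall-clock time divided by the number of tests; `summarize` is a hypothetical helper, not a function from run-all-tests.py.

```python
def summarize(total: int, passed: int, elapsed_seconds: float) -> dict:
    """Derive the headline numbers shown in the TEST SUITE COMPLETE banner."""
    if total <= 0:
        raise ValueError("total must be positive")
    return {
        "failed": total - passed,
        "pass_rate_percent": 100.0 * passed / total,
        # Wall-clock time per test; with parallel workers this is lower
        # than the time any single test takes to run.
        "average_seconds": elapsed_seconds / total,
    }


summary = summarize(total=80, passed=59, elapsed_seconds=180.5)
print(summary["failed"])  # prints "21"
```

Here the pass rate is 73.75% and the average is about 2.26 seconds, which the sample output shows rounded as 73.8% and 2.3. If the average is indeed computed this way, that would explain why it is lower than the ~6 seconds each individual test takes.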
The test suite automatically generates multiple reports:
Master Report:
./testing/reports/yyyy-mm-dd_{COUNT}/REPORT.md
Contains:
Reviewer Analysis (Automatically Generated):
./testing/reports/yyyy-mm-dd_{COUNT}/REVIEWER-NOTES.md
Contains:
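The test-reviewer agent runs automatically after test execution completes. It reads the master report, analyzes the group reports, samples 3-5 individual test failures, identifies failure patterns, performs root cause analysis, and writes REVIEWER-NOTES.md with recommendations.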
To skip automatic review:
python3 testing/scripts/run-all-tests.py --no-review
Location: ./testing/reports/yyyy-mm-dd_{COUNT}/REPORT.md
Generated by: run-all-tests.py
Contents: Overall summary, results by group, failed tests, execution time
Location: ./testing/reports/yyyy-mm-dd_{COUNT}/{group-name}/REPORT.md
Generated by: run-all-tests.py
Contents: Group summary, individual test results, group-specific pass rate
Location: ./testing/reports/yyyy-mm-dd_{COUNT}/{group-name}/{test-number}.md
Generated by: run-all-tests.py
Contents: Test details, full response, validation results, pass/fail reason
Location: ./testing/reports/yyyy-mm-dd_{COUNT}/REVIEWER-NOTES.md
Generated by: test-reviewer agent (automatically invoked as headless agent)
Contents: Failure patterns, root cause analysis, prioritized recommendations, next steps
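The four report locations above follow one directory layout, sketched below. The helper `report_paths` is hypothetical. Using `gh-cli-setup-tests` as the group directory name and `1` as the test file name are assumptions based on the sample output, since the source does not show an actual directory listing.

```python
from pathlib import PurePosixPath
from typing import Dict


def report_paths(run_dir: str, group_name: str, test_number: int) -> Dict[str, PurePosixPath]:
    """Map each report level to its file under one run directory."""
    base = PurePosixPath(run_dir)
    return {
        "master": base / "REPORT.md",
        "group": base / group_name / "REPORT.md",
        "test": base / group_name / f"{test_number}.md",
        "reviewer": base / "REVIEWER-NOTES.md",
    }


paths = report_paths("testing/reports/2025-11-15_5", "gh-cli-setup-tests", 1)
for level, path in paths.items():
    print(f"{level}: {path}")
```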
--workers flag
Tests validate:
-- flag presence, PowerShell --%, exclusions inside quotes
@me syntax, date formats (ISO8601), comparison operators
Testing is successful when:
Target pass rate: 90%+ (requires skill alignment improvements)
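One of the validation checks listed earlier, the presence of the `--` separator before a query that contains exclusions, could be approximated as below. This is a hedged sketch rather than the suite's actual validator: `check_exclusion_syntax` is a hypothetical name, it uses POSIX-style tokenizing via `shlex`, and it does not handle the PowerShell `--%` case.

```python
import shlex
from typing import Dict


def check_exclusion_syntax(command: str) -> Dict[str, bool]:
    """Report whether a command has a `--` separator and an exclusion term after it."""
    tokens = shlex.split(command)
    has_separator = "--" in tokens
    # Only tokens after the separator are treated as the search query.
    query_tokens = tokens[tokens.index("--") + 1:] if has_separator else []
    exclusion_in_query = any(
        term.startswith("-") and len(term) > 1
        for token in query_tokens
        for term in token.split()
    )
    return {"has_separator": has_separator, "exclusion_in_query": exclusion_in_query}


good = check_exclusion_syntax('gh search issues -- "bug -label:wontfix"')
plain = check_exclusion_syntax('gh search issues "bug"')
print(good)   # prints "{'has_separator': True, 'exclusion_in_query': True}"
print(plain)  # prints "{'has_separator': False, 'exclusion_in_query': False}"
```

The separator matters because it stops `gh` from treating a leading `-` in the query as a command-line flag.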
If tests fail:
testing/README.md
testing/scripts/run-all-tests.py
testing/scripts/run-single-test.sh
agents/test-reviewer.md - Post-test analysis and recommendations
testing/scenarios/*.md
skills/gh-search-*/SKILL.md, skills/gh-cli-setup/SKILL.md
python3 --version
ls testing/scenarios/
chmod +x testing/scripts/run-single-test.sh
claude --version
ls -la testing/reports/
mkdir -p testing/reports/