npx claudepluginhub jamie-bitflight/claude_skills --plugin development-harnessThis skill uses the workspace's default tool permissions.
Analyze failing test cases with a balanced, investigative approach.
Analyzes failing test cases to classify failures as test bugs, implementation bugs, or ambiguous. Uses investigative process to verify expectations against code and docs before recommending fixes.
Analyzes failing tests to classify as test bugs, implementation bugs, or ambiguous. Investigates test intent vs code behavior with evidence, recommends targeted fixes.
Guides systematic investigation of test failures using dual-hypothesis approach (test wrong vs. implementation bug) and step-by-step protocol. Use for diagnosing test errors or establishing test failure mindset.
Share bugs, ideas, or general feedback.
Analyze failing test cases with a balanced, investigative approach.
When tests fail, there are two primary possibilities:
Assuming tests are wrong by default is a dangerous anti-pattern that defeats the purpose of testing.
For each failing test, ask:
Classify the failure as one of:
| Classification | Meaning |
|---|---|
| Test Bug | Test's expectations are incorrect |
| Implementation Bug | Code doesn't behave as it should |
| Ambiguous | Intended behavior is unclear |
Provide clear explanation including:
Scenario: Test expects calculateDiscount(100, 0.2) to return 20, but it returns 80
Analysis:
Determination: Ambiguous Recommendation: Check documentation or clarify intended behavior
Scenario: Test expects validateEmail("user@example.com") to return true, but it returns false
Analysis:
Determination: Implementation Bug Recommendation: Fix the regex to properly validate email addresses per RFC standards
Scenario: Test expects divide(10, 0) to return 0, but it throws an error
Analysis:
Determination: Test Bug Recommendation: Update test to expect an error, not 0
For each failing test, provide:
Test: [test name/description]
Failure: [what failed and how]
Investigation:
- Test expects: [expected behavior]
- Implementation does: [actual behavior]
- Root cause: [why they differ]
Determination: [Test Bug | Implementation Bug | Ambiguous]
Recommendation:
[Specific fix to either test or implementation]