From python3-development
Analyzes failing tests to classify as test bugs, implementation bugs, or ambiguous. Investigates test intent vs code behavior with evidence, recommends targeted fixes.
npx claudepluginhub jamie-bitflight/claude_skills --plugin python3-developmentThis skill uses the workspace's default tool permissions.
Analyze failing test cases with a balanced, investigative approach.
Analyzes failing test cases to classify failures as test bugs, implementation bugs, or ambiguous. Uses investigative process to verify expectations against code and docs before recommending fixes.
Analyzes failing test cases to classify as test bugs, implementation bugs, or ambiguous cases. Uses investigative process to determine root causes without premature fixes.
Guides systematic test failure investigation with dual hypotheses (test vs. code issue), step-by-step protocol, red flags, and best practices.
Share bugs, ideas, or general feedback.
Analyze failing test cases with a balanced, investigative approach.
Consult ../python3-development/references/python3-standards.md when shared testing or quality rules from this plugin apply; full standards, graphs, and amendment process are documented there.
When tests fail, there are two primary possibilities:
Assuming tests are wrong by default is a dangerous anti-pattern that defeats the purpose of testing.
For each failing test, ask:
Classify the failure as one of:
| Classification | Meaning |
|---|---|
| Test Bug | Test's expectations are incorrect |
| Implementation Bug | Code doesn't behave as it should |
| Ambiguous | Intended behavior is unclear |
Provide clear explanation including:
Scenario: Test expects calculateDiscount(100, 0.2) to return 20, but it returns 80
Analysis:
Determination: Ambiguous Recommendation: Check documentation or clarify intended behavior
Scenario: Test expects validateEmail("user@example.com") to return true, but it returns false
Analysis:
Determination: Implementation Bug Recommendation: Fix the regex to properly validate email addresses per RFC standards
Scenario: Test expects divide(10, 0) to return 0, but it throws an error
Analysis:
Determination: Test Bug Recommendation: Update test to expect an error, not 0
For each failing test, provide:
Test: [test name/description]
Failure: [what failed and how]
Investigation:
- Test expects: [expected behavior]
- Implementation does: [actual behavior]
- Root cause: [why they differ]
Determination: [Test Bug | Implementation Bug | Ambiguous]
Recommendation:
[Specific fix to either test or implementation]