From Build Swift Apps
Investigate and fix flaky tests using Tuist CI test insights and local repeated runs. Accepts a test URL/identifier or asks to discover flaky tests, then analyzes root causes and verifies fixes with xcodebuild retries.
How this skill is triggered — by the user, by Claude, or both
Slash command
/build-swift-apps:tuist-flaky-test-stabilizerThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Use this skill to move from flaky-test evidence to a small code or test fix.
Use this skill to move from flaky-test evidence to a small code or test fix.
Accept any of:
Module/Suite/testName identifiertuist test case list --flaky --json --page-size 50
tuist test case show <id-or-identifier> --json
tuist test case run list <module/suite/test> --flaky --json
tuist test case run show <run-id> --json
Collect:
Open the reported source line, then inspect setup, teardown, shared fixtures, global state, async waits, mocked services, file system use, and clock/timezone dependencies.
Common root causes:
Run the narrow test repeatedly:
xcodebuild test \
-workspace <workspace> \
-scheme <scheme> \
-only-testing <module>/<suite>/<test> \
-test-iterations 50 \
-run-tests-until-failure
If the flake depends on parallelism, broaden the scope:
xcodebuild test \
-workspace <workspace> \
-scheme <scheme> \
-only-testing <module> \
-parallel-testing-enabled YES \
-test-iterations 20 \
-run-tests-until-failure
Use Thread Sanitizer when a data race is plausible and the project can run with TSan enabled.
Report the root cause, fix, before/after evidence, exact verification command, and any residual risk if the original flake could not be reproduced locally.
npx claudepluginhub xopoko/build-swift-apps --plugin build-swift-apps2plugins reuse this skill
First indexed Jun 4, 2026
Expert approach to flaky-test-remediation in test automation. Use when working with .
Detects flaky tests via repeated runs under varied order, parallelism, and seed, plus static smell scanning for latent flakiness. Classifies tests as confirmed-flaky, latent-flaky, or stable and fixes root causes without weakening assertions.
Diagnoses non-deterministic test failures and eliminates root causes (timing, shared state, concurrency, external dependency, randomness) instead of retrying or skipping.