From argos
Test strategy and pyramid (unit/integration/e2e/contract/load), detection of pyramid violations (ice-cream cone, hourglass), per-stack tooling, coverage thresholds, mutation testing, exploratory + bug bash + UAT protocol, and quality gates beyond CI.
npx claudepluginhub resultakak/argos --plugin argos
This skill uses the workspace's default tool permissions.
agents/shared/severity-rubric.md and agents/shared/escalation-matrix.md are treated as
loaded by default (agents/coordination.md §11). This skill's output must use the
Critical / High / Medium / Low + evidence format; speculative Critical findings are not allowed.
Findings outside this skill's ownership are delegated to the relevant agent; if the decision-authority threshold is exceeded, user approval is required.
| Metric | Tool | Target |
|---|---|---|
| Test count (unit/int/e2e) | pytest --collect-only, npm test -- --listTests, go test -list . | track the ratio |
| Coverage, staged scope | coverage.py / nyc / go test -cover | ≥ 70% |
| Coverage, total | same | ≥ 60% |
| Mutation score | mutmut / Stryker / pitest | ≥ 75% |
| Flaky test rate | CI history, last 50 runs | < 2% |
| Test duration (CI) | pipeline metric | unit < 3 min, int < 7 min, e2e < 15 min |
| Bug escape rate | prod incidents / month | trend ↓ |
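The numeric targets in the table can be checked mechanically. The following is a minimal sketch, assuming the metrics have already been collected into a dictionary; the key names and the `failing_metrics` helper are hypothetical and not part of the skill.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Target:
    """One target from the table: a limit and the direction it must hold in."""
    limit: float
    higher_is_better: bool


# Limits copied from the metrics table; the dictionary keys are hypothetical names.
TARGETS = {
    "coverage_staged_pct": Target(70.0, higher_is_better=True),
    "coverage_total_pct": Target(60.0, higher_is_better=True),
    "mutation_score_pct": Target(75.0, higher_is_better=True),
    "flaky_rate_pct": Target(2.0, higher_is_better=False),
    "unit_minutes": Target(3.0, higher_is_better=False),
    "integration_minutes": Target(7.0, higher_is_better=False),
    "e2e_minutes": Target(15.0, higher_is_better=False),
}


def failing_metrics(measured: dict[str, float]) -> list[str]:
    """Return the names of measured metrics that miss their target, sorted."""
    failures = []
    for name, value in measured.items():
        target = TARGETS[name]
        # ">=" targets pass at the limit; "<" targets must be strictly below it.
        ok = value >= target.limit if target.higher_is_better else value < target.limit
        if not ok:
            failures.append(name)
    return sorted(failures)
```

Bug escape rate and the test-count ratio are left out because the table gives them a trend rather than a fixed limit.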
Test pyramid, top to bottom:
- Manual / Exploratory (≤ 5%)
- E2E (browser, full-stack): 5-10%
- Integration (real DB, real services): 20-30%
- Unit (fast, isolated): 60-70%
Pyramid violations:
| Pattern | Symptom | Cause | Fix |
|---|---|---|---|
| Ice-cream cone | E2E > Integration > Unit | The misconception that "test = scenario" | Add unit tests, cut e2e; behavior-driven tests at the top, technical tests at the bottom |
| Hourglass | Many unit and E2E tests, few integration tests | Mock-heavy unit tests plus browser e2e; real integration coverage is lost | Add integration tests (testcontainers + real DB) |
| Cupcake | Few tests at every level | No testing culture | Unit first, then integration, e2e last |
| Inverted brittle | Integration ≥ Unit | Unit tests written against the DB | Isolate unit tests; keep a separate suite for integration |
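The symptoms in the table translate into a small classifier over raw test counts. This is a sketch under stated assumptions: the `classify_pyramid` name, the `min_total` cutoff for "cupcake", and the rule that "hourglass" means integration is thinner than e2e are all illustrative choices, since the table gives no numeric thresholds for them.

```python
def classify_pyramid(unit: int, integration: int, e2e: int, min_total: int = 30) -> str:
    """Map raw test counts to one of the shapes in the violations table."""
    total = unit + integration + e2e
    # Cupcake: too few tests overall to talk about a shape (cutoff is an assumption).
    if total < min_total:
        return "cupcake"
    # Ice-cream cone: E2E > Integration > Unit.
    if e2e > integration > unit:
        return "ice-cream cone"
    # Inverted brittle: Integration >= Unit.
    if integration >= unit:
        return "inverted brittle"
    # Hourglass: integration is the thinnest layer, below both unit and e2e
    # (this numeric reading of the symptom is an assumption).
    if integration < e2e:
        return "hourglass"
    return "healthy"
```

A count-based check only sees the shape. It cannot tell that a mock-heavy unit test is really an integration test in disguise, so a "healthy" result still needs a manual look at what the unit tests mock.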
| Stack | Unit | Integration | E2E | Contract | Load | Mutation |
|---|---|---|---|---|---|---|
| Python (FastAPI) | pytest + pytest-asyncio | pytest + httpx + testcontainers | playwright + pytest | schemathesis (OpenAPI) | k6 / Locust | mutmut |
| Python (Django) | pytest-django + factory_boy | pytest-django + testcontainers PG | playwright | schemathesis | k6 | mutmut |
| Node (Express/Fastify) | vitest / jest | supertest + testcontainers | playwright | dredd / schemathesis | k6 / artillery | Stryker |
| TypeScript | vitest / jest + ts-jest | supertest + testcontainers | playwright | schemathesis | k6 | Stryker |
| Go | testing + testify | testing + testcontainers-go | playwright (headless) | schemathesis | k6 / vegeta | go-mutesting |
| Java | JUnit 5 | JUnit + Testcontainers | Selenium / playwright | Pact / schemathesis | Gatling / JMeter | pitest |
| React (frontend) | vitest + React Testing Library | MSW + RTL integration | playwright / cypress | — | Lighthouse CI | Stryker |
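Coverage tooling: hosted reporting (codecov, coveralls) and per-language collectors (coverage.py, nyc, go test -cover, jacoco).
Coverage is a weak metric on its own: a line counts as "covered" even when no test would fail if that line were wrong.
Mutation testing mutates the code (e.g. < → <=); if no test fails, the mutant
survives, which means the tests are weak.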
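The `<` → `<=` case can be shown by hand without a mutation tool. The sketch below uses a hypothetical `is_minor` function and writes the mutant out manually; a tool such as mutmut would generate and run such mutants automatically.

```python
def is_minor(age: int) -> bool:
    """Original code: strictly under 18."""
    return age < 18


def is_minor_mutant(age: int) -> bool:
    """Hand-written mutant: `<` replaced with `<=`, as a mutation tool would do."""
    return age <= 18


def weak_test(fn) -> bool:
    """Exercises every line but never touches the boundary value."""
    return fn(5) is True and fn(40) is False


def boundary_test(fn) -> bool:
    """Adds the boundary case that distinguishes `<` from `<=`."""
    return weak_test(fn) and fn(18) is False


# The weak test passes on both versions, so the mutant survives.
assert weak_test(is_minor) and weak_test(is_minor_mutant)
# The boundary test passes on the original and fails on the mutant, so the mutant is killed.
assert boundary_test(is_minor) and not boundary_test(is_minor_mutant)
```

Both tests give `is_minor` full line coverage; only the mutation result separates them.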
Flaky-test remedies by cause:
- Time dependence: freezegun, time.tick.
- Leftover state: defer / afterEach.
- Async timing: Awaitility / eventually(timeout=5s).
Heuristic Test Strategy Model (James Bach):
"Explore [area] with [resources] to discover [info]."
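For the `eventually(timeout=5s)` pattern named among the flaky-test tools above, a polling wait replaces a fixed sleep: the test proceeds as soon as the condition holds and fails only after the timeout. A minimal sketch, assuming a hand-rolled helper; the `eventually` function here is hypothetical and is not the API of Awaitility or of any specific library.

```python
import time
from typing import Callable


def eventually(condition: Callable[[], bool], timeout: float = 5.0, interval: float = 0.05) -> bool:
    """Poll `condition` until it returns True or `timeout` seconds elapse."""
    deadline = time.monotonic() + timeout
    while True:
        if condition():
            return True
        # Check the deadline after the attempt so the condition runs at least once.
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

In a test this would read `assert eventually(lambda: queue_is_empty())`, where `queue_is_empty` stands in for whatever asynchronous state the test waits on.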
Bug bash: an organized pre-release event.
| Gate | Criteria |
|---|---|
| Pre-commit | lint + format + secret scan |
| PR | unit + integration + lint + coverage |
| Merge to main | + e2e + mutation (sample) |
| Release branch | + full mutation + load test (canary RPS) + manual smoke |
| Pre-deploy | + security scan + dependency audit |
| Post-deploy canary | + synthetic check + SLO burn rate alert |
| Production | continuous: synthetic + chaos drill + bug escape rate |
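The gate matrix can be expanded into an explicit checklist per gate. The sketch below reads a leading "+" as "everything from the previous gate, plus these", which is an interpretation of the table rather than something it states; the `GATES` structure and `required_checks` helper are hypothetical. The continuous Production row is left out because it describes ongoing monitoring rather than a one-time gate.

```python
# Gate order and criteria copied from the matrix. The boolean marks rows whose
# criteria start with "+" (assumed to extend the previous gate).
GATES = [
    ("Pre-commit", False, ["lint", "format", "secret scan"]),
    ("PR", False, ["unit", "integration", "lint", "coverage"]),
    ("Merge to main", True, ["e2e", "mutation (sample)"]),
    ("Release branch", True, ["full mutation", "load test (canary RPS)", "manual smoke"]),
    ("Pre-deploy", True, ["security scan", "dependency audit"]),
    ("Post-deploy canary", True, ["synthetic check", "SLO burn rate alert"]),
]


def required_checks(gate: str) -> list[str]:
    """Expand a gate into the full list of checks it implies."""
    checks: list[str] = []
    for name, extends_previous, own in GATES:
        # A gate without "+" starts a fresh list; a gate with "+" appends to it.
        checks = checks + own if extends_previous else list(own)
        if name == gate:
            return checks
    raise KeyError(f"unknown gate: {gate}")
```

Under this reading, `required_checks("Merge to main")` yields the four PR checks followed by `e2e` and `mutation (sample)`.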
User: /qa-plan checkout-svc
Agent (test-engineer):
1. Test inventory: 142 unit, 38 integration, 27 e2e; ratio 69/18/13.
2. Coverage: staged 58% (target 70%: fail), total 52%, critical path 72% (auth 91%, payment 68%: gap).
3. Mutation score (mutmut sample): 63%, low; payment module 48% (Critical).
4. Flaky, last 50 runs: 4 tests (e2e/checkout-flow.spec.ts fails in 18% of runs); quarantine.
5. Pyramid: ratio OK (69/18/13, close to 60/20/10), but integration is thin: many unit tests mock the DB heavily (28 mocked unit tests should really be integration tests).
6. CI duration: unit 4 min (target 3), integration 11 min, e2e 18 min; 33 min total per PR (target ≤ 10 min).
7. Plan:
- Move unit tests that mock the DB index/cache into the integration suite (Hourglass fix).
- Payment critical-path coverage 68% → 95% target (mutation 48% → 85%).
- Shard e2e 4-way (18 min → ~5 min).
- Root-cause the flaky e2e/checkout-flow test (race condition on `await network`).
- Weekly mutation testing run (release branch).
- Bug bash: 2026-06-01, before the release.
- UAT script: Gherkin from discovery → 14 scenarios.
8. Output: report + 9 action items + tooling diff.
# QA Plan: <service>
## Current State
- ratio (unit/int/e2e), coverage, mutation, flaky %, CI duration, bug escape rate
## Pyramid Health
- Healthy / Ice-cream cone / Hourglass / Cupcake / Inverted
## Critical / High / Medium / Low
## Stack Tooling
- table
## Coverage + Mutation
- staged / total / critical path / mutation score
## Flaky
- list + owner + quarantine duration
## Quality Gate Matrix
| Gate | Criteria | Current |
## Action Items
| P | Action | Owner | Due | Issue |