Skill

write-runbook

Write an operational runbook for a service, deployment, or incident response procedure.

Install

npx claudepluginhub hpsgd/turtlestack --plugin internal-docs-writer

Tool Access

This skill is limited to using the following tools:

Read, Write, Edit, Bash, Glob, Grep

Preview

Write a runbook for $ARGUMENTS using the mandatory structure below.

SKILL.md

Stats

Parent Repo Stars: 0

Parent Repo Forks: 0

Last Commit: Apr 2, 2026

| Field | Value |
| --- | --- |
| What this covers | [one sentence] |
| When to use | [trigger conditions — alert name, symptom, or scheduled occasion] |
| Estimated duration | [time range for the full procedure] |
| Risk level | Low / Medium / High / Critical |
| Last tested | [date] |
| Owner | [team or person responsible for maintaining this runbook] |

- [ ] Access to [system] — how to get it: [link or instructions]
- [ ] CLI tool [name] installed — version [X]+ — install: `[command]`
- [ ] Environment variable [NAME] set — get it from: [location]
- [ ] VPN connected to [network] — connect: [instructions]
- [ ] Permissions: [specific role/group] in [system] — request via: [link]
- [ ] Communication channel open: [Slack channel / war room] — notify: [who]
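Some of the prerequisites above can be checked mechanically before the procedure starts. The following is a minimal sketch, not part of the skill itself: the helper names `require_cmd` and `require_env` are hypothetical, and the tool and variable names in the usage comment stand in for the `[name]` and `[NAME]` placeholders.

```bash
#!/usr/bin/env bash
# Preflight sketch. require_cmd and require_env are hypothetical helpers,
# not part of the write-runbook skill.

# Succeeds if the given CLI tool is on PATH; otherwise reports what is missing.
require_cmd() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "ok: $1 is installed"
  else
    echo "missing: CLI tool $1 is not installed" >&2
    return 1
  fi
}

# Succeeds if the named variable is exported and non-empty.
require_env() {
  if [ -n "$(printenv "$1")" ]; then
    echo "ok: $1 is set"
  else
    echo "missing: environment variable $1 is not set" >&2
    return 1
  fi
}

# Example usage (names are placeholders, replace with the runbook's own):
#   require_cmd kubectl && require_env DEPLOY_TOKEN
```

Checks such as VPN connectivity or role membership depend on the environment and are left as manual checklist items here.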

#### Step N: [what you're doing and why]

**Action:**

```bash
[exact command to run — copy-pasteable, no placeholders without explanation]
```

**Expected output:**

```
[what you should see if this worked correctly]
```

**If this fails:**

- Symptom: [what you might see instead]
- Likely cause: [why this happens]
- Fix: [what to do about it]
- If the fix doesn't work: [escalate to whom, or which step to go to]

**Checkpoint:** [how to confirm this step succeeded before moving on]
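When a procedure is scripted rather than typed by hand, the action, expected output, and checkpoint of a step can be combined in one wrapper. This is a sketch under assumptions: `run_step` is a hypothetical helper, and matching the expected text as a substring is a simplification of the "Expected output" field.

```bash
#!/usr/bin/env bash
# run_step DESCRIPTION EXPECTED COMMAND [ARGS...]
# Hypothetical helper: runs COMMAND, then treats "output contains EXPECTED"
# as the checkpoint for the step.
run_step() {
  local desc="$1" expected="$2"
  shift 2
  local output
  echo "==> $desc"
  # Capture stdout and stderr so a failure message can show what happened.
  if ! output="$("$@" 2>&1)"; then
    echo "FAILED: command exited non-zero. Output: $output" >&2
    return 1
  fi
  case "$output" in
    *"$expected"*)
      echo "Checkpoint passed: output contains '$expected'"
      ;;
    *)
      echo "FAILED: expected output containing '$expected', got: $output" >&2
      return 1
      ;;
  esac
}

# Example usage with a harmless command:
#   run_step "Confirm shell works" "hello" echo "hello world"
```

A non-zero return is the cue to consult the step's "If this fails" section instead of continuing.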

#### Verification checklist

- [ ] Service is responding: `curl -s https://[endpoint]/health | jq .status` → should return `"ok"`
- [ ] Logs are clean: `[log command]` → no errors in the last 5 minutes
- [ ] Metrics are normal: [dashboard link] → [what to look for]
- [ ] Dependent services unaffected: [how to check]
- [ ] Users can perform [core action]: [how to test]
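The command-based items in this checklist can be tallied by a small script so the operator sees one pass/fail summary. The sketch below assumes a hypothetical `verify` helper and a `failures` counter; the endpoint in the usage comment is still the template's placeholder and must be filled in.

```bash
#!/usr/bin/env bash
# Verification sketch. verify and failures are hypothetical names.
failures=0

# verify LABEL EXPECTED ACTUAL
# Prints PASS or FAIL and counts failures for a final summary.
verify() {
  if [ "$3" = "$2" ]; then
    echo "PASS $1"
  else
    echo "FAIL $1: expected $2, got $3"
    failures=$((failures + 1))
  fi
}

# Example usage. jq prints JSON strings with their quotes, hence '"ok"':
#   verify "health endpoint" '"ok"' "$(curl -s https://[endpoint]/health | jq .status)"
#   [ "$failures" -eq 0 ] || echo "$failures check(s) failed"
```

Dashboard and user-journey items still need a human to look at them.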

#### Rollback procedure

**When to roll back:** [specific conditions that trigger a rollback decision]
**Rollback window:** [how long after the procedure can you still roll back?]
**Data implications:** [will rollback cause data loss? What data?]

1. [Rollback step with exact command]
   Expected result: [what you should see]
2. [Next rollback step]
   Expected result: [what you should see]

#### After rollback

- [ ] Verify rollback succeeded: [how]
- [ ] Notify: [who needs to know]
- [ ] Create incident ticket: [where, with what information]
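If the rollback window is expressed as a duration, a script can check it before any rollback command runs. This is only an illustrative sketch: `within_rollback_window` is a hypothetical helper, and it assumes the procedure's finish time was recorded as a Unix epoch timestamp.

```bash
#!/usr/bin/env bash
# within_rollback_window FINISHED_EPOCH WINDOW_SECONDS [NOW_EPOCH]
# Hypothetical helper: succeeds while the elapsed time since the procedure
# finished is no longer than the rollback window. NOW_EPOCH defaults to the
# current time and exists mainly to make the check testable.
within_rollback_window() {
  local finished="$1" window="$2" now="${3:-$(date +%s)}"
  [ $((now - finished)) -le "$window" ]
}

# Example usage with a one-hour window:
#   if within_rollback_window "$finished_at" 3600; then
#     echo "Rollback still possible"
#   else
#     echo "Rollback window has passed, escalate instead"
#   fi
```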

| Condition | Escalate to | Contact | Expected response time |
| --- | --- | --- | --- |
| Procedure fails after troubleshooting | [team/person] | [Slack/phone/page] | [time] |
| Data loss suspected | [team/person] | [Slack/phone/page] | Immediate |
| Customer impact detected | [team/person] | [Slack/phone/page] | Immediate |
| Unsure whether to proceed | [team/person] | [Slack/phone/page] | [time] |

| Check | Requirement |
| --- | --- |
| Copy-paste test | Could someone paste every command and have it work? |
| Failure coverage | Does every step have a "what if this fails" section? |
| No assumed knowledge | Could a new team member follow this on their first week? |
| Rollback exists | Is there a complete rollback procedure? |
| Escalation contacts | Are real people or teams listed, with contact methods? |
| Timing estimates | Does the reader know how long things should take? |
| Warnings present | Are destructive or irreversible steps clearly marked? |
| Verification complete | Can the operator confirm success at the end? |
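Part of the copy-paste test can be approximated by scanning a finished runbook for template placeholders that were never filled in. The sketch below is a rough heuristic, not a check the skill defines: `find_placeholders` is a hypothetical helper that reports any bracketed phrase starting with a letter, so markdown links and ticked checkboxes will also be listed and need a manual look.

```bash
#!/usr/bin/env bash
# find_placeholders FILE
# Hypothetical helper: prints "line-number:line" for every line containing
# a bracketed phrase that starts with a letter, such as [endpoint].
# Empty checkboxes "[ ]" are not reported. Exits non-zero when nothing is found.
find_placeholders() {
  grep -n '\[[A-Za-z][^]]*]' "$1"
}

# Example usage (runbook.md is an assumed file name):
#   if find_placeholders runbook.md; then
#     echo "Unfilled placeholders remain"
#   fi
```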

Step 1 — Research the procedure

Step 2 — Write the runbook

Title

Overview

Prerequisites

Procedure

Verification

Rollback

Troubleshooting

Escalation

Appendix

Step 3 — Quality checks

Rules
