Help us improve
Share bugs, ideas, or general feedback.
From devops-practices
Metrics, logs, traces (observability); choosing what to measure, dashboards, and incident response.
npx claudepluginhub sethdford/claude-skills --plugin engineer-devops-practicesHow this skill is triggered — by the user, by Claude, or both
Slash command
/devops-practices:monitoring-instrumentationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Observing systems to understand behavior and detect problems.
Provides observability patterns for metrics, logging, tracing, alerting, dashboards, and infrastructure monitoring in production systems with Prometheus, Grafana, OpenTelemetry.
Designs production-grade monitoring, logging, and tracing systems with SLI/SLO management, alerting, and incident response workflows.
Design monitoring and alerting that catches production issues fast without creating alert fatigue. Use when establishing observability or improving incident response.
Share bugs, ideas, or general feedback.
Observing systems to understand behavior and detect problems.
You are planning observability. Measure what matters; enable debugging.