From oma
Routes observability intents across MELT+P signals, layers, boundaries, and vendors. Designs pipelines, tunes transports, meta-observability, incident forensics for telemetry, APM, RUM, SLOs.
npx claudepluginhub first-fluke/oh-my-agent --plugin omaThis skill uses the workspace's default tool permissions.
Route, design, tune, and review observability work across MELT+P signals, layers, boundaries, vendor categories, transport choices, meta-observability, and incident forensics.
Builds production-ready monitoring, logging, and tracing systems with observability strategies, SLI/SLO management, alerting, and incident response workflows.
Builds production-ready monitoring, logging, and tracing systems with observability strategies, SLI/SLO management, and incident response. For designing systems, alerting, and reliability investigations.
Provides patterns for observability strategies covering logs, metrics, traces, and signal correlation. Use when designing monitoring systems or implementing the three pillars.
Share bugs, ideas, or general feedback.
Route, design, tune, and review observability work across MELT+P signals, layers, boundaries, vendor categories, transport choices, meta-observability, and incident forensics.
resources/| Action | SSL primitive | Evidence |
|---|---|---|
| Classify observability intent | SELECT | Intent rules |
| Read telemetry/config evidence | READ | OTel/vendor configs, dashboards, samples |
| Route vendor/category | SELECT | Vendor categories |
| Infer coverage gaps | INFER | Matrix and signal/boundary mapping |
| Validate meta-observability | VALIDATE | Clock, cardinality, retention, health |
| Write guidance/config | WRITE | OaC/config/docs when requested |
| Notify result | NOTIFY | Routed recommendation |
1. Classify intent: setup, migrate, investigate, alert, trace, tune, or route.
2. Select layer/boundary/signal coverage from `resources/matrix.md`.
3. Load the specific vendor, transport, incident, or signal guide before producing guidance.
When CNCF/vendor status is load-bearing, verify live state at https://landscape.cncf.io.
| Scope | Resource target |
|---|---|
CODEBASE | Observability config, dashboards, alert rules, instrumentation |
LOCAL_FS | Resource guides and generated docs |
NETWORK | Vendor/CNCF status and telemetry backends when checked |
USER_DATA | Incident symptoms, logs, metrics, traces, profiles |
resources/vendor-categories.md; do not duplicate their documentationX-Amzn-Trace-Id, GCP Cloud Trace, Datadog, Cloudflare, Linkerd) via boundaries/cross-application.mdas of 2026-Q2; verify live status at https://landscape.cncf.ioThe combinations below are outside this skill's boundary. The external tools listed are authoritative for each domain.
| Domain | External tools |
|---|---|
| LLM ops / gen_ai observability | Langfuse, Arize Phoenix, LangSmith, Braintrust |
| Data pipeline lineage | OpenLineage + Marquez, dbt test, Apache Airflow lineage |
| L1/L2 physical / datacenter hardware | Nlyte, Sunbird, Device42; SNMP exporters where Prometheus bridge is needed |
| L5 Session / L6 Presentation full TLS inspection | Wireshark (packet-level), Cloudflare Radar (TLS ecosystem data), vendor TLS inspection tooling |
| Chaos engineering orchestration | Chaos Mesh, Litmus, Gremlin, ChaosToolkit |
| GPU / AI infra (DCGM, NVIDIA) | NVIDIA DCGM Exporter + Prometheus; OTel GPU semconv (Development, not production-ready) |
| Software supply chain (SBOM, attestation) | sigstore (cosign / rekor), in-toto framework, SLSA level attestations |
| Incident response workflow (paging, rotation) | PagerDuty, OpsGenie, Grafana OnCall |
| Fluentd (primary tool) | Deprecated CNCF 2025-10 — use Fluent Bit or OTel Collector |
User / Other Skill Query
|
v
+-----------------------------+
| Intent Classifier |
| setup | migrate | investigate
| alert | trace | tune | route|
+-----------------------------+
|
v
+-----------------------------+
| Vendor Router |
| category-first delegation |
+-----------------------------+
|
v
+-----------------------------+
| vendor-categories.md |
| (a) OSS Full-Stack |
| (b) Commercial SaaS APM |
| (c) High-Cardinality |
| (d) Profiling Specialist |
| (e) SIEM / Enterprise Logs|
| (f) FinOps / Cost |
| (g) Feature Flags/Rollout |
| (h) Log Pipeline |
| (i) Time Series Storage |
| (j) Crash Analytics |
+-----------------------------+
|
v
+-----------------------------+
| Matrix Coverage Selector |
| 4 Layers x 4 Boundaries |
| x 7 Signals = 112 cells |
+-----------------------------+
|
v
+-----------------------------+
| Transport Depth / |
| Meta-observability |
| UDP, OTLP, Collector, |
| cardinality, clock skew |
+-----------------------------+
|
v
+-----------------------------+
| Incident Forensics |
| 6-dim localization: |
| code/service/layer/host/ |
| region/infra |
+-----------------------------+
Layers (4): L3-network, L4-transport, mesh, L7-application Boundaries (4): multi-tenant, cross-application, slo, release Signals (7): metrics, logs, traces, profiles, cost, audit, privacy
See resources/matrix.md for the full 112-cell coverage map with N/A markers for invalid combinations.
| Intent | Primary target | Fallback |
|---|---|---|
setup | resources/vendor-categories.md → vendor-owned skill | Generic OTel semconv in resources/standards.md |
migrate | CNCF 2025-10 guide + resources/vendor-categories.md §(h) | OTel Collector bridge config |
investigate | resources/incident-forensics.md (MRA + 6-dim localization) | signals/traces.md + signals/logs.md |
alert | boundaries/slo.md (burn-rate alert rules) | resources/observability-as-code.md |
trace | boundaries/cross-application.md (propagator matrix) | layers/mesh.md (zero-code auto-instrumentation) |
tune | transport/ (4 files: UDP/MTU, OTLP, topology, sampling) | resources/meta-observability.md (cardinality guardrails) |
route | boundaries/multi-tenant.md + transport/collector-topology.md | boundaries/cross-application.md (data residency) |
Standalone:
/oma-observability "set up OTel stack on Kubernetes"
/oma-observability --migrate "move from Fluentd to Fluent Bit"
/oma-observability --investigate "5xx spike in ap-northeast-2"
/oma-observability --alert "configure SLO burn-rate alert for checkout API"
/oma-observability --trace "W3C propagator across AWS + GCP boundary"
/oma-observability --tune "UDP statsd MTU throughput limit"
/oma-observability --route "multi-tenant log isolation with data residency"
Shared invocation (from other skills):
setup | migrate | investigate | alert | trace | tune | routeFollow resources/execution-protocol.md step by step.
See resources/examples.md for end-to-end walkthroughs.
Use resources/intent-rules.md for intent classification reference.
Use resources/matrix.md for coverage navigation across layers, boundaries, and signals.
Use resources/vendor-categories.md for vendor delegation and category selection.
Before submitting, run resources/checklist.md.
Integration status (2026-Q2): rows below describe recommended handoff patterns from the oma-observability side. As of this version, reciprocal cross-references from the other skills' SKILL.md files are not yet in place — this is a v1.1 follow-up item. Users invoking the other skills directly will need to surface this integration manually until the reciprocal links land.
| Skill | Integration point | Reciprocal link status |
|---|---|---|
oma-debug | On failure: pull traces + logs by request_id → trigger resources/incident-forensics.md 6-dim localization playbook | ⏳ pending (v1.1) |
oma-qa | Canary post-deploy loop via chrome-devtools MCP: console errors + Core Web Vitals trend; INP/LCP/CLS from layers/L7-application/web-rum.md | ⏳ pending (v1.1) |
oma-tf-infra | Terraform modules for OTel Collector, Grafana, and Loki stack provisioning | ⏳ pending (v1.1) |
oma-scm | Deployment SHA → service.version OTel attribute + release marker events; see boundaries/release.md | ⏳ pending (v1.1) |
oma-backend | Propagator and baggage rules cross-referenced in backend.md ruleset; DB N+1 + Kafka patterns in signals/traces.md | ⏳ pending (v1.1) |
oma-frontend | layers/L7-application/web-rum.md INP/LCP/CLS checklist cross-referenced in frontend.md ruleset | ⏳ pending (v1.1) |
oma-mobile | layers/L7-application/mobile-rum.md offline-queuing pattern cross-referenced in mobile.md ruleset | ⏳ pending (v1.1) |
oma-db | signals/traces.md DB patterns (N+1, connection pool) cross-referenced in database.md ruleset | ⏳ pending (v1.1) |
otel_spec / otel_semconv keys in each file's frontmatter document the assumed version. If content depends on a specific attribute stability tier, the tier is stated inline.resources/standards.md and the affected file, bump minor version.resources/standards.md.vendor-categories.md (Graduated / Archived / acquired) → update the vendor table.https://landscape.cncf.io for CNCF project status. This skill does not promise to track it on any schedule — verify at use time if the information is load-bearing.last_reviewed / next_review frontmatter. Those were removed because no automated enforcement exists; relying on voluntary manual review produces stale stamps that misrepresent currency. Git history (git log path/to/file) is the source of truth for when a file was last changed.docs/plans/designs/005-oma-observability.md §Ownership. CTO co-signs changes to standards.md, matrix.md, anti-patterns.md.resources/checklist.md §1 Setup validation before merging.resources/execution-protocol.mdresources/intent-rules.mdresources/matrix.mdresources/standards.mdresources/vendor-categories.mdresources/incident-forensics.mdresources/meta-observability.mdresources/observability-as-code.mdresources/anti-patterns.mdresources/checklist.mdresources/examples.mdresources/transport/udp-statsd-mtu.mdresources/transport/otlp-grpc-vs-http.mdresources/transport/collector-topology.mdresources/transport/sampling-recipes.mdresources/layers/L3-network.mdresources/layers/L4-transport.mdresources/layers/mesh.mdresources/layers/L7-application/web-rum.mdresources/layers/L7-application/mobile-rum.mdresources/layers/L7-application/crash-analytics.mdresources/boundaries/multi-tenant.mdresources/boundaries/cross-application.mdresources/boundaries/slo.mdresources/boundaries/release.mdresources/signals/metrics.mdresources/signals/logs.mdresources/signals/traces.mdresources/signals/profiles.mdresources/signals/cost.mdresources/signals/audit.mdresources/signals/privacy.md