From agent-almanac
Conducts structured neutral observation of codebases, systems, behaviors, or reasoning to record patterns, categorize findings, and hypothesize. Use for unclear issues, unknown root causes, change effects, or bias audits.
npx claudepluginhub pjt222/agent-almanac

This skill uses the workspace's default tool permissions.
---
Conduct a structured observation session — framing the observation target, witnessing with sustained neutral attention, recording patterns without interpretation, categorizing findings, generating hypotheses from patterns, and archiving the observations for future reference.
Use this skill when learn has built a model that needs validation through observation of the system in action.

Step 1: Frame. Define what is being observed, why, and from what perspective.
Observation Protocol by System Type:
┌──────────────────┬──────────────────────────┬──────────────────────────┐
│ System Type      │ What to Observe          │ Categories to Watch      │
├──────────────────┼──────────────────────────┼──────────────────────────┤
│ Codebase         │ File structure, naming   │ Patterns, anti-patterns, │
│                  │ conventions, dependency  │ consistency, dead code,  │
│                  │ flow, test coverage,     │ documentation quality,   │
│                  │ error handling patterns  │ coupling between modules │
├──────────────────┼──────────────────────────┼──────────────────────────┤
│ User behavior    │ Question patterns,       │ Expertise signals, pain  │
│                  │ vocabulary evolution,    │ points, unstated needs,  │
│                  │ repeated requests,       │ learning trajectory,     │
│                  │ emotional signals        │ communication style      │
├──────────────────┼──────────────────────────┼──────────────────────────┤
│ Tool / API       │ Response patterns, error │ Rate limits, edge cases, │
│                  │ conditions, latency,     │ undocumented behavior,   │
│                  │ output format variations │ state dependencies       │
├──────────────────┼──────────────────────────┼──────────────────────────┤
│ Own reasoning    │ Decision patterns, tool  │ Biases, habits, blind    │
│                  │ selection habits, error  │ spots, strengths,        │
│                  │ recovery approaches,     │ recurring failure modes, │
│                  │ communication patterns   │ over/under-confidence    │
└──────────────────┴──────────────────────────┴──────────────────────────┘
Expected: A clear frame that directs attention without constraining it. The observer knows where to look and what categories to sort observations into, but remains open to the unexpected.
On failure: If the observation target is too broad ("observe everything"), narrow to one subsystem or one behavior pattern. If the target is too narrow ("observe this one variable"), zoom out to the surrounding context — the interesting patterns are often at the edges.
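As a sketch only, the frame produced by this step could be captured in a small record; the `ObservationFrame` name, its fields, and the example values are illustrative assumptions, not part of the skill:

```python
from dataclasses import dataclass, field

@dataclass
class ObservationFrame:
    """Directs attention without constraining it.
    All field names here are illustrative, not prescribed by the skill."""
    target: str       # one subsystem or behavior pattern, not "everything"
    purpose: str      # why this observation session is happening
    perspective: str  # the vantage point the observer adopts
    categories: list[str] = field(default_factory=list)  # sorting bins, held loosely

# Hypothetical example frame for a codebase observation session.
frame = ObservationFrame(
    target="error handling in the payments module",
    purpose="validate the model built by a previous learn run",
    perspective="maintainer reading the code cold",
    categories=["patterns", "anti-patterns", "consistency", "dead code"],
)
```

Keeping the categories in a plain list, rather than as required slots, preserves the openness to the unexpected that the frame calls for.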
Step 2: Witness. Hold attention on the observation target without interpreting, judging, or intervening.
Expected: A collection of raw observations — specific, concrete, and free from interpretation. Observations read like field notes: "File X imports Y but does not use function Z. File A has 300 lines; file B has 30 lines and covers similar functionality."
On failure: If observation immediately triggers analysis ("this is wrong because..."), the analytical habit is overriding the observational stance. Consciously separate the phases: write the observation as a fact, then write the interpretation as a separate note labeled "hypothesis." If neutrality is impossible (strong reaction to what is observed), note the reaction itself as data: "I noticed strong concern when observing X — this may indicate a significant issue or may indicate my bias."
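For the codebase case, neutral witnessing can be sketched as a walk that records only countable facts; the `witness_codebase` helper and its note format are hypothetical, not defined by the skill:

```python
import os

def witness_codebase(root: str) -> list[str]:
    """Record raw, verifiable facts about Python source files -- no judgments.
    Helper name and field-note format are illustrative assumptions."""
    notes = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            if not name.endswith(".py"):
                continue
            path = os.path.join(dirpath, name)
            with open(path, encoding="utf-8", errors="replace") as fh:
                lines = fh.readlines()
            imports = [ln.strip() for ln in lines
                       if ln.lstrip().startswith(("import ", "from "))]
            # Field-note style: concrete and checkable, nothing evaluative.
            notes.append(f"{path}: {len(lines)} lines, {len(imports)} import statements")
    return notes
```

Anything evaluative ("too many imports") is deliberately left out; interpretation belongs in a later step.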
Step 3: Record. Transcribe observations into a structured format while they are fresh.
Expected: A structured record of 5-15 discrete observations, each with specific evidence. The record should be detailed enough that another observer could verify each observation independently.
On failure: If observations are too abstract ("the code seems messy"), they need grounding in specifics — which files, which patterns, what makes it messy? If observations are too granular ("line 47 has a space before the brace"), zoom out to the pattern level — is this a one-off or a systemic issue?
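A minimal sketch of one such structured record, grounding each observation in evidence another observer could check; the `FieldNote` name and its fields are assumptions for illustration:

```python
from dataclasses import dataclass

@dataclass
class FieldNote:
    """One discrete observation plus the evidence that grounds it.
    Field names are illustrative, not prescribed by the skill."""
    observation: str    # a verifiable fact, stated neutrally
    evidence: str       # how another observer could verify it independently
    category: str = ""  # left blank here; assigned during categorization

# Hypothetical note in the style of the field notes described above.
note = FieldNote(
    observation="File A has 300 lines; file B has 30 lines and covers similar functionality",
    evidence="line counts from both files; compare their exported symbols",
)
```

Separating `observation` from `evidence` makes the "another observer could verify it" expectation a structural property rather than a discipline.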
Step 4: Categorize. Sort observations into meaningful categories without yet explaining them.
Expected: A categorized observation map with clear groupings. Each category has specific observations supporting it. The map shows both patterns and gaps.
On failure: If categorization feels forced, the observations may not have natural groupings — they may be a collection of unrelated findings, which is itself a finding (the system may lack coherent structure). If everything fits neatly into one category, the observation scope was too narrow — zoom out.
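The sorting step can be sketched as a grouping that keeps empty categories visible as gaps and keeps poor fits visible as leftovers; the `categorize` signature is an illustrative assumption:

```python
from collections import defaultdict

def categorize(observations: list[tuple[str, str]],
               categories: list[str]) -> dict[str, list[str]]:
    """Group (category, note) pairs under the planned categories.
    Empty categories stay in the map as gaps; notes that fit no planned
    category are kept under their own keys rather than forced in."""
    groups: dict[str, list[str]] = {c: [] for c in categories}
    overflow = defaultdict(list)  # observations outside the planned categories
    for category, note in observations:
        (groups if category in groups else overflow)[category].append(note)
    # A pile of leftovers -- or a map of empties -- is itself a finding.
    groups.update(overflow)
    return groups
```

If most notes land in the overflow, the observations may lack natural groupings, which the step above treats as a finding in its own right.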
Step 5: Hypothesize. Now — and only now — begin interpreting the observations.
Expected: 2-4 hypotheses that explain the major patterns, each supported by specific observations. At least one hypothesis should be surprising or contrarian. The distinction between observation and interpretation is maintained — it is clear which parts are data and which are theory.
On failure: If no hypotheses form, the observations may need more time to accumulate — return to Step 2. If too many hypotheses form (everything is "maybe"), select the 2-3 with the strongest evidence and set the rest aside. If only obvious hypotheses form, force a contrarian view: "What if the opposite were true?"
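One way to keep the data/theory boundary explicit is to bind each hypothesis to the verbatim observations that support it; the `Hypothesis` record and its fields are illustrative, not part of the skill:

```python
from dataclasses import dataclass

@dataclass
class Hypothesis:
    """One interpretation, kept separate from the data it rests on.
    Field names and the example below are illustrative assumptions."""
    statement: str
    supporting_observations: list[str]  # verbatim field notes, never paraphrased
    confidence: str                     # e.g. "low" / "medium" / "high"

h = Hypothesis(
    statement="Error handling diverged after the module was split",
    supporting_observations=[
        "payments/core.py wraps I/O in try/except; payments/api.py does not",
    ],
    confidence="medium",
)
```

A hypothesis whose `supporting_observations` list is empty is a guess, not a finding, and the "strongest evidence" filter in the failure note above can be applied by simply counting that list.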
Step 6: Archive. Preserve the observations and hypotheses for future reference.
Expected: An archive that future observation sessions can build on. The archive distinguishes clearly between observations (data) and hypotheses (interpretation). It is honest about confidence levels and gaps.
On failure: If the observations do not feel worth archiving, they may have been too shallow — or they may be genuinely routine (not every observation session produces insights). Archive even negative results: "Observed X and found no anomalies" is useful future context.
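A minimal archive writer, assuming a markdown file with separate data and interpretation sections; the filename, headings, and fallback lines are illustrative choices, not prescribed by the skill:

```python
def archive_session(observations: list[str], hypotheses: list[str],
                    path: str = "observations.md") -> None:
    """Write a session archive that keeps data and interpretation apart.
    Negative results are archived too, via the fallback lines below."""
    lines = ["# Observation session", "", "## Observations (data)"]
    lines += [f"- {o}" for o in observations] or ["- No anomalies observed."]
    lines += ["", "## Hypotheses (interpretation)"]
    lines += [f"- {h}" for h in hypotheses] or ["- None formed this session."]
    with open(path, "w", encoding="utf-8") as fh:
        fh.write("\n".join(lines) + "\n")
```

Because the fallback lines are written even when a list is empty, a routine session still leaves the "observed X and found no anomalies" context the step above asks for.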
Related skills:
observe-guidance — the human-guidance variant for coaching a person in systematic observation
learn — observation feeds learning by providing raw data for model-building
listen — outward-focused attention toward user signals; observation is broader-scope attention toward any system
remote-viewing — intuitive exploration that can be validated through systematic observation
meditate — develops the sustained attention capacity that observation requires
awareness — threat-focused situational awareness; observation is curiosity-driven rather than defense-driven