From dm-game
Use when designing how audio communicates game state, creates emotion, and serves as a feedback system. Activate for sound effect design, adaptive music systems, spatial audio, ambient soundscapes, audio priority and ducking, emotional audio design, and audio accessibility. Also activate when audio feels disconnected from gameplay, when players routinely play on mute (a design failure signal), when evaluating whether the audio stack has appropriate layering and variation, or when planning how music responds to gameplay context. This skill treats audio as a design system — not audio engineering or production, but the intentional design of what players hear, when, and why. Covers the full audio stack from music and ambience through sound effects, UI audio, and voice.
npx claudepluginhub rbergman/dark-matter-marketplace --plugin dm-gameThis skill uses the workspace's default tool permissions.
Audio is the fastest feedback channel in a game. Players process sound faster than
Searches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.
Searches prompts.chat for AI prompt templates by keyword or category, retrieves by ID with variable handling, and improves prompts via AI. Use for discovering or enhancing prompts.
Guides MCP server integration in Claude Code plugins via .mcp.json or plugin.json configs for stdio, SSE, HTTP types, enabling external services as tools.
Audio is the fastest feedback channel in a game. Players process sound faster than visual information — audio tells them what is happening before they see it, behind them, and outside their field of view. A game where players mute the audio and lose nothing has failed at audio design.
This skill covers audio as a design system: deciding what sounds exist, when they play, how they relate to gameplay state, and how they serve the player's understanding of the game world. Production concerns (recording, mastering, compression formats) are out of scope.
Sound is not decoration — it is an information channel that operates independently from the visual field. Design audio with the same rigor applied to any other game system.
What audio communicates that visuals cannot:
| Advantage | Example |
|---|---|
| 360-degree awareness | Footsteps behind the player |
| Off-screen events | Explosion around a corner |
| Anticipation/warning | Audio wind-up before an attack lands |
| Emotional context | Minor-key shift signaling danger |
| Confirmation | Satisfying click confirming a successful action |
| Pacing | Tempo change signaling urgency |
The mute test: If a player can play the game on mute with no loss of information or enjoyment, the audio design is not carrying its weight. Every audio element should either convey information, create emotion, or both. Purely decorative audio that does neither is wasted bandwidth in the mix.
Audio architecture uses independent layers, each serving a distinct purpose. Every layer must be independently controllable — players need per-layer volume sliders, not just a master volume knob.
Purpose: Emotional context, tension management, pacing.
Music tells the player how to feel about what is happening. It operates on slower timescales than other audio — setting mood across minutes rather than reacting frame by frame.
Purpose: Spatial context, atmosphere, world presence.
Ambience is the audio foundation that makes a space feel real. Without it, environments feel sterile regardless of visual quality.
Purpose: Game state feedback, player action confirmation.
Sound effects are the primary feedback channel for moment-to-moment gameplay. They confirm actions, communicate results, and signal state changes.
Purpose: Interface feedback, notifications, system communication.
UI audio confirms interface actions and draws attention to important system events. It should be clean, consistent, and never fatiguing.
Purpose: Dialogue, barks, callouts, narrative delivery.
Voice is the most attention-demanding audio layer. When voice plays, everything else should defer.
Every significant player action needs an audio response. Missing audio feedback creates a disconnect that players feel even if they cannot articulate it.
Match visual weight. A massive explosion with a thin pop sound creates cognitive dissonance. A small UI toggle with a booming thud feels wrong. Audio and visual weight must be proportional.
Immediate onset. Audio feedback should begin within 1–2 frames of the corresponding visual onset. The perception-action cycle depends on audio and visual alignment — see game-feel for timing reference tables. Late audio feels sluggish; early audio feels disconnected.
Priority ordering:
| Priority | Category | Rationale |
|---|---|---|
| 1 | Player actions | The player must hear confirmation of their own inputs |
| 2 | Enemy feedback | Threats need clear audio communication |
| 3 | Environment | World interaction supports immersion |
| 4 | Ambience | Atmosphere fills gaps but yields to gameplay |
Any sound that plays more than twice within a 10-second window needs at minimum 3 variants. Repetition fatigue is one of the fastest ways to make audio annoying.
Variation techniques:
Complex actions deserve composite audio built from multiple layers:
Sword hit = impact layer + material layer + environment layer
= [metal clang] + [flesh thud] + [room reverb tail]
Each layer can be swapped independently — the same impact with a different material layer communicates hitting stone vs. wood vs. armor.
Static music that ignores gameplay state is a missed opportunity. Music should respond to what the player is experiencing.
Define discrete music states tied to gameplay context:
| State | Character | Typical Triggers |
|---|---|---|
| Exploration | Calm, open, melodic | No threats, traversal, discovery |
| Tension | Building, uneasy, rhythmic | Threat proximity, low resources, time pressure |
| Combat | Intense, driving, percussive | Active combat engagement |
| Resolution | Release, exhale, thinning layers | Combat end, threat eliminated |
| Victory | Triumphant, ascending, major key | Boss defeat, objective complete |
| Defeat | Somber, descending, sparse | Player death, mission failure |
| Technique | Description | Best For |
|---|---|---|
| Crossfade | Blend out current, blend in next | Gradual mood shifts |
| Stinger-to-track | Short musical hit bridges to new track | Sudden state changes |
| Horizontal re-sequencing | Same tempo, swap musical layers | Intensity changes within a state |
| Vertical layering | Add/remove instrument layers | Smooth intensity scaling |
The most flexible adaptive music approach uses vertical layering:
Layers add or remove based on a gameplay intensity value (0.0–1.0). This produces smooth, musically coherent transitions because all layers share tempo and key.
Sound in physical space provides directional information that the visual field cannot. In any game with combat or threats, players should be able to locate dangers by audio alone.
Distance affects more than just volume:
| Distance | Volume | Frequency Content | Character |
|---|---|---|---|
| Close (0–5m) | Full | Full spectrum | Detailed, present |
| Medium (5–20m) | Reduced | High-frequency rolloff begins | Clear but receding |
| Far (20–50m) | Low | Significant HF loss | Muffled, ambient |
| Distant (50m+) | Minimal | Low frequencies only | Atmospheric, rumble |
Reverb and reflection communicate space:
| Environment | Reverb Character | Decay Time |
|---|---|---|
| Small room | Tight, short reflections | 0.3–0.8s |
| Large hall | Wide, diffuse | 1.5–3.0s |
| Cave | Dense, colored | 2.0–5.0s |
| Outdoor open | Minimal reverb, natural echo | 0.1–0.5s |
| Underwater | Heavy low-pass, long decay | 3.0–6.0s |
Design rule: In any game with combat, a blindfolded player using only audio should be able to determine threat direction and approximate distance.
When many sounds compete for the player's attention, the mix becomes noise. A priority system ensures the most important audio always cuts through.
| Priority | Examples | Mix Behavior |
|---|---|---|
| Critical | Player damage, death, critical alert | Ducks everything else |
| High | Combat impacts, dialogue, important callouts | Ducks ambient and music |
| Medium | Footsteps, environmental, UI feedback | Normal mix level |
| Low | Ambient loops, distant events, decorative | Ducked by all higher priorities |
Define a maximum number of simultaneous sounds the game will play. For most games, 16–32 concurrent voices is a practical budget.
When the budget is exceeded:
Music and sound are direct emotional tools. Used intentionally, they shape how the player feels moment to moment.
| Tool | Effect | Application |
|---|---|---|
| Minor key/mode | Tension, sadness, unease | Danger zones, loss, mystery |
| Major key/mode | Triumph, safety, joy | Victory, home areas, discovery |
| Tempo increase | Urgency, excitement, anxiety | Chase sequences, time pressure |
| Tempo decrease | Calm, contemplation, dread | Safe zones, ominous slow build |
| Dissonance | Wrongness, horror, instability | Corruption, madness, glitch |
| Drone/pad sustain | Sustained tension, unease | Ambient threat, anticipation |
These are simplified associations — context and execution matter more than formula. A major-key piece at slow tempo can be deeply melancholic. Use these as starting points, not rules.
Short musical hits that mark significant moments:
Stingers should be brief (0.5–2 seconds) and distinctive enough to be recognizable after a few exposures.
Recurring musical themes create emotional associations over time:
After sustained intensity (30+ seconds of high-energy music and dense SFX):
The silence amplifies whatever follows it. This technique is most powerful when used sparingly — if every combat ends with silence, it loses impact.
Audio design must account for players with varying hearing ability. Reference accessibility-design for the complete accessibility framework — this section covers audio-specific measures.
Use this checklist when evaluating a game's audio design:
Both mute-test items should pass. The first ensures accessibility; the second confirms the audio design is contributing real value.
| Anti-Pattern | Problem | Fix |
|---|---|---|
| Uniform volume | Everything equally loud — no hierarchy, just noise | Implement priority and ducking system |
| Missing action feedback | Player acts but hears nothing — feels unresponsive | Audit all player actions for audio coverage |
| Static music | Music ignores gameplay state — emotionally disconnected | Implement adaptive music with state transitions |
| Repetition fatigue | Same sound plays unchanged hundreds of times | Add 3+ variants, pitch randomization |
| Audio-only critical info | Essential information has no visual backup | Add visual redundancy for all critical audio |
| Fighting the mix | Too many sounds at full volume simultaneously | Enforce polyphony budget and priority ducking |
| Abrupt music cuts | Music stops mid-phrase on state change — sounds broken | Transition at phrase boundaries, use crossfades |
| No silence | Constant wall of sound with no breathing room | Design intentional quiet moments between intensity |
| One volume slider | Only master volume, no per-layer control | Provide per-layer sliders (music, SFX, voice, ambience) |