From xiaohongshu-complete-skills
Guides audio processing workflows for Xiaohongshu content: recording setup, voiceover editing, noise reduction, volume leveling, mixing, and platform optimization.
npx claudepluginhub vivy-yi/xiaohongshu-skills --plugin xiaohongshu-complete-skillsThis skill uses the workspace's default tool permissions.
Audio processing encompasses recording, editing, enhancing, and optimizing audio content for Xiaohongshu posts, ensuring professional sound quality that significantly enhances content professionalism, viewer retention, and overall production value. Poor audio is the #1 reason viewers abandon content within seconds - even with stunning visuals, bad audio makes content unwatchable. This skill cov...
Guides Next.js Cache Components and Partial Prerendering (PPR) with cacheComponents enabled. Implements 'use cache', cacheLife(), cacheTag(), revalidateTag(), static/dynamic optimization, and cache debugging.
Guides building MCP servers enabling LLMs to interact with external services via tools. Covers best practices, TypeScript/Node (MCP SDK), Python (FastMCP).
Generates original PNG/PDF visual art via design philosophy manifestos for posters, graphics, and static designs on user request.
Audio processing encompasses recording, editing, enhancing, and optimizing audio content for Xiaohongshu posts, ensuring professional sound quality that significantly enhances content professionalism, viewer retention, and overall production value. Poor audio is the #1 reason viewers abandon content within seconds - even with stunning visuals, bad audio makes content unwatchable. This skill covers the complete audio production workflow: from recording setup through editing, noise reduction, mixing, and final optimization for Xiaohongshu's platform specifications.
Key insight: Viewers will forgive mediocre video quality, but they will not tolerate poor audio. Investing in audio processing yields 50%+ improvements in viewer retention and 3-5x increases in engagement rates. Professional audio transforms amateur content into credible, trustworthy content.
Use when:
Do NOT use when:
Before (poor audio quality): ❌ "Background noise, room echo, distractions" ❌ "Inconsistent volume, too quiet then too loud" ❌ "Viewer adjusts volume constantly, gives up" ❌ "Content seems amateur, low credibility" ❌ "Viewers scroll away within 3 seconds"
After (professional audio): ✅ "Clean, clear voice recording" ✅ "Consistent volume levels throughout" ✅ "Pleasant listening experience, no adjustments needed" ✅ "Content feels professional, trustworthy" ✅ "Viewers watch complete content, high engagement"
6 Essential Audio Processing Elements:
| Element | Purpose | Quality Impact | Priority |
|---|---|---|---|
| Clean Recording | Prevent issues at source | Critical | #1 - cannot fix in post |
| Noise Reduction | Remove background distractions | High | #2 - most common issue |
| Volume Normalization | Consistent listening levels | High | #3 - prevents frustration |
| EQ & Clarity | Enhance voice intelligibility | Medium-High | #4 - professional polish |
| Music & Effects | Add emotional depth | Medium | #5 - enhance, don't distract |
| Platform Optimization | Meet technical specs | Medium | #6 - avoid compression artifacts |
Audio Processing Software Comparison:
| Tool | Best For | Skill Level | Cost | Platform | Key Features |
|---|---|---|---|---|---|
| Audacity | Basic editing, noise reduction | Beginner | Free | Win/Mac/Linux | Noise gate, normalize, EQ |
| Adobe Audition | Professional production | Intermediate-Advanced | Paid (Subscription) | Win/Mac | Multitrack, advanced repair, batch processing |
| GarageBand | Mac users, simple editing | Beginner | Free (Mac) | macOS | Built-in effects, music loops, easy interface |
| Descript | Text-based editing, podcasts | Beginner-Intermediate | Paid (Freemium) | Web/Mac/Win | Edit audio like text, overdub, filler removal |
| Logic Pro | Music production, advanced editing | Advanced | Paid (One-time) | macOS | Professional DAW, massive library |
| Reaper | Power users, customization | Advanced | Paid (Free trial) | Win/Mac/Linux | Lightweight, extensible, affordable |
Xiaohongshu Audio Specifications:
Quick Audio Fixes (by symptom):
| Symptom | Likely Cause | Quick Fix |
|---|---|---|
| Background hiss/hum | Room noise, equipment hiss | Noise reduction filter |
| Room echo/reverb | Recording in untreated room | Move closer to mic, use de-reverb plugin |
| Volume too low | Recording level too low | Gain/normalize to -3dB peak |
| Distorted/clipping | Recording level too high | Reduce gain, use clip restoration |
| Muffled sound | Poor mic quality or wrong EQ | High-pass filter + EQ boost |
| Inconsistent levels | Multiple clips or variable distance | Compression + normalization |
Prevention is better than correction - capturing clean audio at source saves hours of editing and yields better results than any post-processing.
Microphone Selection:
| Mic Type | Best For | Pros | Cons | Price Range |
|---|---|---|---|---|
| USB Mic | Beginners, simplicity | Plug-and-play, easy | Limited quality, no upgrades | ¥200-800 |
| Dynamic XLR | Voice recording, noisy rooms | Rejects room noise, durable | Quiet, need preamp | ¥500-2000 |
| Condenser XLR | Studio recording, vocals | Detailed, professional | Sensitive to room noise | ¥800-5000 |
| Lavalier (Lapel) | Video, talking head | Hands-free, close to mouth | Visible in shot, can rub on clothes | ¥100-500 |
| Shotgun | Interviews, outdoor | Directional, outdoor use | Expensive, need operator | ¥1000-8000 |
Environment Setup:
Recording Levels:
Recording Checklist:
Importing and Organizing:
Trimming and Arranging:
Basic Editing Techniques:
| Technique | How | Why |
|---|---|---|
| Cut/Copy/Paste | Select region, edit menu | Remove mistakes, reorder content |
| Split | Cut at cursor point | Separate sections for independent editing |
| Trim | Remove selected region | Quickly cut ends or mistakes |
| Fade In/Out | Apply fade to clip start/end | Smooth transitions, avoid abrupt starts/ends |
| Crossfade | Overlap clips with transition | Seamless joins between audio segments |
Edit Best Practices:
Identify Noise Types:
| Noise | Character | Removal Method | Difficulty |
|---|---|---|---|
| Hiss | Steady high-frequency noise | Noise reduction plugin | Easy |
| Hum | Low-frequency electrical buzz (50/60Hz) | High-pass filter or notch filter | Easy |
| Room reverb | Echoey, cavernous sound | De-reverb plugin or reduce room noise | Medium |
| Clicks/pops | Sharp sudden sounds | Click removal plugin | Medium |
| Wind noise | Low-frequency rumble | High-pass filter + wind reduction | Medium-Hard |
| Static/crackle | Continuous crackling | Noise reduction + de-crackle | Hard |
Noise Reduction Workflow (using Audacity as example):
Step 1: Capture Noise Profile
Step 2: Apply Noise Reduction
Step 3: Fine-Tune
Alternative Noise Reduction Methods:
Consistent volume is critical - viewers should never have to adjust their volume.
Leveling Techniques (in order of application):
1. Normalization (simple, fixes overall level):
2. Compression (evens out dynamics):
3. Limiting (prevents clipping):
Compression Quick Settings by Use Case:
| Use Case | Ratio | Threshold | Attack | Release |
|---|---|---|---|---|
| Spoken word (tutorial) | 2:1 | -18dB | 10ms | 200ms |
| Narration (documentary) | 3:1 | -15dB | 5ms | 150ms |
| Podcast (conversation) | 2.5:1 | -16dB | 8ms | 250ms |
| Emotional/intimate | 1.5:1 | -20dB | 15ms | 300ms |
| Energetic/promo | 4:1 | -12dB | 3ms | 100ms |
Equalization (EQ) shapes tone - making voice sound clear, professional, and pleasant.
Voice EQ Basics:
| Frequency | Effect on Voice | When to Adjust |
|---|---|---|
| Below 80Hz | Low rumble, room noise | Cut completely for voice (high-pass filter) |
| 80-200Hz | Warmth, body | Boost slightly for thin voices, cut for muddy |
| 200-500Hz | Fullness, presence | Leave mostly flat |
| 500Hz-2kHz | Intelligibility, clarity | Boost slightly (+1-3dB) if voice is dull |
| 2kHz-6kHz | Definition, clarity | Boost (+2-4dB) to make voice "pop" |
| 6kHz-12kHz | Air, brilliance, sibilance | Cut S-heavy voices at 7kHz, boost for "air" |
| Above 12kHz | Ultra-highs, hiss | Cut if hissy, leave if clear |
Simple Voice EQ Recipe (works for 80% of recordings):
De-Essing (taming harsh S and T sounds):
Music enhances emotion but should never compete with voice.
Music Selection Principles:
Leveling Voice vs. Music:
| Content Type | Voice Level | Music Level | Ratio |
|---|---|---|---|
| Tutorial/education | -6dB to -3dB | -20dB to -18dB | 12-15dB difference |
| Narration/story | -6dB to -3dB | -16dB to -14dB | 10-12dB difference |
| Emotional/intimate | -8dB to -6dB | -22dB to -20dB | 14-16dB difference |
| High-energy promo | -3dB to 0dB | -12dB to -10dB | 10-12dB difference |
Music Mixing Workflow:
Sound Effects (SFX):
Export Settings for Xiaohongshu:
| Setting | Recommended | Why |
|---|---|---|
| Format | AAC (.m4a) or MP3 | Best compression quality |
| Sample Rate | 44.1kHz or 48kHz | Match source rate |
| Bitrate | 192 kbps (stereo) or 128 kbps (mono) | Balance quality and file size |
| Channels | Stereo or Mono | Mono fine for voice-only |
| Loudness | -16 LUFS | Streaming platform standard |
Export Quality Comparison:
| Bitrate | File Size (1 min) | Quality | Use Case |
|---|---|---|---|
| 128 kbps | ~1 MB | Good | Voice-only,节省流量 |
| 192 kbps | ~1.5 MB | Very Good | Recommended for most content |
| 256 kbps | ~2 MB | Excellent | Music-heavy or audiophile content |
| 320 kbps | ~2.5 MB | Best | Overkill for social media |
Final Checklist Before Export:
Quality Control Testing:
| Mistake | Why It's Wrong | Fix |
|---|---|---|
| Recording in noisy room | Noise reduction can't fix everything, artifacts result | Record in quietest space, treat room with blankets |
| Mic too far from mouth | Room echo increases, voice-to-noise ratio decreases | Move 6-12 inches from mic, use pop filter |
| Recording level too low | Boosting in post amplifies noise floor | Aim for -12dB to -6dB average |
| Recording level too hot | Distortion/clipping is permanent and unfixable | Leave headroom, peak around -6dB |
| Over-applying noise reduction | Audio sounds robotic, underwater artifacts | Use light passes (6-12dB), not heavy (20dB+) |
| No compression on voice | Inconsistent volume, whisper-quiet then too-loud | Apply 2:1 to 4:1 compression |
| Music too loud | Distracts from voice, makes content hard to follow | Duck music 12-15dB below voice |
| Too much high-frequency EQ | Harsh, ear-fatiguing, sibilance amplified | Cut 7kHz region, boost 3kHz instead |
| Exporting at wrong bitrate | Either poor quality (too low) or huge files (too high) | Use 192 kbps for optimal balance |
| Never testing on phone | Sounds different on viewers' most common device | Always final QC on mobile device |
Case Study 1: Tutorial Creator's Retention Transformation
Creator: Xiaohongshu tech tutorial creator Problem: 40% viewer drop-off within 30 seconds, despite valuable content Issue: Poor audio quality - room echo, inconsistent volume, background noise Solution Implemented:
Results (60 days):
Case Study 2: Podcaster's Audio Upgrade
Creator: Storytelling podcast on Xiaohongshu Problem: Listeners complained about "can't hear in car," "too quiet then too loud" Solution:
Results:
Case Study 3: Brand's Audio Consistency
Brand: Beauty brand with multiple content creators Problem: Inconsistent audio quality across 20+ creators, damaged brand credibility Solution:
Results (3 months):
REQUIRED:
RECOMMENDED:
NEXT STEPS:
Professional audio is not about expensive gear - it's about clean recording and thoughtful processing. A ¥300 microphone with good technique beats a ¥5000 mic used poorly. Your viewers will forgive imperfect visuals, but they will abandon content with painful audio. Invest in audio processing first, visuals second.