From rtl-agent-team
Builds C reference model decoder, runs parallel decoding of JVET/JCTVC/3rd-party conformance bitstreams, and verifies bitexact output against golden references using MD5/PSNR/SSIM/VMAF. AWS Batch optional.
npx claudepluginhub babyworm/rtl-agent-team --plugin rtl-agent-teamThis skill uses the workspace's default tool permissions.
<Purpose>
Generates design tokens/docs from CSS/Tailwind/styled-components codebases, audits visual consistency across 10 dimensions, detects AI slop in UI.
Records polished WebM UI demo videos of web apps using Playwright with cursor overlay, natural pacing, and three-phase scripting. Activates for demo, walkthrough, screen recording, or tutorial requests.
Delivers idiomatic Kotlin patterns for null safety, immutability, sealed classes, coroutines, Flows, extensions, DSL builders, and Gradle DSL. Use when writing, reviewing, refactoring, or designing Kotlin code.
This skill automates the full decoder conformance evaluation pipeline:
Scope: Decoder conformance evaluation at the algorithm/C model level.
/rtl-agent-team:rtl-conformance-test (Phase 5)/rtl-agent-team:codec-rd-evalPhase-agnostic: Commonly used during Phase 1-2 (Research/Architecture) to validate decoder algorithm correctness before committing to hardware implementation. Can also be used at any Phase where decoder conformance verification is needed.
| rtl-conformance-test | codec-conformance-eval | |
|---|---|---|
| Target | RTL implementation (.sv) | C ref model decoder |
| Phase | Phase 5 (Verify) | Phase 1-2 (Research/Architecture) |
| Comparison | JM/HM reference output | Official conformance golden output |
| Purpose | RTL-level bitexact verification | Algorithm-level conformance verification |
Execution modes:
Key features:
<Use_When>
<Do_Not_Use_When>
/rtl-agent-team:ref-model)/rtl-agent-team:rtl-conformance-test, Phase 5)/rtl-agent-team:codec-rd-eval)/rtl-agent-team:rtl-model-consistency)
</Do_Not_Use_When><Why_This_Exists> Decoder conformance is a hard requirement for any video codec implementation. A decoder that fails official conformance streams has algorithmic bugs that will propagate to hardware.
Catching conformance failures at the C model level (Phase 1-2) is orders of magnitude cheaper than discovering them after RTL implementation (Phase 5). Official conformance streams from JVET and JCTVC exercise specific codec features and corner cases that typical test sequences may not cover.
This skill automates the tedious process of running hundreds of conformance streams, comparing outputs, and tracking which profile features are covered. </Why_This_Exists>
<Execution_Policy>
python3 skills/codec-conformance-eval/scripts/compare_output.py --test runs built-in unit tests
</Execution_Policy>Decoder build (build_decoder.sh)
bash skills/codec-conformance-eval/scripts/build_decoder.sh <decoder_src> <decoder_binary> [extra_cflags...]gcc -std=c11 -O2 -Wall -Wextra -lm (C11 standard per CLAUDE.md)-DDECODER_ONLY, -DMAX_DPB_SIZE=16Conformance test execution (run_conformance.py)
python3 skills/codec-conformance-eval/scripts/run_conformance.py <config.hjson> --mode localpython3 skills/codec-conformance-eval/scripts/run_conformance.py <config.hjson> --mode aws-batch.rat/scratch/conformance-eval/results.jsonOutput comparison (compare_output.py)
python3 skills/codec-conformance-eval/scripts/compare_output.py <results.json> <config.hjson>.rat/scratch/conformance-eval/conformance-metrics.jsonReport generation
<Tool_Usage>
# ============================================================
# Step 1: Prerequisite validation
# ============================================================
Glob("refc/*.c") # Verify decoder source exists
Read("<conformance-config.hjson>") # Read conformance configuration
Bash("python3 -c 'import hjson; print(\"OK\")'") # Check dependencies
Bash("ls conformance/") # Verify conformance bitstream dirs exist
Bash("ls conformance/golden-outputs/") # Verify golden output references exist
# ============================================================
# Step 2: Decoder build
# ============================================================
Bash("bash skills/codec-conformance-eval/scripts/build_decoder.sh refc .rat/scratch/conformance-eval/decoder")
# ============================================================
# Step 3: Conformance test execution (parallel)
# ============================================================
Bash("python3 skills/codec-conformance-eval/scripts/run_conformance.py <config.hjson> --mode local",
timeout=600000)
# ============================================================
# Step 4: Output comparison
# ============================================================
Bash("python3 skills/codec-conformance-eval/scripts/compare_output.py .rat/scratch/conformance-eval/results.json <config.hjson> --output .rat/scratch/conformance-eval/conformance-metrics.json")
# ============================================================
# Step 5: Report generation
# ============================================================
Read(".rat/scratch/conformance-eval/conformance-metrics.json")
# Write report to configured output path
</Tool_Usage>
**Example 1: H.264 Baseline decoder conformance** ``` User: "Run H.264 Baseline profile decoder conformance test" → Invoke /rtl-agent-team:codec-conformance-eval → Step 1: Verify refc/ exists, generate conformance-config.hjson (target: h264/Baseline) → Step 2: Build decoder → Step 3: Filter Baseline profile from JVET + JCTVC conformance streams, parallel decoding → Step 4: Per-stream MD5 comparison → 42/45 PASS, 3 FAIL → Step 5: Generate docs/phase-1-research/conformance-eval-report.md → "42 of 45 conformance streams PASS. 3 failures: deblocking filter edge cases. Debugging required." ```Example 2: Full HEVC conformance with SSIM
User: "Run H.265 Main profile conformance with SSIM"
→ quality_metrics: ["psnr", "ssim"] configured
→ Execute JVET + JCTVC + Allegro + Elecard conformance streams
→ Mandatory (JVET+JCTVC): 128/128 PASS
→ Optional (3rd party): 45/48 PASS
→ "All mandatory streams PASS. 3 third-party failures (optional). Overall verdict: PASS."
Example 3: No decoder source
User: "Run decoder conformance test"
→ Step 1: No decoder code found in refc/
→ "No ref C model decoder source found. Run /rtl-agent-team:ref-model first to generate the reference model."
<Escalation_And_Stop_Conditions>
<Final_Checklist> Before reporting completion, verify ALL of the following:
domain-packages/video-codec/knowledge/block-level-conformance.md)If ANY item is unchecked → DO NOT report completion. Fix the issue first. </Final_Checklist>