Skill

build-behavior-model

Guides the construction of a functional behavior model that decouples timing but preserves architectural state for RTL difftest — what to keep, what to drop, where to embed probes, how to package the cross-language interface, and how to coordinate memory with the DUT. Activate when the user explicitly invokes /chipdev-method:build-behavior-model, or asks "什么是行为模型", "behavior model 边界", "difftest 用什么 ref 模型", "怎么做 functional simulator", or designs the reference half of a difftest setup.

npx claudepluginhub curryfromuestc/dev-guide --plugin chipdev-method

Tool Access

This skill uses the workspace's default tool permissions.

Preview

Use this skill when the user is building the **functional reference model**

Supporting Assets

references/case-imperas-rvvi.mdreferences/case-nemu.mdreferences/case-spike.md

SKILL.md

Similar Skills

canary-watch

179.4k

Monitors deployed URLs for regressions after deploys, merges, or upgrades by checking HTTP status, console errors, network failures, performance (LCP/CLS/INP), content, and API health.

ecc

Stats

Parent Repo Stars0

Parent Repo Forks0

Last CommitMay 5, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Build Behavior Model

Use this skill when the user is building the functional reference model that a difftest setup will compare against the RTL DUT. Do not autoload.

A behavior model is not a slow performance model with cycles dropped. It is a different artifact with a different contract: it captures architectural state evolution, drops everything timing-coupled, and exposes a curated set of probes for difftest.

If the user wants a cycle-accurate model for microarchitecture exploration, redirect to build-perf-model.

How to use this skill in a response

When triggered:

Confirm the behavior model's purpose — difftest reference, software bring-up, or both.
Pin down the state contract (what must be reproduced; what must not).
Recommend a drive direction — almost always DUT-pushes-ref.
Surface the probe pre-embedding plan; without it, difftest is bolt-on and brittle.
Route to align-and-difftest once the behavior model is implementable.

If the user has not yet defined the alignment contract in define-contracts, push them back. Behavior model implementation without agreed sampling points means the model will be unusable for difftest.

Behavior vs performance model `[abstract]`

Dimension	Behavior model	Performance model
Captures	Architectural state evolution	Cycle-by-cycle microarchitecture
Time model	Event-ordered or sequential, no clock	Cycle-accurate clock
Speed target	10×–100× hardware	0.001× hardware
Used by	Software, driver, difftest reference	Microarch exploration, perf reports
Memory model	Often shares memory with DUT	Owns its own memory
Async events	RTL forwards them in	Drives them on its own

Do not blur these roles. A "fast" performance model and a "slow" behavior model are not on the same axis — they have different state shapes.

What to keep, what to drop `[abstract]`

Keep

Architectural state: register file, PC, CSR, memory contents, TLB entries, pending interrupt mask, debug-mode flag.
ISA semantics: instruction effects, exception conditions, memory ordering required by the architecture.
Trap/interrupt routing: which exception type goes to which handler.
Memory model semantics: ordering rules, atomic semantics.

Drop

Pipeline stages, register-renaming tables, scoreboard state.
Cache state and replacement policy (model the cache interface, not the cache internals — the DUT will own those).
Branch predictor state.
Per-cycle counters (mcycle, perf counters that depend on micro-arch).
Clock-domain timing relationships.

Defer

Floating-point exact ULP behavior — this is real architectural state but extremely expensive to model. Plan a precision tier (start at IEEE-754 round-to-nearest, refine with hardware-specific subnormal handling later).
Atomic instruction semantics in multi-core — start with sequentially consistent, add weak-ordering later if the architecture requires it.

Architectural state checklist `[abstract]`

For an ISA-style design, the typical list is:

Element	Required?	Notes
General-purpose registers	Yes	Width, count from spec.
Floating-point / vector registers	If applicable	ULP precision is a separate tier.
Program counter	Yes	One per hart / lane.
Control / status registers	Yes	Enumerate the architectural ones; skip implementation-defined `mcycle`-class.
Memory	Yes	Share or duplicate — see "Memory coordination".
TLB / page tables	If applicable	Model the architectural translation, not the implementation.
Interrupt pending state	Yes	One bit per architectural interrupt source.
Debug-mode state	If applicable	dpc, dcsr, etc.
Privilege level	If applicable	Current and stack.

For an accelerator-style design (GPU / NPU / DSA), the list shifts:

Element	Notes
Command queue state	Head, tail, sequence numbers.
Per-task / per-kernel state	Whatever the spec defines as architecturally observable.
Memory views	Address ranges visible to architectural code.
Output buffers	The "answer" the DUT must produce.

For accelerators, the alignment contract is usually transaction-level (see align-and-difftest), not retire-level.

Probe pre-embedding `[abstract]`

The behavior model exists to be observed by difftest. Probes are not optional. They must be:

Defined at the boundary — input transactions, output transactions, and (for ISA designs) instruction commit points.
Generated from the same DSL that drives interface and hierarchy generation (see define-contracts). The probe header is a fourth emission target alongside C++ struct, SV interface, and difftest bridge.
Bidirectional: the DUT can call probe functions to push state into the ref, and the ref can call probe functions to pull state from the DUT (mostly for memory feedback — see below).

Concretely, every architectural state element should have:

A probe_<element>(value) function that the DUT calls (via DPI-C or shared memory) at the architectural commit moment.
A query_<element>() function the ref can call when needed.

Pre-embed probes in the behavior model from day one. Bolting them on later forces re-architecting.

Cross-language interface `[industry-pattern]`

Mechanism	When to use	Notes
DPI-C with packed structs	Default. Both sides in same process.	Zero-copy across SystemVerilog ↔ C; no IPC; debuggable.
VPI / PLI	DPI-C unavailable, legacy tooling only.	Slower; harder to tune.
Shared memory ring buffer	Ref must run as a separate process (e.g., the ref is QEMU or a third-party binary).	Higher overhead but flexible.
TCP/Unix socket	Distributed setup, mostly debugging.	Slowest, easiest to instrument.

Default recommendation: DPI-C + packed struct. Package the behavior model as a shared library (.so) the testbench links to. This matches what the major open-source RISC-V difftest setups do [case: OpenXiangShan/difftest, lowRISC Ibex cosim].

Memory coordination — let the DUT feed the ref `[industry-pattern]`

A common false-positive source: ref and DUT each maintain their own memory copy, drift, then disagree on a load value that came from an implementation-defined region (UART, MMIO, weakly-ordered shared region).

The fix: DUT loads, then forwards the loaded value to the ref. The ref does not independently model that load. Two operational forms:

DiffMem-style (push every load result): on every load that retires, the DUT calls ref_set_load_result(addr, value) before instructing the ref to commit the load instruction. Used by NEMU + OpenXiangShan/difftest.
Memory-shadow (pull on demand): the ref consults a mem_query(addr) function backed by the DUT's view of memory. Used when the ref runs ahead of the DUT.

Either way, the principle is: the ref should not have an independent memory model that competes with the DUT's. [case: NEMU DiffMem, ImperasDV with RVVI mem-feed]

Async event handling — RTL detects, ref accepts `[industry-pattern]`

Interrupts, debug breakpoints, and external events arrive asynchronously to the DUT. If the ref also generates them on its own internal clock, alignment is impossible.

Pattern: RTL detects the event, then forwards a sync signal to the ref. The ref injects the event at the next architectural commit boundary, so both sides see the same architectural-time injection point.

                  external IRQ
                       ↓
  RTL DUT  ──────[detect, latch at commit]──────→ ref:  commit_with_irq()

[case: lowRISC Ibex cosim, ImperasDV / RVVI]

Performance strategies for behavior models `[abstract]`

A behavior model that is too slow becomes the bottleneck of difftest. Targets are typically 100×–1000× faster than the cycle-accurate model.

Pack the ref as a .so, not a separate process. Avoid IPC.
Inline the architectural state: a flat struct with PC, registers, CSR — no virtual functions on the hot path.
Skip what doesn't matter: micro-arch state, performance counters (those that aren't architecturally visible).
Batch commits when the DUT can retire multiple instructions per cycle: pass an array of retire records, not one call per record.
Snapshot regularly for replay-on-mismatch (see "Snapshot debugging" in align-and-difftest).

Common failure modes `[abstract]`

Behavior model with cycle counters. A mcycle-style counter that reflects "ticks of the simulator" leaks into difftest as architectural state and causes mismatches that aren't real. Either omit, or stub to match the DUT's reported value.
Probes added after implementation is done. They end up timing- coupled and brittle. Pre-embed at module write time.
Independent memory model in the ref. Implementation-defined regions flag false positives. Let the DUT feed the ref's memory loads.
Ref runs ahead of DUT for asynchronous events. Interrupts injected on the ref's own schedule never agree with the DUT's. Always have the RTL detect and forward.
Floating-point divergence ignored until Phase 4. Subnormal handling and rounding-mode bugs surface late and refuse to localize. Align precision tier with the RTL team in Phase 0.
Behavior model implements branch prediction. It shouldn't; that's micro-arch. The ref always assumes branches resolve at commit.

build-behavior-model

Tool Access

Preview

Supporting Assets

SKILL.md

Similar Skills

Help us improve

Help us improve

build-behavior-model

Tool Access

Preview

Supporting Assets

SKILL.md

Build Behavior Model

How to use this skill in a response

Behavior vs performance model [abstract]

What to keep, what to drop [abstract]

Keep

Drop

Defer

Architectural state checklist [abstract]

Probe pre-embedding [abstract]

Cross-language interface [industry-pattern]

Memory coordination — let the DUT feed the ref [industry-pattern]

Async event handling — RTL detects, ref accepts [industry-pattern]

Performance strategies for behavior models [abstract]

Common failure modes [abstract]

See also

Similar Skills

Help us improve

Build Behavior Model

How to use this skill in a response

Behavior vs performance model [abstract]

What to keep, what to drop [abstract]

Keep

Drop

Defer

Architectural state checklist [abstract]

Probe pre-embedding [abstract]

Cross-language interface [industry-pattern]

Memory coordination — let the DUT feed the ref [industry-pattern]

Async event handling — RTL detects, ref accepts [industry-pattern]

Performance strategies for behavior models [abstract]

Common failure modes [abstract]

See also

Behavior vs performance model `[abstract]`

What to keep, what to drop `[abstract]`

Architectural state checklist `[abstract]`

Probe pre-embedding `[abstract]`

Cross-language interface `[industry-pattern]`

Memory coordination — let the DUT feed the ref `[industry-pattern]`

Async event handling — RTL detects, ref accepts `[industry-pattern]`

Performance strategies for behavior models `[abstract]`

Common failure modes `[abstract]`

Behavior vs performance model `[abstract]`

What to keep, what to drop `[abstract]`

Architectural state checklist `[abstract]`

Probe pre-embedding `[abstract]`

Cross-language interface `[industry-pattern]`

Memory coordination — let the DUT feed the ref `[industry-pattern]`

Async event handling — RTL detects, ref accepts `[industry-pattern]`

Performance strategies for behavior models `[abstract]`

Common failure modes `[abstract]`