Skill

level-up-validation

Validates D&D 5e level-up flows: availability entry, complete recommended packages with auto-selection, and real LevelUpAgent evidence. Use during testing to confirm correctness for any class, subclass, multiclass, or custom class.

testing

Popularity

Stars

Forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/claude-commands:level-up-validation

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Use this skill when judging whether a level-up flow works correctly for any

SKILL.md

172 lines · ~1.5k tokens

Stats

LanguagePython

Stars27

Forks4

MaintenanceExcellent

Last CommitJun 2, 2026

Actions

View Source View Plugin View on GitHub View README

Level-Up Validation

Use this skill when judging whether a level-up flow works correctly for any D&D 5e class, subclass, multiclass, or campaign-defined custom class.

This is an evaluation protocol. Do not repair code from this skill unless the user explicitly asks for implementation.

Required Inputs

Campaign URL or campaign id.
Current PR head SHA.
Raw model request/response or testing_mcp/core/test_level_up_organic.py evidence bundle.
Exported game_state.json, story transcript, server logs, and checksums.
The active LevelUpAgent prompt stack, including $PROJECT_ROOT/prompts/level_up_instruction.md and the LevelUpAgent final modal override in $PROJECT_ROOT/agents.py.

Pass/Fail Standard

Fail closed. A level-up flow passes only when all of these are proven by real artifacts, not agent claims.

1. Availability Entry

Normal story output shows level-up availability when the model-owned signal says target_level > current_level.
Before the modal starts, the player can choose level_up_now.
level_up_now opens or continues the modal only. It does not commit the new level, advance time, roll HP, award rewards, or resolve story actions.

2. Complete Recommended Package

The first modal response after level_up_now must visibly include Recommended package: and must be finalizable immediately.

The package must audit the character's class/subclass/custom class, current level, target level, campaign mechanics, and prior character sheet. It must account for every automatic gain and every player-selectable decision, including when applicable:

HP method and HP gain.
Class, subclass, multiclass, or custom-class features.
Resource pools, uses, recharge cadence, action economy, and derived stats.
Proficiencies, skills, expertise, languages, tools, and saves.
ASI, feat, fighting style, maneuvers, invocations, metamagic, infusions, wild shape, rage improvements, companions, summons, or other class options.
Spell slots, spells known, spells prepared, cantrips, rituals, spellcasting ability, save DC, and spell attack bonus.
Subclass, lineage, domain, oath, patron, bloodline, circle, custom origin, or feature-granted spells.

For prepared casters, always-prepared or feature-granted spells are additive. They do not satisfy the ordinary prepared-spell decision unless the class rules explicitly say so.

For custom classes, the model must use the custom class description and campaign rules. If a required custom-class rule is genuinely absent, the modal must stay active and ask for that missing mechanical input instead of silently skipping it.

3. Auto-Selection

Every required decision category must have a pre-selected recommendation before the player finishes. The visible text must distinguish:

Automatic gains.
Recommended defaults the player can accept.
Player-selectable categories the player can change.

finish_level_up_return_to_game must be present as the final accept option once the recommended package is complete.

4. Click Editing

Every player-selectable recommendation must be editable through planning-block choices.

Acceptable patterns:

The first modal response exposes a concrete level_up_* choice for the category.
A category-level level_up_* change choice opens a follow-up modal with concrete legal options for that category.

Unacceptable patterns:

Only finish_level_up_return_to_game appears while unresolved decisions remain.
A selectable category is described in prose but has no planning-block edit path.
The modal closes or resumes story before the changed selection is confirmed.
Choices are ordinary story/combat/dialog choices instead of level-up choices before finish.

5. Free-Form Editing

Free-form text must work for every selectable category while the level-up modal is active.

Validate at least one free-form edit in evidence, such as:

"Change my prepared spells to ..."
"Use fixed HP instead."
"Pick a different subclass option."
"Change the skill/proficiency choice."

The response must remain in LevelUpAgent, preserve modal state, reflect the updated pending selection, and keep finish_level_up_return_to_game available when all required decisions are complete.

6. Finish Commit

On finish_level_up_return_to_game:

Commit the target level exactly once.
Clear modal scratch state such as pending selections.
Preserve selected/recommended mechanics in player_character_data.
Resume the interrupted story with real gameplay choices.
Do not return only generic placeholders such as "Continue Story" or "Custom Action" when concrete scene choices are available.

Evidence Procedure

Run the canonical real test when real LLM proof is requested:

cd testing_mcp && ../vpython core/test_level_up_organic.py --server http://127.0.0.1:8001

Do not use pytest collection for testing_mcp/ script suites.
Do not set mock-mode toggles for evidence-bearing testing_mcp/ runs.
Record the evidence bundle path, PR head SHA, server URL, model/provider, and pass/fail result.
Inspect raw artifacts, not just summary prose:
- metadata.json
- run.json
- story transcript
- game_state.json
- server logs
- screenshots or video if UI/browser behavior is claimed
Compare the evidence bundle SHA to the live PR head. If stale, declare exactly which later diffs are behavior-neutral before accepting it.

Verdict Template

Use this structure:

Verdict: PASS | FAIL | PARTIAL
Evidence bundle:
PR head:
Model/provider:

Availability:
Recommended package completeness:
Auto-selection:
Click editing:
Free-form editing:
Finish commit:

Blocking gaps:

level-up-validation

Popularity

Invocation

Context Preview

SKILL.md

level-up-validation

Popularity

Invocation

Context Preview

SKILL.md

Level-Up Validation

Required Inputs

Pass/Fail Standard

1. Availability Entry

2. Complete Recommended Package

3. Auto-Selection

4. Click Editing

5. Free-Form Editing

6. Finish Commit

Evidence Procedure

Verdict Template

Similar Skills

Level-Up Validation

Required Inputs

Pass/Fail Standard

1. Availability Entry

2. Complete Recommended Package

3. Auto-Selection

4. Click Editing

5. Free-Form Editing

6. Finish Commit

Evidence Procedure

Verdict Template

Similar Skills