Evaluate a model session and extract lessons (Phase 1 only)
Analyzes model sessions to extract lessons and calculate performance metrics.
/plugin marketplace add anouar1991/rules-learning-pipeline
/plugin install anouar1991-rules-learning-pipeline@anouar1991/rules-learning-pipeline
Run Phase 1 of the learning pipeline - evaluate session and extract lessons.
/rules-learning-pipeline:evaluate-model <session-file> [--verbose]
Task(
  subagent_type: "model-evaluator",
  prompt: "Evaluate session at {input_file}:
    1. PARSE action sequences
    2. CALCULATE metrics (F1, Precision, Recall)
    3. IDENTIFY error types including:
       - Wasteful Verification (broad → dismiss → narrow)
       - Dismissive Reasoning (pre-existing, not related)
       - Redundant Tool Calls
       - Scope Mismatch
    4. EXTRACT lessons with specificity validation
    5. CLASSIFY scope for each lesson
    Output to:
    - docs/evaluations/{session}-evaluation.md
    - docs/evaluations/{session}-lessons-raw.md"
)
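Step 2 of the prompt asks for F1, Precision, and Recall. This page does not say how the evaluator counts true positives, false positives, or false negatives for a session, so the sketch below only shows the standard formulas applied to counts that are assumed to be already available. The function name and its parameters are hypothetical and are not part of the plugin.

```python
def precision_recall_f1(tp: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Return (precision, recall, f1) from raw counts.

    tp, fp, fn are assumed inputs: how the evaluator derives them
    from a session's action sequence is not specified in this document.
    """
    # Guard against division by zero when a denominator is empty.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1
```

For example, with 8 true positives, 2 false positives, and 2 false negatives, precision and recall are both 8/10, and so is their harmonic mean.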
docs/evaluations/{session}-evaluation.md - Full metrics and error analysis
docs/evaluations/{session}-lessons-raw.md - Extracted lessons with scopes
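The two output paths share a {session} placeholder. How the plugin derives that value is not stated here. As a minimal sketch, assuming {session} is the session file's name without its extension, the paths could be built as follows. The helper name, the example file name, and the stem rule are all illustrative assumptions.

```python
from pathlib import Path


def output_paths(session_file: str, out_dir: str = "docs/evaluations") -> tuple[Path, Path]:
    """Build the evaluation and raw-lessons paths for a session file.

    Assumption: {session} is the file name without its extension.
    The plugin itself may derive it differently.
    """
    session = Path(session_file).stem
    base = Path(out_dir)
    return (
        base / f"{session}-evaluation.md",
        base / f"{session}-lessons-raw.md",
    )


# Hypothetical session file, used only for illustration.
evaluation_path, lessons_path = output_paths("logs/session-42.jsonl")
print(evaluation_path.as_posix())
print(lessons_path.as_posix())
```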