By breethomas
Build custom LLM-as-judge evaluators for failure modes like tone or faithfulness, evaluate RAG pipelines using Recall@k and faithfulness metrics, generate synthetic test data via dimension-based tuples for LLM pipelines, and analyze traces to judge passes, categorize failures, and prioritize fixes in AI features.
npx claudepluginhub breethomas/bette-think
Build an LLM-as-Judge evaluator for one specific failure mode. Binary pass/fail only. Use when a failure mode requires interpretation (tone, faithfulness, relevance, completeness) and cannot be checked with code. Do NOT use when the failure can be checked with regex, schema validation, or execution tests. Do NOT use before completing error analysis (/upgrade-evals).
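The binary pass/fail contract above can be sketched in a few lines. This is a minimal illustration, not the plugin's actual implementation: the prompt template and function names are assumptions, and the model call itself is left out — only prompt construction and strict verdict parsing are shown.

```python
# Hedged sketch of a binary LLM-as-judge for one failure mode.
# The prompt wording and function names here are illustrative assumptions,
# not the plugin's real prompts. The model call itself is omitted.

JUDGE_PROMPT = """You are judging ONE failure mode only: {failure_mode}.
Read the response below and answer with exactly PASS or FAIL.

Response:
{response}

Verdict:"""

def build_judge_prompt(failure_mode: str, response: str) -> str:
    """Format a single-failure-mode judge prompt."""
    return JUDGE_PROMPT.format(failure_mode=failure_mode, response=response)

def parse_verdict(raw: str) -> bool:
    """Map the judge's raw output to binary pass/fail; refuse anything else."""
    verdict = raw.strip().upper()
    if verdict.startswith("PASS"):
        return True
    if verdict.startswith("FAIL"):
        return False
    raise ValueError(f"Unparseable verdict: {raw!r}")
```

Forcing the output space to exactly PASS or FAIL (and raising on anything else) is what keeps the judge checkable — no partial-credit scales to calibrate.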
Evaluate RAG pipeline retrieval and generation quality separately. Measure Recall@k, Precision@k, MRR, NDCG@k for retrieval. Assess faithfulness and relevance for generation. Use when the AI feature uses retrieval (search, knowledge base, document QA). Do NOT use for non-RAG AI features.
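The retrieval metrics named above are standard and easy to compute from ranked document IDs plus a relevant set. A minimal sketch (binary relevance assumed for NDCG; not the plugin's code):

```python
import math

def recall_at_k(retrieved, relevant, k):
    """Fraction of relevant docs that appear in the top-k retrieved."""
    hits = len(set(retrieved[:k]) & set(relevant))
    return hits / len(relevant) if relevant else 0.0

def precision_at_k(retrieved, relevant, k):
    """Fraction of the top-k retrieved docs that are relevant."""
    return len(set(retrieved[:k]) & set(relevant)) / k

def mrr(retrieved, relevant):
    """Reciprocal rank of the first relevant doc; 0 if none retrieved."""
    for rank, doc in enumerate(retrieved, start=1):
        if doc in relevant:
            return 1.0 / rank
    return 0.0

def ndcg_at_k(retrieved, relevant, k):
    """NDCG with binary relevance: log-discounted gain vs. ideal ordering."""
    dcg = sum(1.0 / math.log2(i + 2)
              for i, doc in enumerate(retrieved[:k]) if doc in relevant)
    ideal = sum(1.0 / math.log2(i + 2) for i in range(min(k, len(relevant))))
    return dcg / ideal if ideal else 0.0
```

Scoring retrieval separately like this tells you whether a bad answer came from missing context (fix the retriever) or from the model ignoring good context (fix generation).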
Create diverse synthetic test inputs using dimension-based tuple generation. Use when bootstrapping an eval dataset, when real user data is sparse, or when stress-testing specific failure hypotheses. Do NOT use when you already have 100+ representative real traces (use stratified sampling instead).
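Dimension-based tuple generation is just a cross product over a few axes of variation, sampled down to a budget. A minimal sketch with made-up dimensions (the axes and names below are illustrative, not the skill's actual schema):

```python
from itertools import product
import random

# Hypothetical dimensions for a customer-support QA feature.
DIMENSIONS = {
    "persona": ["new user", "power user", "frustrated customer"],
    "topic": ["billing", "login", "data export"],
    "phrasing": ["terse", "verbose", "typo-ridden"],
}

def generate_tuples(dimensions, n, seed=0):
    """Sample n distinct dimension combinations to seed synthetic inputs."""
    combos = list(product(*dimensions.values()))
    rng = random.Random(seed)
    sample = rng.sample(combos, min(n, len(combos)))
    return [dict(zip(dimensions.keys(), combo)) for combo in sample]
```

Each sampled tuple then becomes the scaffold for one synthetic prompt (e.g. "a frustrated customer asks a typo-ridden billing question"), which keeps the dataset diverse instead of clustering around one easy case.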
Systematic error analysis on real AI traces. Read traces, judge pass/fail, let failure categories emerge from data, compute failure rates, decide what to fix. Use when you have 50+ test cases or are seeing production failures. Do NOT use when you have fewer than 20 test cases (use /start-evals first).
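The tail end of that workflow — compute failure rates, decide what to fix — reduces to counting judged traces per emergent category. A minimal sketch under the assumption that each trace has already been judged and (if failed) labeled:

```python
from collections import Counter

def failure_rates(judged_traces):
    """judged_traces: list of (passed: bool, category: str | None) pairs.
    Returns per-category failure rates over ALL traces, most common first,
    so the biggest failure mode is the first fix candidate."""
    total = len(judged_traces)
    failures = Counter(cat for passed, cat in judged_traces if not passed)
    return {cat: count / total for cat, count in failures.most_common()}
```

Letting categories emerge from the data (open coding) rather than predefining them is the point of the skill; this helper only does the final tally.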
"Attempt the impossible in order to improve your work." — Bette Davis
PM frameworks and strategic sparring for Claude Code.
This repo is now part of Bette. Install the unified plugin to get all 57 skills, including everything in this repo.
/plugin marketplace add breethomas/bette
/plugin install bette@breethomas
30 skills and frameworks from Marty Cagan, Teresa Torres, Elena Verna, Brian Balfour, Ryan Singer, Hamel Husain, and more. Your sparring partner, not your assistant.
Top skills: strategy-session, spec, shape-up, four-risks, agency-ladder, start-evals, competitive-research, calibrate, now-next-later, growth-loops
7 agents for autonomous research and analysis.
Browse: skills/ · frameworks/ · thought-leaders/
/plugin uninstall pm-thought-partner@breethomas
/plugin marketplace add breethomas/bette
/plugin install bette@breethomas
All your skills are still there, plus 27 more.
MIT
Part of the Bette system. Fasten your seatbelts.
Strategic thinking partner for product decisions. Works through problems conversationally, challenges assumptions, helps you ship faster. Grounded in frameworks from Marty Cagan, Teresa Torres, Elena Verna, Brian Balfour, Chip Huyen, Ryan Singer, Hamel Husain, and more. Complete eval chain from first 20 test cases through error analysis, LLM judges, and RAG evaluation. Plus backlog automation with Linear/GitHub integration.
Share bugs, ideas, or general feedback.
Advanced PM skills: AI Product Canvas, Multi-Source Signal Synthesiser, Experiment Designer, Design Handoff Brief. For senior PMs working on complex or AI-powered products.
18 production-ready Claude Code skills for Product Managers. Discovery, build, measure, communicate.
16 product management skills for PMs and founders: user interviews, PRD writing, scope cutting, feature prioritization, positioning, strategy, metrics, and more.
A deterministic thinking partner that challenges assumptions and applies 150+ mental models to sharpen decisions, solve problems, and think more clearly. Features orientation detection, cognitive operations framework, and structured diagnostic workflows.
Adversarial thinking partner for founders and executives. Stress-tests plans, prepares for board meetings, navigates hard decisions, and forces honest post-mortems.