From syntari-nexus
AGI: SteerConf confidence calibration protocol — generates calibrated confidence scores for any analytical output using three-pass prompting (conservative/neutral/optimistic) with consistency measurement. Triggers on any investment analysis, credit memo, deal screening, underwriting, or financial model output. Also triggers when user asks 'how confident are you', 'what's the reliability', 'confidence level', or 'how sure'.
How this skill is triggered — by the user, by Claude, or both
Slash command
/syntari-nexus:confidence-calibrationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Every analytical output must include a calibrated confidence assessment. This skill implements the SteerConf protocol — a prompt-only technique that halves calibration error with zero infrastructure requirements.
Every analytical output must include a calibrated confidence assessment. This skill implements the SteerConf protocol — a prompt-only technique that halves calibration error with zero infrastructure requirements.
Standard AI outputs don't indicate reliability. A model may express identical certainty for a well-supported analysis backed by audited financials and a speculative estimate based on a single data point. SteerConf solves this by measuring internal consistency across calibration perspectives.
Generate the analysis assuming pessimistic interpretations:
Generate the analysis using balanced judgment:
Generate the analysis assuming favorable interpretations:
After all three passes, measure agreement:
| Consistency Level | Score Range | Interpretation |
|---|---|---|
| High (3/3 agree on direction) | 0.80 - 1.00 | Strong analytical foundation, reliable output |
| Medium (2/3 agree) | 0.50 - 0.79 | Reasonable analysis, some uncertainty |
| Low (all 3 diverge) | 0.20 - 0.49 | High uncertainty, flag for human review |
| Conflicting (conservative and optimistic oppose) | 0.00 - 0.19 | Insufficient data or fundamental ambiguity |
base_score = (agreement_count / total_dimensions) * 0.6
data_quality = (verified_datapoints / total_datapoints) * 0.25
source_diversity = (unique_sources / max_expected_sources) * 0.15
confidence = base_score + data_quality + source_diversity
Every analysis that uses this protocol must append:
## Confidence Assessment
- **Score**: 0.XX
- **Calibration**: conservative | neutral | optimistic
- **Confidence Drivers**:
- [Factor 1 that supports reliability]
- [Factor 2 that supports reliability]
- **Data Gaps**:
- [Missing data point 1 — would improve confidence by ~X%]
- [Missing data point 2 — would improve confidence by ~X%]
- **Three-Pass Agreement**:
- Conservative: [key conclusion]
- Neutral: [key conclusion]
- Optimistic: [key conclusion]
- Agreement level: High | Medium | Low | Conflicting
This skill is automatically invoked by:
quick-cut — appends confidence to the credit memodeal-screener — calibrates the 100-point scorecre-underwriter — calibrates return projectionsfinancial-modeling — calibrates model outputsadversarial-debate — calibrates the synthesized viewWhen another skill references "confidence calibration", it means: run the three-pass protocol on the output and append the confidence assessment block.
npx claudepluginhub syntari-international-inc/syntari-nexus-pluginProvides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.