Skill

supervisor-report

Guides step-by-step writing of research reports for supervisors summarizing experiment findings, hypothesis tests, data analysis, visualizations, and exploration. Use for multi-experiment result communication.

Markdown

documentation

ai-ml

npx claudepluginhub butanium/claude-lab --plugin clab

Tool Access

This skill uses the workspace's default tool permissions.

When writing a report to your supervisor, the main goal is to give them a clear understanding of what was run, what the hypotheses were, and how they were tested. But there are also several subgoals that the report should achieve:

Stats

Parent Repo Stars: 12

Parent Repo Forks: 0

Last Commit: Mar 20, 2026

What is a report

It should allow the supervisor to easily understand the different experiments and the different findings. It should give them a complete overview of the data collected.
It should allow the supervisor to explore the data collected in the different experiments through a data explorer (see the data explorer section below).
It should make limitations of current results clear.

Writing a report, high level overview

IMPORTANT: Writing a report is a long, multi-step process. Do NOT rush it. The quality of the report depends on following each step carefully. Skipping steps (especially Gather, Reflect, Review, Redteam) leads to shallow reports with overclaims and missing analysis.

Step 0: Create a task list. Before doing anything else, create a task with TaskCreate for EVERY step below (Gather, Draft, Reflect, Refine, Author, Review, Redteam, Triage, Revise, Submit). Mark each task in_progress when you start it and completed when you finish. This prevents you from skipping steps or rushing through them. Do not start step N+1 until step N is marked completed.
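
The gating rule in Step 0 can be pictured with a small sketch. This is an illustrative in-memory tracker only: `ReportChecklist`, `Status`, `start`, and `complete` are hypothetical names, and it does not call or reproduce the TaskCreate tool.

```python
from enum import Enum

# Step names as listed in Step 0.
STEPS = ["Gather", "Draft", "Reflect", "Refine", "Author",
         "Review", "Redteam", "Triage", "Revise", "Submit"]


class Status(Enum):
    PENDING = "pending"
    IN_PROGRESS = "in_progress"
    COMPLETED = "completed"


class ReportChecklist:
    """Hypothetical tracker illustrating the rule; not the TaskCreate tool."""

    def __init__(self, steps=STEPS):
        self.steps = list(steps)
        self.status = {step: Status.PENDING for step in self.steps}

    def start(self, step):
        index = self.steps.index(step)
        # Refuse to start step N+1 until every earlier step is completed.
        unfinished = [s for s in self.steps[:index]
                      if self.status[s] is not Status.COMPLETED]
        if unfinished:
            raise RuntimeError(f"Cannot start {step}: unfinished {unfinished}")
        self.status[step] = Status.IN_PROGRESS

    def complete(self, step):
        # A step can only be completed after it was marked in progress.
        if self.status[step] is not Status.IN_PROGRESS:
            raise RuntimeError(f"Cannot complete {step}: it was never started")
        self.status[step] = Status.COMPLETED
```

With this sketch, calling `start("Draft")` before `complete("Gather")` raises an error, which is the behavior the task list is meant to enforce.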

Then follow these steps in order:

Gather: List all the findings you have from your different experiments. For each experiment, make sure the data was properly analyzed: beyond the fully aggregated data, analyze it at the different levels (per prompt, per model, per condition, etc.). If a scientist's report on some experiment doesn't seem to have explored different aggregations, spawn a new scientist to do so. Ask the scientists to also include plots of the results at different aggregations so that you can look at them yourself. Ask for plots exported as images; HTML exports are not readable. If you'd like an extra plot (e.g. another kind of plot / layout / aggregation), ask the scientist to add it. Also ask the scientists to take a qualitative look at the data and to cite some samples that exemplify their qualitative findings. IMPORTANT: LOOK at the plots, do not just assume they show a certain result, look at them!
Draft: Once you have all the findings and a clear picture of the data, with all the plots needed, write a first draft of the findings in markdown (referencing the plots). This draft is only for your own use: treat it as a scratchpad for arranging your thoughts and the connections between the different findings.
Reflect: Once you have done that, reflect on your current scratchpad: are there any plots missing? Any result you'd want to dig into? Any qualitative analysis that needs more work? Have you actually looked at the plots you mention in the scratchpad?
Refine: Ask the scientists for any extra qualitative analysis or plots you need.
Author: Write the Quarto report as per the /writing-guidelines skill.
Review: Ask a colleague with access only to the report, the qualitative samples included in the report, and the plots included in the report to review it. Is there anything that is unclear? Any overclaim? Any missing experiment details? Any other feedback?
Redteam: Ask a "reviewer" subagent to redteam the report. They are prompted with common pitfalls and errors that scientists often make when writing reports.
Triage: Review the colleague's and reviewer's feedback, and think about what to address and what not to. Don't forget that your supervisor is the sole reader of this report, not the colleague or the reviewer.
Revise: Address the colleague's and reviewer's feedback if needed.
Submit: Ping your supervisor to review the report.
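
To make the "different levels" of the Gather step concrete, here is a minimal sketch of aggregating the same samples overall, per model, per model and condition, and per prompt, while keeping the sample count next to every mean. The records, field names, and the `aggregate` helper are all assumptions made for illustration; real experiment data will have its own schema.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical per-sample records; field names are illustrative only.
samples = [
    {"model": "A", "condition": "control", "prompt": "p1", "score": 1.0},
    {"model": "A", "condition": "control", "prompt": "p2", "score": 0.0},
    {"model": "A", "condition": "treated", "prompt": "p1", "score": 1.0},
    {"model": "B", "condition": "control", "prompt": "p1", "score": 0.0},
    {"model": "B", "condition": "treated", "prompt": "p1", "score": 1.0},
    {"model": "B", "condition": "treated", "prompt": "p2", "score": 1.0},
]


def aggregate(records, keys, value="score"):
    """Return {group: (mean, n)} for records grouped by the given keys."""
    groups = defaultdict(list)
    for record in records:
        groups[tuple(record[k] for k in keys)].append(record[value])
    return {group: (mean(values), len(values))
            for group, values in groups.items()}


overall = aggregate(samples, keys=[])                      # fully aggregated
per_model = aggregate(samples, keys=["model"])
per_model_condition = aggregate(samples, keys=["model", "condition"])
per_prompt = aggregate(samples, keys=["prompt"])
```

In this toy data the two models have the same per-model mean, yet the per-model-and-condition view shows they differ under the control condition. That is the kind of structure a single fully aggregated number can hide.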

Other requirements

Do NOT delegate writing sections of the report to subagents. You ARE the orchestrator, and you should be the one writing the report because you have the critical context. You can use subagents to write external reports for you, but you should read those and incorporate them yourself.

Updating the report

When reading the report, your supervisor will likely ask you for extra experiments and visualizations. When doing that you should follow these steps:

Gather: Request extra plots / aggregations etc from scientists. IMPORTANT: LOOK at the plots, do not just assume they show a certain result, look at them!
Reflect: Reflect on those findings: do they change the current story of the report, or are they just an addition in a foldable section for the supervisor to look at the data?
Update: Update the report with the new plots / findings.

Technical details

If there is a metric that is unrelated to the results but still useful, such as coherence, add global widgets to the report that let the supervisor re-plot all the plots in the report with, e.g., a minimum-coherence slider.
Always show the number of samples behind each datapoint in your plots. If the plot has several lines, hovering over a point should display the number of samples it is the mean of. If it has a single line, put the numbers at the top of the plot, aligned with the x-ticks, and so on. The goal is for your supervisor to easily see how many samples correspond to each datapoint rather than just the %.
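
The two requirements above can be sketched without committing to a plotting library: filter samples by a minimum coherence, then recompute each datapoint together with the count it is based on. The records, field names, and the `datapoints` helper are hypothetical. In a real report `min_coherence` would presumably be wired to the global slider, and `label` would feed the hover text or the annotations above the plot.

```python
from statistics import mean

# Hypothetical samples: x is the plotted condition, y a 0/1 outcome,
# coherence an auxiliary quality score in [0, 1].
samples = [
    {"x": 0, "y": 1, "coherence": 0.9},
    {"x": 0, "y": 0, "coherence": 0.4},
    {"x": 0, "y": 1, "coherence": 0.8},
    {"x": 1, "y": 1, "coherence": 0.95},
    {"x": 1, "y": 0, "coherence": 0.2},
]


def datapoints(records, min_coherence=0.0):
    """Mean of y per x after filtering, with the sample count behind each mean."""
    kept = [r for r in records if r["coherence"] >= min_coherence]
    points = []
    for x in sorted({r["x"] for r in kept}):
        ys = [r["y"] for r in kept if r["x"] == x]
        rate = mean(ys)
        points.append({
            "x": x,
            "mean": rate,
            "n": len(ys),
            # Text to show on hover or above the plot at this x position.
            "label": f"{100 * rate:.0f}% (n={len(ys)})",
        })
    return points


unfiltered = datapoints(samples)
filtered = datapoints(samples, min_coherence=0.5)
```

Raising the threshold changes both the means and the counts, which is why the counts need to be recomputed and displayed alongside every filtered plot.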