Component skill for creating compelling data-driven presentations and whitepapers using marp and pandoc with proper citations and reproducibility
Creates professional data presentations and whitepapers using marp and pandoc. Use when you need to communicate analysis findings to stakeholders with proper citations and reproducibility documentation.
/plugin marketplace add tilmon-engineering/claude-skills/plugin install datapeeker@tilmon-eng-skillsThis skill inherits all available tools. When active, it can use any tool Claude has access to.
formats/citations.mdformats/reproducibility.mdframeworks/3-paragraph-essay.mdframeworks/narrative-structure.mdtools/marp.mdtools/pandoc.mdThis component skill guides creation of professional data-driven presentations and whitepapers. Use it when:
Supports two complementary formats:
interpreting-results skill)creating-visualizations skill)Create a TodoWrite checklist for the 5-phase presentation process:
Phase 1: Analyze Audience & Purpose - pending
Phase 2: Structure Narrative - pending
Phase 3: Create Content - pending
Phase 4: Add Citations & Reproducibility - pending
Phase 5: Generate Outputs - pending
Mark each phase as you complete it. Document all presentation materials in your analysis directory.
Goal: Understand who will consume your presentation and what decisions they need to make.
Executive Stakeholders:
Technical Peers:
Mixed Audience:
What decisions will this presentation support?
What actions should the audience take?
CHECKPOINT: Before proceeding to Phase 2, you MUST have:
Goal: Organize findings into a compelling narrative that guides the audience to your conclusions.
→ See frameworks/3-paragraph-essay.md for detailed guidance
The classic essay structure adapts perfectly to data presentations:
Introduction & Thesis
Body & Supporting Arguments
Conclusion & Next Steps
Bibliography & Supporting Documentation
→ See frameworks/narrative-structure.md for storytelling patterns
Data presentations follow narrative arcs:
Create outline document: analysis/[session-name]/presentation-outline.md
For Presentations (Marp):
# Presentation Outline
## Slide 1: Title & Context
- Analysis question
- Time period and data sources
## Slide 2-3: Key Findings (Thesis)
- 3-5 bullet points with metrics
- Visual emphasis
## Slide 4-7: Supporting Evidence (Body)
- One finding per slide
- Include visualizations
- Reference methodology
## Slide 8: Conclusions & Recommendations
- Restate key findings
- Next steps
- Questions
## Slide 9: Reproducibility Notes (Appendix)
- Data sources
- Query locations
- Validation status
For Whitepapers (Pandoc):
# Whitepaper Outline
## Introduction (2-3 pages)
- Business context and objectives
- Research question
- Thesis statement
## Methodology (5-10 pages)
- Data sources and collection methods
- SQL queries and transformations
- Analysis frameworks used
- Data quality assessment
## Results (10-20 pages)
- Finding 1 with supporting data
- Finding 2 with supporting data
- Finding 3 with supporting data
- Visualizations and tables
## Discussion (5-10 pages)
- Interpretation of findings
- Comparison to prior research
- Limitations and caveats
- Alternative explanations
## Conclusions (2-3 pages)
- Summary of key findings
- Recommendations
- Future research directions
## References
- Bibliography (BibTeX format)
- Appendix: SQL queries
- Appendix: Data validation notes
CHECKPOINT: Before proceeding to Phase 3, you MUST have:
Goal: Write presentation content using appropriate tools (marp for slides, pandoc for documents).
→ See tools/marp.md for detailed CLI usage and syntax
Marp transforms Markdown into professional slide decks. Use for:
Basic Marp Syntax:
---
theme: gaia
paginate: true
footer: "DataPeeker Analysis"
---
# Q4 Sales Analysis
## Key Findings
---
## Data Source
- Database: `analytics_prod.sales_metrics`
- Query Date: 2025-11-25
- Period: 2024-10-01 to 2024-12-31
---
## Finding 1: Revenue Growth
- Total Revenue: $1.2M
- YoY Growth: +23%
- Top Region: West Coast

---
## Methodology
\```sql
SELECT
DATE_TRUNC('month', date) as month,
SUM(amount) as revenue
FROM sales_metrics
WHERE date BETWEEN '2024-10-01' AND '2024-12-31'
GROUP BY month
\```
---
## Conclusions
- Revenue exceeded target by 15%
- West Coast expansion successful
- Recommend continued investment
---
## Reproducibility
- Queries: `queries/q4_analysis.sql`
- Data validated: 2025-11-25
- Full report: [Technical Whitepaper](./whitepaper.pdf)
Generate Presentation:
marp presentation.md -o presentation.pdf
→ See tools/pandoc.md for detailed CLI usage and citations
Pandoc creates publication-quality documents. Use for:
Basic Pandoc Structure:
---
title: "Q4 Sales Performance Analysis"
author: "DataPeeker Team"
date: "2025-11-25"
institute: "Tilmon Engineering"
abstract: |
This analysis examines Q4 2024 sales data, revealing 23% YoY growth
driven primarily by West Coast expansion. Methodology, findings, and
recommendations are presented with full reproducibility documentation.
keywords: "sales analysis, SQL, data analysis, reproducibility"
toc: true
lof: true
lot: true
---
# Introduction
This analysis addresses the question: What factors drove Q4 2024
sales performance? Using DataPeeker to analyze production database
records, we identify key growth drivers and provide actionable
recommendations.
# Methodology
## Data Collection
Data was extracted from `analytics_prod.sales_metrics` using:
\```sql
SELECT
transaction_id,
customer_id,
region,
amount,
transaction_date
FROM sales_metrics
WHERE DATE(transaction_date) BETWEEN '2024-10-01' AND '2024-12-31'
ORDER BY transaction_date
\```
## Analysis Framework
Analysis followed established data quality frameworks [@jones2024]
and reproducibility standards [@smith2023].
# Results
{#fig:revenue}
Figure @fig:revenue demonstrates clear regional patterns.
# Discussion
Our findings align with previous research on seasonal trends
[@williams2024] while revealing new patterns in customer behavior.
# Conclusions
Three key findings emerge from this analysis:
1. West Coast growth exceeded projections
2. Customer acquisition accelerated in Q4
3. Average transaction value increased 12%
# References
Generate Whitepaper:
pandoc whitepaper.md \
--citeproc \
--bibliography references.bib \
--csl ieee.csl \
-F pandoc-crossref \
-s -V geometry:margin=1in \
--toc \
--number-sections \
-o whitepaper.pdf
Use creating-visualizations component skill to create charts and diagrams:
Terminal visualizations:
creating-visualizations terminal formats for inline code examplesImage-based visualizations:
creating-visualizations image formats (Kroki) for slides and whitepapersBest Practices:
CHECKPOINT: Before proceeding to Phase 4, you MUST have:
Goal: Document data sources, queries, and methodology to enable reproducibility and proper attribution.
→ See formats/citations.md for BibTeX and CSL formats
Proper citation enables:
Citation Types for Data Analysis:
Example BibTeX Entries:
@misc{production_database_2025,
author = {Tilmon Engineering},
title = {Production Sales Metrics Database},
year = {2025},
url = {analytics_prod.sales_metrics},
note = {Query timestamp: 2025-11-25 14:30 UTC}
}
@software{datapeeker_2025,
author = {Tilmon Engineering},
title = {DataPeeker: SQL Analysis Tool},
year = {2025},
version = {2.1.0},
url = {https://github.com/tilmon/datapeeker}
}
→ See formats/reproducibility.md for comprehensive checklist
Reproducible research requires documentation of:
Reproducibility Section Template:
## Reproducibility Information
### Data Sources
- **Database**: `analytics_prod.sales_metrics`
- **Schema Version**: v2.3.1
- **Query Timestamp**: 2025-11-25 14:30:00 UTC
- **Records Examined**: 50,000 transactions
- **Time Period**: 2024-10-01 to 2024-12-31
### Analysis Environment
- **Tool**: DataPeeker v2.1.0
- **Python Version**: 3.11.5
- **Key Libraries**: pandas 2.1.0, plotext 5.2.8
- **Operating System**: macOS 14.6.0
### Query Repository
- **Location**: `github.com/tilmon/analysis/queries/q4_sales.sql`
- **Commit Hash**: abc123def456
- **Execution Time**: 3.2 seconds
- **Rows Returned**: 50,000
### Data Quality Validation
- **Null Values**: 0.02% (within tolerance)
- **Duplicates**: 0 detected
- **Outliers**: 12 identified and documented separately
- **Cross-Validation**: Results match source system aggregate queries
### Reproducibility Instructions
1. Clone repository: `git clone github.com/tilmon/analysis`
2. Install dependencies: `pip install -r requirements.txt`
3. Run analysis: `python scripts/q4_analysis.py`
4. View results: `analysis/q4-2024/01-findings.md`
CHECKPOINT: Before proceeding to Phase 5, you MUST have:
Goal: Compile final presentation and whitepaper artifacts using marp and pandoc.
Basic PDF Output:
marp presentation.md -o presentation.pdf
HTML with Speaker Notes:
marp presentation.md -o presentation.html
PowerPoint (Editable):
marp presentation.md --pptx -o presentation.pptx
With Custom Theme:
marp presentation.md --theme-set custom-theme.css -o presentation.pdf
Watch Mode (Live Preview):
marp -w -p presentation.md
PDF with Citations:
pandoc whitepaper.md \
--citeproc \
--bibliography references.bib \
--csl ieee.csl \
-s -V geometry:margin=1in \
-o whitepaper.pdf
PDF with Cross-References:
pandoc whitepaper.md \
--citeproc \
--bibliography references.bib \
-F pandoc-crossref \
-s --toc --number-sections \
-o whitepaper.pdf
Word Document (Editable):
pandoc whitepaper.md \
--citeproc \
--bibliography references.bib \
--reference-doc=template.docx \
-o whitepaper.docx
HTML for Web Publishing:
pandoc whitepaper.md \
--citeproc \
--bibliography references.bib \
-s -c style.css \
--self-contained \
-o whitepaper.html
Link from slides to detailed documentation:
# Conclusion
For detailed methodology and reproducibility:
→ [Technical Whitepaper](./whitepaper.pdf)
→ [GitHub Repository](https://github.com/tilmon/analysis)
File Organization:
analysis/
├── q4-2024/
│ ├── presentation.md (marp source)
│ ├── presentation.pdf (generated)
│ ├── whitepaper.md (pandoc source)
│ ├── whitepaper.pdf (generated)
│ ├── references.bib (bibliography)
│ ├── queries/
│ │ └── q4_analysis.sql
│ └── visualizations/
│ ├── revenue-chart.png
│ └── regional-breakdown.png
Automation Script:
#!/bin/bash
# generate_deliverables.sh
# Generate presentation
echo "Creating presentation..."
marp presentation.md -o presentation.pdf
# Generate whitepaper
echo "Creating whitepaper..."
pandoc whitepaper.md \
--citeproc \
--bibliography references.bib \
--csl ieee.csl \
-F pandoc-crossref \
-s --toc --number-sections \
-V geometry:margin=1in \
-o whitepaper.pdf
echo "Deliverables created:"
echo " - presentation.pdf (slides)"
echo " - whitepaper.pdf (detailed report)"
CHECKPOINT: Final verification before delivery:
Process skills reference this component skill when presenting findings:
Use the `presenting-data` component skill to create professional
deliverables from your analysis:
- Executive presentations (marp) for quick communication
- Technical whitepapers (pandoc) for detailed documentation
- Both formats with proper citations and reproducibility
When presenting analysis results:
Typical Usage Contexts:
exploratory-analysis → Use presenting-data for final reportguided-investigation → Use presenting-data to document findingshypothesis-testing → Use presenting-data to publish resultscomparative-analysis → Use presenting-data for comparison reportsBefore considering presentation complete:
Content Quality:
Reproducibility:
Citation Quality:
Technical Quality:
Narrative Quality:
Presentation-Whitepaper Hierarchy:
✅ Start with audience analysis to choose format
✅ Use 3-paragraph essay structure for clear narrative
✅ Include SQL queries for reproducibility
✅ Cite data sources in bibliography
✅ Create both slides and whitepaper for mixed audiences
✅ Link from presentations to detailed whitepapers
✅ Document environment, versions, and timestamps
✅ Use creating-visualizations for charts and diagrams
✅ Version control both source markdown and generated PDFs
✅ Test reproducibility instructions before delivery
❌ Skip audience analysis - format should match need ❌ Present findings without methodology documentation ❌ Forget to cite data sources and prior research ❌ Generate PDFs without reviewing output quality ❌ Mix presentation styles (stay consistent) ❌ Overcomplicate slides with too much detail ❌ Omit reproducibility information ❌ Assume audience can recreate analysis without documentation ❌ Ignore cross-references and internal links ❌ Deliver without validating bibliography formatting
Use when working with Payload CMS projects (payload.config.ts, collections, fields, hooks, access control, Payload API). Use when debugging validation errors, security issues, relationship queries, transactions, or hook behavior.