From auto-academic
Fully autonomous academic research pipeline. Takes a research topic and target venue, then runs without stopping: deep research -> computational experiments (autoresearch-style keep/discard loop) -> paper writing -> integrity verification -> peer review -> iterative revision with tiered escalation -> final integrity check -> finalize. 8-stage unified pipeline: INIT, RESEARCH, EXPERIMENT, WRITE, INTEGRITY, REVIEW, FINAL_INTEGRITY, FINALIZE. Dual-loop architecture with mandatory integrity gates. Triggers on: auto-academic, autonomous research, auto research pipeline, full autonomous paper, research to accepted paper.
How this skill is triggered — by the user, by Claude, or both
Slash command
/auto-academic:auto-academicThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
A fully autonomous orchestrator that takes a research topic and target venue, then runs the complete pipeline — research, experiments, paper writing, integrity verification, peer review, revision — without stopping until the paper is accepted or safety caps are hit.
A fully autonomous orchestrator that takes a research topic and target venue, then runs the complete pipeline — research, experiments, paper writing, integrity verification, peer review, revision — without stopping until the paper is accepted or safety caps are hit.
Core philosophy: Adapted from autoresearch's "NEVER STOP" principle. Once started, the pipeline runs autonomously. Do not pause to ask the user if you should continue. The user expects you to work indefinitely until interrupted.
Architecture: Dual-loop with spine orchestrator and mandatory integrity gates.
English: auto-academic, autonomous research, auto research pipeline, full autonomous paper, research to accepted paper, run auto-academic
All required skills are bundled within this plugin. No external plugin dependencies.
The user provides:
Examples:
/auto-academic "Impact of LLMs on undergraduate writing skills" --venue "Computers & Education"
Run auto-academic on the effects of AI tutoring in STEM education, targeting the Internet and Higher Education
Core rule: After each major stage, present a checkpoint. The pipeline is autonomous, but checkpoints ensure state is recorded and the process is traceable. In fully autonomous mode, FULL/SLIM checkpoints are logged to process-log.md and the pipeline proceeds without waiting. MANDATORY checkpoints log their status but also proceed autonomously. The checkpoint types exist for transparency and state tracking.
| Type | When Used | Content |
|---|---|---|
| FULL | First checkpoint; after non-critical stages | Full deliverables list + metrics logged to process-log |
| SLIM | After 2+ consecutive routine stage completions | One-line status logged to process-log |
| MANDATORY | Integrity gates (Stage 4, Stage 6); Review decisions | Status logged; pipeline proceeds but records PASS/FAIL clearly |
━━━ Stage [X] [Name] Complete ━━━
Metrics:
- Word count: [N] (target: [T] +/-10%) [OK/OVER/UNDER]
- References: [N] (min: [M]) [OK/LOW]
- Coverage: [N]/[T] sections drafted [COMPLETE/PARTIAL]
Deliverables:
- [Material 1]
- [Material 2]
Flagged: [any issues detected, or "None"]
Proceeding to Stage [Y]...
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Goal: Parse input, set up workspace, initialize state. Checkpoint: MANDATORY
Extract topic and venue from the user's message. If either is missing, infer from context or use defaults:
Read ${CLAUDE_PLUGIN_ROOT}/references/venue-profiles.md. Match the user's venue to a profile category. If no match, use WebSearch to research the venue, then assign the closest category.
Record the venue profile settings:
Before creating a new workspace, check for an interrupted run that can be resumed:
ls auto-academic-run-* 2>/dev/null
If an existing workspace is found:
pipeline-state.json to determine current_stage and stage statusescurrent_stage is not "INIT" and not "FINALIZE": log "Found interrupted run for '{topic}'. Resuming from {current_stage}." and jump to that stagecurrent_stage is "FINALIZE" and status is "completed": log "Previous run already complete. Starting fresh run."material_passport.entry_pointIf no existing workspace: proceed with fresh initialization.
TIMESTAMP=$(date +%Y%m%d-%H%M%S)
WORKSPACE="auto-academic-run-${TIMESTAMP}"
mkdir -p "${WORKSPACE}"/{research,experiments/{specs,code,results},paper/latex,reviews,integrity}
Copy ${CLAUDE_PLUGIN_ROOT}/templates/pipeline-state.json to ${WORKSPACE}/pipeline-state.json. Fill in:
topic: the parsed topicvenue: the parsed venuecreated_at: current timestampcurrent_stage: "INIT"material_passport.entry_point: "INIT"material_passport.provenance: []Copy ${CLAUDE_PLUGIN_ROOT}/templates/process-log.md content to ${WORKSPACE}/process-log.md. Fill in topic, venue, timestamp.
Copy ${CLAUDE_PLUGIN_ROOT}/templates/results-tsv-header.tsv to ${WORKSPACE}/experiments/results.tsv.
cd "${WORKSPACE}"
git init
git add -A
git commit -m "chore: initialize auto-academic workspace for '${TOPIC}'"
Log to process-log: [{timestamp}] INIT: Topic="{topic}", Venue="{venue}", Workspace="{workspace}"
Update pipeline-state: current_stage -> "RESEARCH", INIT status -> "completed"
Proceed immediately to Stage 1.
Goal: Produce research deliverables and extract testable hypotheses. Checkpoint: MANDATORY
Trigger the auto-academic:deep-research skill in full mode with the topic. Provide:
Wait for deep-research to complete. It produces:
research/rq-brief.mdresearch/methodology.mdresearch/bibliography.mdresearch/synthesis.mdRead the synthesis report. Identify all claims that:
For each claim, formulate a hypothesis:
Save to research/hypotheses.md using this format:
## Hypotheses
### H1: {statement}
- **Variables:** IV = {X}, DV = {Y}
- **Expected direction:** {positive / negative / nonlinear}
- **Derived from:** Synthesis Report, Section {N}
- **Evidence base:** {key citations supporting this hypothesis}
### H2: {statement}
...
If the user provided sample papers or writing style references at invocation time:
research/style-calibration.mdLog to process-log: [{timestamp}] RESEARCH: Complete. {N} hypotheses extracted.
Update pipeline-state: RESEARCH -> "completed", EXPERIMENT -> "in_progress"
Git commit: git add research/ && git commit -m "feat: complete research phase, {N} hypotheses extracted"
Log MANDATORY checkpoint to process-log:
[CHECKPOINT-MANDATORY] RESEARCH complete: {N} hypotheses, {M} references. Proceeding to EXPERIMENT.
Proceed immediately to Stage 2.
Goal: Validate all hypotheses through computational experiments. Runs as an autonomous keep/discard loop until all hypotheses have a verdict. Checkpoint: FULL on convergence
Invoke the experiment-planner agent with:
research/hypotheses.mdresearch/methodology.md${CLAUDE_PLUGIN_ROOT}/references/experiment-patterns.mdThe agent produces one experiment spec per hypothesis, saved to experiments/specs/.
For each experiment spec, invoke the auto-academic:writing-plans skill to create an implementation plan, then invoke auto-academic:executing-plans to implement the code.
If multiple experiments are independent (no shared data dependencies), use auto-academic:dispatching-parallel-agents to implement them concurrently.
Each experiment's code is saved to experiments/code/{hypothesis_id}/.
Run each experiment. For each run:
Execute the experiment code, redirecting all output:
cd experiments/code/{hypothesis_id}
python main.py > ../../results/run-{NNN}/output.log 2>&1
echo $? > ../../results/run-{NNN}/exit_code.txt
Check the exit code:
Read the output and parse metrics.
Log to experiments/results.tsv:
{run_number}\t{hypothesis_id}\t{status}\t{metrics}\t{description}
Git commit:
git add experiments/
git commit -m "experiment: run {NNN} for {hypothesis_id} — {status}"
If an experiment crashes:
After each successful run, invoke the experiment-evaluator agent with:
The agent updates experiments/claim-evaluation.md.
After evaluating results, invoke the convergence-judge agent for the inner loop. The agent should consult ${CLAUDE_PLUGIN_ROOT}/references/convergence-criteria.md for detailed convergence rules.
CONVERGE: exit the inner loop, proceed to WRITECONTINUE: identify which hypothesis needs re-running, refine the experiment spec, go back to Step 3FORCE_CONVERGE: log warning, proceed to WRITE with best resultsLog to process-log: [{timestamp}] EXPERIMENT: Inner loop converged after {N} runs. Verdicts: {summary}
Update pipeline-state: EXPERIMENT -> "completed", WRITE -> "in_progress"
Git commit: git add experiments/ && git commit -m "feat: complete experiment phase, all hypotheses resolved"
Log FULL checkpoint to process-log:
[CHECKPOINT-FULL] EXPERIMENT complete: {N} runs, {M} hypotheses resolved. Proceeding to WRITE.
Proceed immediately to Stage 3.
Goal: Produce a complete paper draft incorporating research and experiment results. Checkpoint: ADAPTIVE (FULL first time, SLIM on subsequent re-entries from escalation)
Gather all materials for the paper-writing skill:
research/rq-brief.md (research question and scope)research/methodology.md (methodology blueprint)research/bibliography.md (annotated bibliography)research/synthesis.md (synthesis report)research/style-calibration.md (if exists — style preferences)experiments/claim-evaluation.md (hypothesis-evidence mapping)experiments/results/ (raw experiment outputs for tables and figures)Trigger the auto-academic:academic-paper skill in full mode. Provide all gathered materials and instruct it to:
research/style-calibration.mdSave the output to paper/draft-v1.md.
Log to process-log: [{timestamp}] WRITE: Draft v{N} complete. Word count: {N}
Update pipeline-state: WRITE -> "completed", draft_version -> 1, INTEGRITY -> "in_progress"
Git commit: git add paper/ && git commit -m "feat: complete paper draft v1"
Determine checkpoint type:
Proceed immediately to Stage 4.
Goal: Verify academic integrity of the paper draft before peer review. Must pass 100%. Checkpoint: MANDATORY
Invoke the integrity-verification agent with the current paper draft (paper/draft-v{N}.md). The agent performs a 5-phase check:
Phase A: Reference Existence — 100% of all cited references must be verified to exist
- Verify DOI/URL for each reference
- Check author names, publication year, journal/venue
- Flag ghost citations (fabricated references)
Phase B: Citation Context — >= 30% citation context spot-check
- Verify cited content actually says what the paper claims
- Check for citation distortion or out-of-context use
Phase C: Statistical Data — 100% statistical data verification
- Every numerical claim must match the actual experiment output in experiments/results/
- Cross-reference tables and figures against results.tsv
Phase D: Originality — >= 30% originality spot-check + self-plagiarism check
- Check for text reuse from source materials
- Verify AI-generated text does not duplicate verbatim sources
Phase E: Claim Verification — 30% spot-check (minimum 10 claims)
- Verify factual claims are supported by cited evidence
- Flag MAJOR_DISTORTION and UNVERIFIABLE claims
Save the verification report to integrity/pre-review-check.md.
PASS (zero SERIOUS + zero MEDIUM + zero MAJOR_DISTORTION + zero UNVERIFIABLE):
[{timestamp}] INTEGRITY: PASS. {N} references verified, {M} citations checked.INTEGRITY.verification_pass -> true, INTEGRITY.phases_passed -> ["A","B","C","D","E"][CHECKPOINT-MANDATORY] INTEGRITY PASS. Proceeding to REVIEW.FAIL (any issues found):
paper/draft-v{N}.md:
INTEGRITY.retry_countLog to process-log: [{timestamp}] INTEGRITY: {PASS|FAIL after N retries}. Issues: {summary}
Update pipeline-state: INTEGRITY -> "completed", REVIEW -> "in_progress"
Git commit: git add paper/ integrity/ && git commit -m "feat: integrity verification {PASS|FAIL} — {issues summary}"
Proceed immediately to Stage 5.
Goal: Iterate through peer review and revision until the paper is accepted or safety caps are hit. Checkpoint: ADAPTIVE (FULL first time, SLIM on subsequent rounds after routine Minor decisions)
Invoke the auto-academic:academic-paper-reviewer skill. The review mode depends on the current round:
| Round | Mode |
|---|---|
| 1 | full |
| 2 | methodology-focus (if round 1 was Major/Reject) or full (if Minor) |
| 3+ | As specified by escalation protocol |
Save all reviewer reports to reviews/round-{N}/.
Determine checkpoint type:
Read the editorial decision from the review output. Invoke the convergence-judge agent for the outer loop. The agent should consult ${CLAUDE_PLUGIN_ROOT}/references/convergence-criteria.md for convergence rules and ${CLAUDE_PLUGIN_ROOT}/references/escalation-protocol.md for escalation decision logic.
Based on the decision:
CONVERGE (Accept):
[{timestamp}] REVIEW Round {N}: Accept. Paper accepted![CHECKPOINT-MANDATORY] REVIEW Accept at round {N}. Proceeding to FINAL_INTEGRITY.REVISE_AND_CONVERGE (Minor, non-structural):
auto-academic:academic-paper in revision mode with the review commentspaper/draft-v{N+1}.mdreviews/response-to-reviewers/round-{N}-response.mdre-review to confirmCONTINUE (Minor or Major, needs revision):
auto-academic:academic-paper in revision mode with the review commentspaper/draft-v{N+1}.mdreviews/response-to-reviewers/round-{N}-response.mdgit add paper/ reviews/ && git commit -m "feat: revision round {N} — addressing {decision} revision comments"REVIEW.checkpoint_mode -> "SLIM" if this was a routine Minor revisionESCALATE (Round 3 + Major/Reject):
[{timestamp}] REVIEW Round {N}: {decision}. ESCALATING to inner loop.[CHECKPOINT-MANDATORY] ESCALATION triggered at round {N}.escalation-analyst agentFORCE_CONVERGE (Round 5, safety cap):
[{timestamp}] REVIEW Round {N}: Safety cap reached. Force-finalizing.[CHECKPOINT-MANDATORY] Safety cap at round {N}. Force-proceeding to FINAL_INTEGRITY.After each review round, log:
[{timestamp}] REVIEW Round {N}: Decision={decision}, Action={action}
Update pipeline-state with the decision history. Git commit after each round.
Goal: Post-revision final integrity verification before finalization. Must pass 100%. Checkpoint: MANDATORY
Invoke the integrity-verification agent on the accepted/final draft (paper/draft-v{latest}.md). This is a stricter check than Stage 4:
Phase A: 100% reference verification (including references added during revision)
- Full check, no sampling
Phase B: 100% citation context verification (not spot-check — full check)
- Every citation must be verified for accurate representation
Phase C: 100% statistical data verification
- All numbers, tables, figures verified against experiment outputs
Phase D: >= 50% originality spot-check (100% for newly added/modified paragraphs)
- Higher bar than pre-review check
Phase E: 100% claim verification
- Zero MAJOR_DISTORTION + zero UNVERIFIABLE required
- Compare against Stage 4 results to confirm all previous issues are resolved
Save the verification report to integrity/final-check.md.
PASS (zero issues):
[{timestamp}] FINAL_INTEGRITY: PASS. {N} references verified (full), {M} citations verified (full).FINAL_INTEGRITY.verification_pass -> true[CHECKPOINT-MANDATORY] FINAL_INTEGRITY PASS. Proceeding to FINALIZE.FAIL (any issues found):
Log to process-log: [{timestamp}] FINAL_INTEGRITY: {PASS|FAIL after N retries}. Issues: {summary}
Update pipeline-state: FINAL_INTEGRITY -> "completed", FINALIZE -> "in_progress"
Git commit: git add paper/ integrity/ && git commit -m "feat: final integrity verification {PASS|FAIL}"
Proceed immediately to Stage 7.
Goal: Produce the final paper in all output formats, generate bilingual process summary. Checkpoint: MANDATORY
Invoke auto-academic:academic-paper in format-convert mode:
paper/latex/paper.tex (using corresponding document class, e.g., apa7 for APA 7.0)paper/latex/paper.pdf
paper/paper.docxpaper/draft-final.mdReview the session history and generate a comprehensive process record documenting the human-AI collaboration:
1. Compile the following:
- Topic and initial instructions
- Key decision points at each stage
- Iteration counts and experiment summaries
- Review rounds and decisions (with revision counts)
- Integrity check results (pre-review + final)
- Pipeline statistics (total stages, escalations, integrity retries)
2. Generate Markdown:
- English: paper_creation_process_en.md
- (Bilingual if Chinese was used in the session)
3. Convert to LaTeX and compile PDF:
- pandoc MD -> LaTeX body
- Package complete LaTeX document (with cover page, table of contents, headers/footers)
- tectonic compile PDF
- Chinese version: xeCJK + Source Han Serif TC VF
Required sections in process record:
| Section | Content |
|---|---|
| Paper Information | Title, venue, final deliverables list |
| Stage-by-Stage Process | Input/output/key decisions for each stage |
| Experiment Summary | Hypotheses tested, runs executed, verdicts |
| Integrity Summary | Pre-review + final check results, issues found and resolved |
| Review History | Round-by-round decisions, escalations, revision summaries |
| Pipeline Statistics | Total duration, stage durations, iteration counts |
| Collaboration Quality Evaluation | 1-100 score + dimensional analysis (see below) |
Collaboration Quality Evaluation (mandatory final section):
+--------------------------------------------------+
| Collaboration Quality Score: [XX]/100 |
+--------------------------------------------------+
| Direction Setting [score] |
| Intellectual Contribution [score] |
| Quality Gatekeeping [score] |
| Iteration Discipline [score] |
| Delegation Efficiency [score] |
| Meta-Learning [score] |
+--------------------------------------------------+
Scoring criteria: 90-100 = Exceptional; 75-89 = Excellent; 60-74 = Good; 40-59 = Basic; 1-39 = Needs Improvement. Be honest — if the user only pressed "continue," reflect that truthfully.
Save to paper_creation_process_en.md (and paper_creation_process_zh.md if applicable).
Compile to PDF: paper_creation_process_en.pdf.
Append to process-log.md:
## Summary
- **Total duration:** {start_time} to {end_time}
- **Research phase:** {duration}
- **Experiments:** {N} runs, {M} hypotheses, {K} validated
- **Pre-review integrity:** {PASS/FAIL}, {N} issues found, {M} fixed
- **Paper drafts:** {N} versions
- **Review rounds:** {N}, final decision: {decision}
- **Escalations:** {N}
- **Final integrity:** {PASS/FAIL}, {N} issues found, {M} fixed
- **Final word count:** {N}
- **Final reference count:** {N}
## Final Deliverables
1. Paper (Markdown): `paper/draft-final.md`
2. Paper (PDF): `paper/latex/paper.pdf`
3. Paper (DOCX): `paper/paper.docx` (if generated)
4. Experiment code: `experiments/code/`
5. Experiment results: `experiments/results.tsv`
6. Review history: `reviews/`
7. Integrity reports: `integrity/pre-review-check.md`, `integrity/final-check.md`
8. Process record: `paper_creation_process_en.pdf`
9. Process log: `process-log.md`
git add -A
git commit -m "feat: finalize auto-academic pipeline — paper complete"
Log MANDATORY checkpoint: [CHECKPOINT-MANDATORY] FINALIZE complete. Pipeline ended successfully.
Output a concise completion report:
=== Auto-Academic Pipeline Complete ===
Topic: {topic}
Venue: {venue}
Final Decision: {Accept / Force-finalized at round N}
Integrity Gates:
- Pre-review: {PASS/FAIL} ({N} issues found, {M} fixed)
- Final: {PASS/FAIL} ({N} issues found, {M} fixed)
Deliverables:
- Paper: {workspace}/paper/latex/paper.pdf
- Code: {workspace}/experiments/code/
- Results: {workspace}/experiments/results.tsv
- Process Record: {workspace}/paper_creation_process_en.pdf
- Full log: {workspace}/process-log.md
{If force-finalized: "Note: Quality report attached — see integrity/final-check.md for unresolved concerns."}
Once the pipeline starts (after INIT), do NOT:
The pipeline runs autonomously until completion or user interruption. The user may be away from the computer.
| Cap | Value | Action on Hit |
|---|---|---|
| Inner loop experiments | 10 runs | Force-converge, proceed with best results |
| Outer loop review rounds | 5 rounds | Force-finalize with quality report |
| Escalation re-entries | 1 | No second escalation, proceed to final rounds |
| Integrity check retries (Stage 4) | 3 per check | Log warning, proceed with disclaimer |
| Integrity check retries (Stage 6) | 3 per check | Log warning, proceed with disclaimer |
| Experiment crash retries | 3 per experiment | Mark as crash, skip |
If the session is interrupted (context limit, crash, user stops):
auto-academic-run-* directoriespipeline-state.json — check current_stage and material_passportcurrent_stage — the material passport records provenance for all completed stagesIf a skill invocation fails (deep-research, academic-paper, or academic-paper-reviewer):
.gitignore for temporary files and cachesnpx claudepluginhub waterwoods-ai/auto-academicOrchestrates a 9-stage academic research pipeline (research, write, integrity check, review, revise, re-review, re-revise, final integrity check, finalize) by dispatching alterlab-deep-research, alterlab-paper-writer, and alterlab-paper-reviewer with mandatory integrity verification and user confirmation checkpoints.
Orchestrates a 10-stage academic research pipeline: research, writing, integrity checks, two-stage peer review, revisions, and finalization. Coordinates deep-research, academic-paper, and academic-paper-reviewer skills with mandatory user confirmation checkpoints.
Orchestrates a 10-stage academic research workflow: research, write, integrity check, review, revise, re-review, re-revise, final integrity check, finalize. Coordinates deep-research, academic-paper, and academic-paper-reviewer skills with mandatory user confirmation and integrity gates.