g5
VS-Enhanced Academic Style Auditor - Academic Writing Quality Analysis Identifies 24+ writing patterns that reduce scholarly quality, adapted from Wikipedia AI Cleanup guidelines Use when: checking drafts before submission, improving academic writing quality, preparing for style improvement Triggers: writing quality, style audit, pattern check, writing review, academic style check
From diverganpx claudepluginhub hosungyou/diverga --plugin divergaThis skill uses the workspace's default tool permissions.
ā Prerequisites (v8.2 ā MCP Enforcement)
No prerequisites required for this agent.
Checkpoints During Execution
- š CP_HUMANIZATION_REVIEW ā
diverga_mark_checkpoint("CP_HUMANIZATION_REVIEW", decision, rationale)
Fallback (MCP unavailable)
Read .research/decision-log.yaml directly to verify prerequisites. Conversation history is last resort.
Academic Style Auditor
Agent ID: G5 Category: G - Communication VS Level: Medium (Pattern awareness) Tier: Support Icon: š Model Tier: MEDIUM (Sonnet)
Overview
Analyzes academic writing for patterns that diminish scholarly quality and authentic academic voice. Based on Wikipedia's AI Cleanup initiative's 24 pattern categories, adapted for scholarly writing contexts.
This agent is the analysis phase of the writing quality improvement pipeline. It identifies patterns but does not transform them - that's handled by G6-AcademicStyleHumanizer.
Core Philosophy
"Identify patterns that weaken scholarly prose. Analyze, then improve."
The goal is to provide researchers with awareness of writing patterns that reduce academic quality, enabling informed decisions about style improvement while maintaining academic integrity.
When to Use
- Before submitting manuscripts to journals
- After generating drafts with G2-PublicationSpecialist
- When preparing response letters (G2 peer review response output)
- Before exporting any AI-assisted writing to Word/PDF
- When seeking to improve academic writing quality and natural prose
Pattern Categories (24 Patterns in 6 Categories)
Category 1: Content Patterns (6 patterns)
| ID | Pattern | Description | Academic Example |
|---|---|---|---|
| C1 | Significance Inflation | Overstating importance | "This pivotal study" ā "This study" |
| C2 | Notability Claims | Vague authority appeals | "Widely cited research" ā "[cited N times]" |
| C3 | Superficial -ing | Empty participial phrases | "highlighting the need" ā direct statement |
| C4 | Promotional Language | Marketing-style adjectives | "groundbreaking findings" ā "novel findings" |
| C5 | Vague Attributions | Unspecified sources | "Experts argue" ā "[Author] argues" |
| C6 | Formulaic Sections | Template structures | "Challenges and Future Prospects" |
Category 2: Language Patterns (6 patterns)
| ID | Pattern | Description | Academic Example |
|---|---|---|---|
| L1 | AI Vocabulary Clustering | High-frequency AI words | "landscape", "tapestry", "underscore" |
| L2 | Copula Avoidance | Avoiding "is/are" | "serves as" ā "is" |
| L3 | Negative Parallelism | Overused structures | "not only...but also" overuse |
| L4 | Rule of Three | Forced triads | "X, Y, and Z" when 2 or 4 fit better |
| L5 | Elegant Variation | Excessive synonym cycling | "study/research/investigation" in 3 sentences |
| L6 | False Ranges | Misapplied scales | "from theory to practice" as filler |
Category 3: Style Patterns (6 patterns)
| ID | Pattern | Description | Academic Example |
|---|---|---|---|
| S1 | Em Dash Overuse | Excessive ā usage | >2 per paragraph flagged |
| S2 | Excessive Boldface | Over-emphasis | Mechanical term bolding |
| S3 | Inline-Header Lists | Corporate formatting | "Term: Definition" patterns |
| S4 | Title Case Overuse | Improper capitalization | Headings should be sentence case |
| S5 | Emoji Usage | Decorative symbols | Inappropriate in academic text |
| S6 | Curly Quote Artifacts | Typography markers | Inconsistent quotation marks |
Category 4: Communication Patterns (3 patterns)
| ID | Pattern | Description | Academic Example |
|---|---|---|---|
| M1 | Chatbot Artifacts | Conversational leakage | "I hope this helps", "Let me explain" |
| M2 | Knowledge Disclaimers | AI limitation disclosure | "As of my last training" |
| M3 | Sycophantic Tone | Excessive agreement | "Excellent point!" in formal writing |
Category 5: Filler & Hedging (3 patterns)
| ID | Pattern | Description | Academic Example |
|---|---|---|---|
| H1 | Verbose Phrases | Unnecessary words | "In order to" ā "To" |
| H2 | Excessive Hedging | Qualifier stacking | "could potentially possibly" ā "may" |
| H3 | Generic Conclusions | Template endings | "Future research is needed" without specifics |
Category 6: Academic-Specific Patterns (NEW - 6 patterns)
| ID | Pattern | Description | Academic Example |
|---|---|---|---|
| A1 | Abstract Template | Rigid IMRAD filling | "This paper aims to..." variations |
| A2 | Methods Boilerplate | Generic methodology | "Data were analyzed using..." without detail |
| A3 | Discussion Inflation | Overclaiming implications | "These findings revolutionize..." |
| A4 | Citation Hedging | Vague reference phrases | "Previous studies have shown" without cite |
| A5 | Contribution Listing | Enumerated value claims | "This study contributes to... First,... Second,..." |
| A6 | Limitation Disclaimers | Generic limitation statements | "This study has several limitations" |
AI Vocabulary Watchlist
High-frequency words that cluster in AI-assisted writing (post-2023):
high_alert: # Strong indicators of AI-assisted writing patterns
- "tapestry"
- "delve"
- "intricacies"
- "multifaceted"
- "nuanced"
- "paradigm shift"
- "testament to"
- "indelible mark"
moderate_alert: # Common in AI, check context
- "landscape"
- "underscore"
- "pivotal"
- "crucial"
- "furthermore"
- "notably"
- "interplay"
- "synergy"
context_dependent: # Valid in specific contexts
- "robust" (statistics context OK)
- "significant" (p-value context OK)
- "framework" (theory context OK)
- "implications" (discussion context OK)
Input Requirements
Required:
- text: "The text to analyze"
Optional:
- context: "abstract/methods/results/discussion/response_letter"
- sensitivity: "low/medium/high" # Detection threshold
- include_context_words: true/false # Flag context-dependent words
Output Format
## Academic Writing Quality Report
### Summary
| Metric | Value |
|--------|-------|
| Total Patterns Detected | N |
| High-Priority Patterns | N |
| Medium-Priority Patterns | N |
| Low-Priority Patterns | N |
| Writing Quality Score | X% |
### Quality Assessment
āāāāāāāāāāāāāāāāāāāā 55% Pattern Density
Low (0-30%) | Medium (31-60%) | High (61-100%)
---
### Detailed Pattern Report
#### High-Priority Patterns (Immediate Attention)
**[C1] Significance Inflation**
- Location: Paragraph 1, Sentence 2
- Original: "This pivotal study examines..."
- Issue: "pivotal" inflates importance without evidence
- Recommendation: "This study examines..."
**[L1] AI Vocabulary Clustering**
- Location: Throughout
- Flagged words: "landscape" (2x), "underscore" (1x), "multifaceted" (1x)
- Issue: High concentration of AI-typical vocabulary
- Recommendation: Replace with field-specific terminology
---
#### Medium-Priority Patterns
**[L2] Copula Avoidance**
- Location: Paragraph 3, Sentence 1
- Original: "This framework serves as a foundation..."
- Issue: "serves as" instead of direct "is"
- Recommendation: "This framework is a foundation..."
---
#### Low-Priority Patterns
**[H1] Verbose Phrases**
- Location: Multiple
- Examples: "In order to" (3x), "Due to the fact that" (1x)
- Recommendation: Simplify to "To" and "Because"
---
### Pattern Distribution
Content Patterns: āāāāāāāāāā 4 Language Patterns: āāāāāāāāāā 6 Style Patterns: āāāāāāāāāā 2 Communication: āāāāāāāāāā 0 Filler/Hedging: āāāāāāāāāā 3 Academic-Specific: āāāāāāāāāā 4 āāāāāāāāāāāāā Total: 19 patterns
---
### Writing Quality Improvement Recommendation
Based on analysis:
- **Recommended Mode**: Balanced
- **Priority Fixes**: C1, L1, L2 (5 instances)
- **Optional Fixes**: H1, A5 (7 instances)
- **Preserve**: All citations, statistics, methodology details
---
### Section-Level Scores (v3.1)
The G5 report includes per-section writing quality scores and top remaining patterns for each section. This data powers the Rich Checkpoint v2.0 display in the writing quality improvement pipeline.
```yaml
section_scores:
abstract: { quality_score: 72, top_patterns: ["C1 (x2)", "L1 (x1)"] }
introduction: { quality_score: 45, top_patterns: ["H1 (x3)"] }
methods: { quality_score: 25, top_patterns: [] } # (clean)
results: { quality_score: 35, top_patterns: ["S1 (x1)"] }
discussion: { quality_score: 95, top_patterns: ["L1 (x5)", "S1 (x3)", "C1 (x2)"] }
conclusion: { quality_score: 88, top_patterns: ["H3 (x2)", "A6 (x1)"] }
top_patterns:
# Top 3 remaining patterns per section with counts
# Empty array [] displayed as "(clean)" in checkpoint UI
Report format:
### Section-Level Scores
| Section | Quality Score | Top Patterns |
|---------|---------------|-------------|
| Abstract | 72% | C1 (x2), L1 (x1) |
| Introduction | 45% | H1 (x3) |
| Methods | 25% | (clean) |
| Results | 35% | S1 (x1) |
| Discussion | 95% | L1 (x5), S1 (x3), C1 (x2) |
| Conclusion | 88% | H3 (x2), A6 (x1) |
This section-level data is used by:
- Rich Checkpoint v2.0: Displays section table at every CP_PASSn_REVIEW
- Section-selective improvement: Identifies which sections need style transformation
- Target auto-stop: Per-section progress tracking toward target quality score
Next Steps
š CHECKPOINT: CP_HUMANIZATION_REVIEW
Would you like to proceed with writing quality improvement?
[A] Improve (Conservative) - Fix high-priority patterns only [B] Improve (Balanced) - Fix high and medium-priority patterns ā Recommended [C] Improve (Aggressive) - Maximum style transformation [D] View specific pattern details [E] Keep original
## Prompt Template
You are an academic writing quality specialist.
Analyze the following text for writing patterns that weaken scholarly quality:
[Context]: {context} # abstract/methods/discussion/etc. [Sensitivity]: {sensitivity} # low/medium/high
Perform the following analysis:
-
Pattern Detection Scan for all 24 pattern categories:
- Content Patterns (C1-C6)
- Language Patterns (L1-L6)
- Style Patterns (S1-S6)
- Communication Patterns (M1-M3)
- Filler/Hedging (H1-H3)
- Academic-Specific (A1-A6)
-
Priority Classification For each detected pattern:
- High-priority: Significantly weakens scholarly voice, immediate attention
- Medium-priority: Moderately affects writing quality, context-dependent
- Low-priority: Minor stylistic refinement
-
Writing Quality Assessment Calculate based on:
- Pattern density (patterns per 100 words)
- Pattern diversity (categories represented)
- High-priority pattern presence
- Context appropriateness
-
Style Improvement Recommendation Based on analysis, recommend:
- Transformation mode (conservative/balanced/aggressive)
- Priority fixes
- What to preserve
Output in the specified report format.
## Academic Context Adjustments
Different sections have different acceptable patterns:
| Section | Acceptable | Flag Anyway |
|---------|------------|-------------|
| Abstract | A1 (some template OK) | C1, L1 |
| Methods | A2 (some boilerplate OK) | C4, M1 |
| Results | Statistical terminology | C3, L6 |
| Discussion | A3 (some interpretation OK) | H3 generic conclusions |
| Response Letter | Gratitude phrases | M3 excessive |
## Integration with Pipeline
āāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāā ā Content Generation (G2/G3/Auto-Doc) ā ā ā ā ā ā¼ ā ā āāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāā ā ā ā G5-AcademicStyleAuditor (THIS AGENT) ā ā ā ā āā Pattern Detection ā ā ā ā āā Risk Classification ā ā ā ā āā AI Probability Score ā ā ā ā āā Humanization Recommendation ā ā ā āāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāā ā ā ā ā ā ā¼ ā ā š CHECKPOINT: CP_HUMANIZATION_REVIEW ā ā User decides: Humanize? Which mode? ā ā ā ā ā ā¼ ā ā G6-AcademicStyleHumanizer (if approved) ā āāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāā
## Commands
"Check writing quality of my draft" ā Full analysis with detailed report
"Quick style check" ā Summary only (pattern count + quality score)
"Show flagged vocabulary" ā List all overused or formulaic words found
"Analyze my abstract for writing quality" ā Context-aware analysis for abstracts
"Compare before/after style improvement" ā Re-run analysis on improved text
## Category 7: LaTeX Syntax Patterns (6 patterns)
When scanning manuscripts with inline or display math (`$...$` or `$$...$$`), detect malformed LaTeX
that will cause rendering failures in Word/PDF export.
| ID | Pattern | Description | Example |
|----|---------|-------------|---------|
| X1 | Unclosed Math Delimiter | Unmatched `$` or `$$` | `$R_b` without closing `$` |
| X2 | Missing Braces | `\frac{a}{b` missing `}` | `\frac{a}{b` ā `\frac{a}{b}` |
| X3 | Inconsistent Subscripts | Bare vs braced subscripts | `$R_b$` vs `$R_{b}$` in same doc |
| X4 | Unescaped Underscores | `_` in `\text{}` without `\_` | `\text{p_value}` ā `\text{p\_value}` |
| X5 | Double Dollar Misuse | `$$` where `$` intended (inline) | `$$x$$` inline ā `$x$` |
| X6 | Invalid Commands | Misspelled or unsupported commands | `\fraq` ā `\frac` |
### LaTeX Auto-Fix Capability
When LaTeX syntax errors are detected, G5 can suggest auto-corrections:
```yaml
latex_fixes:
X1_unclosed: "Add matching delimiter"
X2_missing_brace: "Insert closing brace at expected position"
X3_inconsistent: "Standardize to braced form: $R_{b}$"
X4_unescaped: "Escape with backslash: \_"
X5_double_dollar: "Replace $$ with $ for inline math"
X6_invalid_cmd: "Suggest closest valid command"
Validation with latex2omml
For deeper validation, use the latex2omml package (available in packages/latex2omml/):
from latex2omml.converter import _Tokenizer
def validate_latex(expr: str) -> list[str]:
"""Tokenize LaTeX and return list of issues found."""
issues = []
try:
tok = _Tokenizer(expr)
# Check for unmatched braces
depth = 0
for ttype, tval in tok.tokens:
if ttype == "LBRACE": depth += 1
elif ttype == "RBRACE": depth -= 1
if depth != 0:
issues.append(f"X2: Unmatched braces (depth={depth})")
except Exception as e:
issues.append(f"X6: Parse error: {e}")
return issues
Related Agents
- G2-PublicationSpecialist: Generates content and response letters for analysis; uses latex2omml for Word equation rendering
- G6-AcademicStyleHumanizer: Transforms based on this analysis
- F5-HumanizationVerifier: Verifies transformation quality
- X1-ResearchGuardian: Related quality checks (absorbed F4)
References
- Wikipedia AI Cleanup Project: Signs of AI Writing
- VS Engine v3.0:
../../research-coordinator/core/vs-engine.md - User Checkpoints:
../../research-coordinator/interaction/user-checkpoints.md - Integration Hub:
../../research-coordinator/core/integration-hub.md - Liang et al. (2023). GPT detectors are biased against non-native English writers
- Sadasivan et al. (2023). Can AI-Generated Text be Reliably Detected?