Skills for analyzing large text corpora — topic modeling, NER, categorization, taxonomy derivation, word frequency, synonym clustering, parametric and correlation analysis. Supports classical NLP, local LLMs, and cloud LLMs (OpenRouter) with explicit cost-awareness for large runs.
npx claudepluginhub danielrosehill/claude-code-plugins --plugin text-corpus-analysisAssign each document in a corpus to one of N user-defined categories. Use when the user has a fixed taxonomy (e.g. 10-20 labels) and wants every note/document routed into exactly one (or top-k) of them. Supports zero-shot classifiers, local LLMs, and cloud LLMs with cost-aware batching.
Decide whether a corpus analysis task should use classical NLP, a local LLM, or a cloud LLM (OpenRouter) given corpus size, task complexity, and cost tolerance. Use first, before any other skill in this plugin, especially when the corpus is large (thousands+ of documents) or when an LLM pass could get expensive.
Correlate metadata (timestamps, tags, source, author) with content features (topics, entities, length, sentiment) to surface non-obvious patterns. Use when the user asks "does X correlate with Y in my corpus" or wants to discover relationships between when/where/who and what.
Build a multi-level taxonomy (categories → tags → sub-categories) from a text corpus. Use when the user wants more than a flat category list — e.g. "give me a hierarchical taxonomy for my tech notes" or "categories, tags, and sub-tags for this corpus of GitHub repos".
Extract named entities (people, places, organizations, dates, products) from a text corpus. Use when the user wants to know "who and where is mentioned" or needs a list of entities for downstream indexing, linking, or trend analysis.
Compute summary statistics over a text corpus — average word length, words/doc, sentences/doc, lexical diversity, readability scores, token length distributions. Use when the user wants a quantitative description of the corpus shape rather than its content.
Recommend well-maintained external libraries and tools for text corpus analysis beyond what this plugin ships — classical NLP, topic modeling, corpus indexing, aspect-based sentiment, multilingual analysis. Use when a task calls for something this plugin doesn't do natively, or when the user wants a survey of what's out there.
Audit what local LLM runtimes are installed (Ollama, llama.cpp, vLLM, LM Studio) and suggest/install a model suitable for corpus analysis tasks — classification, labeling, summarization. Use when a skill in this plugin wants a local-LLM lane but nothing is available, or when the user wants to move a cloud workload local for cost/privacy.
Configure OpenRouter as the cloud-LLM backend for skills in this plugin. Use when a skill needs cloud LLM access and the user wants pay-as-you-go routing across Claude, GPT, Gemini, DeepSeek, Llama, Qwen without managing multiple provider keys.
Derive N categories from the dominant themes of a corpus — the user says "give me 10 categories for these 1000 notes" or "propose 20 labels that would cover most of this data". Produces a proposed category list with definitions, coverage estimates, and example documents.
Identify tokens or phrases that refer to the same concept but appear in different forms — transcription variants from voice notes, spelling variants, acronyms vs expansions, aliases. Use on any voice-note or STT-derived corpus before frequency/NER/topic work, or to deduplicate entity lists.
Identify topic clusters in a text corpus and track how those topics evolve over time. Use when the user has a body of notes, voice notes, articles, or documents and wants to know "what is this corpus mostly about" or "how have my interests shifted". Supports classical topic modeling (LDA/BERTopic) and LLM-assisted labeling.
Identify temporal trends across a text corpus — rising/falling topics, entities, or keywords over time. Use after topic-analysis or ner-extraction when the user wants "what am I talking about more / less than before" or "when did X first show up".
Count word/token occurrences across a corpus with stopword filtering, stemming/lemmatization options, and n-gram support. Use when the user wants a simple frequency export — "how often does X come up", "top 100 words in my notes", "bigram frequencies".
Claude Code plugin: reusable task definitions for text corpus and topic analysis. Covers categorization, taxonomy development, topic modeling, NER, trend/correlation analysis, and synonym clustering across corpora ranging from a handful of notes to tens of thousands.
| Skill | Purpose |
|---|---|
choose-approach | Decide NLP vs local-LLM vs cloud-LLM for a given task + corpus size. Estimates cost. |
topic-analysis | Topic clusters and their evolution over time. |
ner-extraction | Named entity recognition — people, places, orgs. |
trend-analysis | Temporal trends across topics, entities, or keywords. |
categorize-corpus | Assign each document to one of N user-defined categories. |
suggest-categories | Derive N categories from a corpus's dominant themes. |
define-taxonomy | Build a multi-level category → tag → sub-category taxonomy. |
word-frequency | Word/token counts, stopword-filtered. |
synonym-cluster | Find variant spellings / transcription variants of the same concept. |
parametric-analysis | Summary statistics (avg word length, sentences/doc, etc.). |
correlation-analysis | Correlate metadata (timestamps, tags) with content features. |
setup-local-llm | Audit/install a local LLM suitable for corpus work (Ollama). |
setup-openrouter | Configure OpenRouter access for cloud LLM calls. |
recommend-tools | Catalog of external libraries/plugins for text corpus work. |
choose-approach — pick the execution lane and estimate cost.word-frequency / ner-extraction — cheap classical pass to surface candidates.suggest-categories or define-taxonomy — derive structure from a stratified sample.categorize-corpus — apply the structure to the whole corpus.trend-analysis / correlation-analysis — analyze the categorized corpus over time.UI/UX design intelligence. 67 styles, 161 palettes, 57 font pairings, 25 charts, 15 stacks (React, Next.js, Vue, Svelte, Astro, SwiftUI, React Native, Flutter, Tailwind, shadcn/ui, Nuxt, Jetpack Compose). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient.
Share bugs, ideas, or general feedback.
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Upstash Context7 MCP server for up-to-date documentation lookup. Pull version-specific documentation and code examples directly from source repositories into your LLM context.
This skill should be used when users need to generate ideas, explore creative solutions, or systematically brainstorm approaches to problems. Use when users request help with ideation, content planning, product features, marketing campaigns, strategic planning, creative writing, or any task requiring structured idea generation. The skill provides 30+ research-validated prompt patterns across 14 categories with exact templates, success metrics, and domain-specific applications.
Comprehensive startup business analysis with market sizing (TAM/SAM/SOM), financial modeling, team planning, and strategic research
Develop, test, build, and deploy Godot 4.x games with Claude Code. Includes GdUnit4 testing, web/desktop exports, CI/CD pipelines, and deployment to Vercel/GitHub Pages/itch.io.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claim