By agent-sh
Codebase onboarding - automated data collection and interactive project orientation
npx claudepluginhub agent-sh/agentsys --plugin onboard
Codebase onboarding for AI agents - automated data collection and interactive project orientation.
Part of the agentsys ecosystem.
Drop into any unfamiliar codebase and get oriented in under 3 minutes. The collector gathers project metadata automatically (no LLM calls), then an Opus agent synthesizes it into a guided tour and answers follow-up questions interactively.
agentsys install onboard
/onboard
Three phases run in sequence:
A pure JavaScript collector gathers everything an agent needs to orient:
| Data Source | What it captures |
|---|---|
| Manifest | package.json, Cargo.toml, go.mod, pyproject.toml, deno.json, CMakeLists.txt, meson.build, setup.py, pom.xml, build.gradle |
| Structure | 3-level directory tree (excluding build artifacts) |
| README | First 5KB of README content |
| CLAUDE.md / AGENTS.md | Project rules and conventions |
| CI/CD | GitHub Actions workflows, Dockerfile presence |
| Git | Branch, commit count, remote URL |
| Repo-intel | Hotspots, ownership, areas, health (if agent-analyzer available) |
| Repo-map | AST symbols and imports (if available) |
No LLM tokens are spent on collection. The agent receives pre-structured data and focuses on synthesis and guidance.
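As a rough illustration of LLM-free collection, the sketch below checks a repository root for the manifest files listed in the table and builds a 3-level directory tree. It is not the plugin's actual collector: `detectManifests`, `dirTree`, the ecosystem labels, and the set of skipped directories are all hypothetical names and assumptions chosen for this example.

```javascript
const fs = require("node:fs");
const path = require("node:path");

// Manifest filenames come from the table above; the ecosystem labels are illustrative.
const MANIFESTS = {
  "package.json": "node",
  "Cargo.toml": "rust",
  "go.mod": "go",
  "pyproject.toml": "python",
  "setup.py": "python",
  "deno.json": "deno",
  "CMakeLists.txt": "c/c++",
  "meson.build": "c/c++",
  "pom.xml": "java",
  "build.gradle": "java",
};

// Assumption: these directory names count as build artifacts or noise.
const SKIP = new Set(["node_modules", ".git", "dist", "build", "target"]);

// Hypothetical helper: list the manifests present in the repo root.
function detectManifests(repoRoot) {
  return Object.keys(MANIFESTS)
    .filter((name) => fs.existsSync(path.join(repoRoot, name)))
    .map((name) => ({ file: name, ecosystem: MANIFESTS[name] }));
}

// Hypothetical helper: nested tree of entries, at most maxDepth levels deep.
function dirTree(root, maxDepth = 3, depth = 1) {
  if (depth > maxDepth) return [];
  return fs
    .readdirSync(root, { withFileTypes: true })
    .filter((entry) => !SKIP.has(entry.name))
    .sort((a, b) => a.name.localeCompare(b.name))
    .map((entry) =>
      entry.isDirectory()
        ? {
            name: entry.name + "/",
            children: dirTree(path.join(root, entry.name), maxDepth, depth + 1),
          }
        : { name: entry.name }
    );
}
```

Both helpers only touch the filesystem, which is the property the collector relies on: the result is plain structured data that can be handed to the agent as-is.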
| Level | Time | What's included |
|---|---|---|
| quick | ~2s | Manifest + README + structure + git |
| normal | ~5s | + CLAUDE.md/AGENTS.md + CI + repo-intel (default) |
| deep | ~15s | + repo-map AST symbols |
/onboard --depth=deep
/onboard /path/to/repo
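The depth levels are cumulative: each level adds data sources on top of the previous one. One way to model that, along with the two invocation forms shown above, is sketched here. The names `sourcesFor` and `parseArgs` and the source identifiers are hypothetical; the plugin's real argument handling may differ.

```javascript
// Levels in increasing order of cost, as listed in the table above.
const DEPTH_LEVELS = ["quick", "normal", "deep"];

// What each level adds on top of the previous one (identifiers are illustrative).
const SOURCES_ADDED = {
  quick: ["manifest", "readme", "structure", "git"],
  normal: ["claude-md", "ci", "repo-intel"],
  deep: ["repo-map"],
};

// Hypothetical helper: all sources collected at a given depth.
function sourcesFor(depth = "normal") {
  const idx = DEPTH_LEVELS.indexOf(depth);
  if (idx === -1) throw new Error(`unknown depth: ${depth}`);
  return DEPTH_LEVELS.slice(0, idx + 1).flatMap((level) => SOURCES_ADDED[level]);
}

// Hypothetical parser for "/onboard [--depth=LEVEL] [path]".
function parseArgs(argv) {
  const opts = { depth: "normal", path: "." };
  for (const arg of argv) {
    if (arg.startsWith("--depth=")) opts.depth = arg.slice("--depth=".length);
    else opts.path = arg;
  }
  return opts;
}
```

With this model, `sourcesFor(parseArgs(["--depth=deep"]).depth)` yields every source in the table, while a bare `/onboard` falls back to the `normal` set.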
The Opus agent synthesizes collected data into:
After the summary, the agent stays in conversation to answer follow-up questions, read specific files, and guide you to the right place for what you want to do.
Tested across 100 open-source repositories spanning 8 ecosystems:
| Language | Repos | Token savings vs manual |
|---|---|---|
| C/C++ | 6 | 86% |
| Go | 18 | 79% |
| Python | 17 | 74% |
| Rust | 16 | 73% |
| TypeScript | 31 | 71% |
| JavaScript | 11 | 55% |
| Java | 1 | - |
| Deno | 1 | - |
74% average token savings - the collector pre-structures project data in 68ms (median), so the agent spends tokens on synthesis and guidance instead of file discovery. Repos tested include nanoid, zod, express, hono, ripgrep, rich, django, flask, fastapi, gin, cobra, axum, serde, react, next.js, and 85 others.
Recognized manifest and build files: package.json, Cargo.toml, go.mod, pyproject.toml, setup.py, deno.json, CMakeLists.txt, meson.build, configure.ac, Makefile, pom.xml, build.gradle. Monorepos are also detected (npm workspaces, pnpm, lerna, Cargo workspaces, Python libs/, Deno workspaces).
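Monorepo detection of this kind usually comes down to a few file and field checks. The sketch below uses the common conventions for each tool (a `workspaces` field in package.json, a pnpm-workspace.yaml file, a lerna.json file, a `[workspace]` table in Cargo.toml, a `workspace` field in deno.json). Whether the plugin uses exactly these signals is an assumption, `detectMonorepo` is a hypothetical name, and the Python libs/ heuristic is left out because the source does not describe it.

```javascript
const fs = require("node:fs");
const path = require("node:path");

// Parse a JSON file, returning null if it is missing or malformed.
function readJson(file) {
  try {
    return JSON.parse(fs.readFileSync(file, "utf8"));
  } catch {
    return null;
  }
}

// Hypothetical helper: list the monorepo signals found in a repo root.
function detectMonorepo(root) {
  const at = (name) => path.join(root, name);
  const signals = [];

  const pkg = readJson(at("package.json"));
  if (pkg && pkg.workspaces) signals.push("npm-workspaces");

  if (fs.existsSync(at("pnpm-workspace.yaml"))) signals.push("pnpm");
  if (fs.existsSync(at("lerna.json"))) signals.push("lerna");

  // A Cargo workspace declares a [workspace] table in the root Cargo.toml.
  if (fs.existsSync(at("Cargo.toml"))) {
    const cargo = fs.readFileSync(at("Cargo.toml"), "utf8");
    if (/^\s*\[workspace\]/m.test(cargo)) signals.push("cargo-workspace");
  }

  const deno = readJson(at("deno.json"));
  if (deno && deno.workspace) signals.push("deno-workspace");

  return signals;
}
```

Returning a list of signals rather than a boolean keeps mixed setups visible, for example a repository that has both npm workspaces and a Cargo workspace.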
MIT