By 88plug
Cross-session, cross-CLI memory for AI coding assistants. Mines transcripts from 10 supported CLI clients. Surfaces operator identity / decisions / bans / corrections / goals / voice via 26 MCP tools, 6 hooks (SessionStart signpost + SessionStart compact-restore + UserPromptSubmit retrieval + Stop/PostCompact re-index + PreCompact continuity-seed), 15 slash commands, and 3 skills. gte-modernbert hybrid recall; worktree-aware project scoping; post-compaction coding-continuity packet. Optional local-LLM refinement ([llm] extra, ollama, off by default). The operator becomes the source of truth; the model stops re-asking what they already told it.
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Check if a provider, tool, or pattern is on the operator ban list before suggesting it.
Surface recent operator corrections on a topic — what Claude got wrong, what was demanded.
Estimate Anthropic cost from total-recall's index — per-model token breakdown over a window.
Show standing decisions the operator has made (e.g. provider-a > provider-b, billing-a > billing-b). Useful before recommending defaults.
Assess the operator's current frustration level from their last turn — calm/mild/escalated/high/breaking_point.
One-time setup for total-recall's optional local-LLM refinement layer — installs ollama (if missing) and pulls the configured model. Triggered when the SessionStart hook reports ollama / model missing.
Orient Claude on mining past Claude Code session logs via the total-recall plugin. Use when the user references prior work, asks "what did I do", asks about past decisions, mentions earlier sessions, wants to avoid repeating a corrected mistake, or asks why a previous choice was made. Surfaces user corrections, decisions, self-corrections, progress markers, and stable domain facts from ~/.claude/projects/*.jsonl across this and prior sessions in the same cwd or globally across all projects.
Match the operator's communication cadence (lowercase, terse, no preambles, "we" framing, no emojis). Use when responding in chat. Don't mock their typos — match their directness.
Admin access level
Server config contains admin-level keywords
Cross-session, cross-CLI memory for Claude Code and other AI coding assistants — it mines your own session transcripts so a new session already knows your decisions, corrections, bans, and goals.
Every session you run is already on disk as append-only JSONL: every decision, every "no, do it this way", every dead end. Total Recall reads that history locally and feeds the high-signal parts back to new sessions in a low-token form, so the model stops re-asking what you already told it.
Marketplace (recommended):
/plugin marketplace add 88plug/claude-code-plugins
/plugin install total-recall@88plug
Local checkout (for development):
git clone https://github.com/88plug/total-recall.git
cd total-recall
pip install -e .[vec]
claude --plugin-dir "$PWD"
[!NOTE] Requirements are
bash+curl+ internet. The plugin bootstraps everything else (uv, Python, deps) into its own data dir on first hook fire. No system-widepip installand no system Python required.
First run backfills your existing transcripts in the background (detached, so it survives the spawning session exiting; progress goes to logs/bootstrap.log). After that, every new session gets a short SessionStart brief about what is relevant to the current directory.
Check what was indexed:
/recall-status
Ask the model to use its memory at any time:
/recall what did we decide about the deploy pipeline?
For a manual full reindex:
total-recall index --rebuild --jobs 4
A typical corpus drops from about 22s single-threaded to about 9s at --jobs 4.
Claude Code has three kinds of memory today, and none of them mine the transcript history: amnesia (88plug) keeps one session alive across compaction but ignores other sessions; auto-memory (~/.claude/projects/<proj-slug>/memory/) is hand-curated; CLAUDE.md is static and hand-edited.
The operator is the source of truth. Models change and projects come and go; the human running the sessions is the one constant. Total Recall makes that explicit: an operator profile and voice profile are first-class extracted artifacts, queryable in one MCP call at session start.
17 extractors total. 11 run inline over each session's record stream; 6 are operator-level aggregators that run out-of-band against the full corpus.
corrections — turns where you redirected the model.decisions — "we're going with X because Y" moments.self_corrections — places the model corrected itself ("actually, scratch that").progress — how far a line of work actually got; anchors "we already did X".domain_facts — durable signals about the codebase or environment (versions, paths, conventions).away_summaries — recap text you wrote after returning to a stale session.model_corrections — corrections specifically about model behavior or output format.standing_decisions — decisions you marked as durable across sessions.bans — explicit "never do X" instructions.goals — what you said you are trying to achieve in a session.truth_rhetoric — assertions you made about objective state, kept so a later session can check whether they still hold.operator_profile — durable signals about the human: who they are, how they work, preferences across projects.voice_profile — how you write: tone, phrasing patterns, verbal tics, so a model can match register without being told.ontology — vocabulary you use for your own systems (project, machine, and service names) plus a cross-project co-mention graph.workflow — how you work: fan-out vocabulary, autonomy score, mid-flight interrupt rate, planning idiom, preferred work window, subagent adoption.implicit_preferences — preferences expressed by behavior rather than as a ban or decision (tool-call ratios, shell-command dominance, format preferences). Promoted only past a multi-axis threshold.satisfaction — bidirectional praise/frustration profile paired with the preceding assistant-turn shape.Live queries the model can call mid-conversation.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimnpx claudepluginhub 88plug/claude-code-plugins --plugin total-recallUltra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Eyes and hands on a Linux Wayland desktop: screenshot any monitor and click, type, scroll, drag, and read any visible app over xdg-desktop-portal (RemoteDesktop + ScreenCast), with optional OCR + OmniParser icon grounding. Pure-Python, CPU-only. GNOME/Wayland only. Ships the MCP server plus a drive-screen skill that encodes the locate-ground-act-confirm loop.
The sanctioned OS 'motor cortex' for an agent on a Linux box: control systemd services/timers, query journald, read host resources/processes, send desktop notifications, drive D-Bus, and manage power (suspend/reboot/poweroff) — all through structured interfaces (systemctl/loginctl/journalctl/busctl), never raw PID hacks. Pure standard library, zero pip deps. A self-preservation guard refuses severing actions on units the agent depends on (dbus, logind, sshd, network, tailscaled, the session, goosed) unless force=true. Ships the MCP server plus a control-os skill. The system-service counterpart to screen-mcp's GUI control.
Detects when Claude drifts away from your active output contract (a terse persona, hard formatting/length rules, an in-character voice) and quietly steers it back. Each assistant turn is scored by a deterministic, dependency-free engine; a status-line badge shows live drift and the next prompt gets a one-shot correction nudge when the previous reply broke contract.
Seamless context continuity across Claude Code compaction. Four background layers (continuous tool-call capture, mechanical PostCompact handoff, async Opus 4.7 enrichment at --effort max, async Stop-hook refinement, and preemptive snapshot before the next compact) keep the agent's working state intact across every compaction and resume. All summarization is isolated from CLAUDE.md/auto-memory and invisible to the user.
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Access thousands of AI prompts and skills directly in your AI coding assistant. Search prompts, discover skills, save your own, and improve prompts with AI.
Harness-native ECC plugin for engineering teams - 67 agents, 277 skills, 92 legacy command shims, reusable hooks, rules, MCP conventions, and operator workflows for Claude Code plus adjacent agent harnesses
A growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Upstash Context7 MCP server for up-to-date documentation lookup. Pull version-specific documentation and code examples directly from source repositories into your LLM context.
Develop, test, build, and deploy Godot 4.x games with Claude Code. Includes GdUnit4 testing, web/desktop exports, CI/CD pipelines, and deployment to Vercel/GitHub Pages/itch.io.