Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By skynetcmd
Store, search, and manage long-term memory for Claude Code using local SQLite with hybrid search (FTS5 + vector + MMR), multi-agent handoffs, chatlog auto-capture, and GDPR-compliant deletion — all offline with no external API dependencies.
npx claudepluginhub skynetcmd/m3-memory --plugin m3List registered agents and their last heartbeat. Useful for multi-agent setups.
GDPR Article 20 — export all memories you own as portable JSON.
Search captured chat-log turns from your prior Claude / Gemini sessions.
Delete a memory permanently. Asks for confirmation first.
Fetch one memory by UUID or short prefix.
Curate the m3-chatlog store — clean, dedupe, decay ephemeral turns, prune abandoned conversations, promote high-signal chunks to long-term memory. Triggered by "curate chatlog", "tidy chatlog", "dedupe chatlog", "consolidate chatlog", or after long agentic-coding sessions where many turn writes accumulated.
Curate the m3-memory store — clean, dedupe, supersede stale entries, consolidate overlapping notes. Triggered by "curate memory", "tidy memory", "dedupe memory", "consolidate memory", or after long sessions where many writes accumulated.
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
This plugin requires configuration values that are prompted when the plugin is enabled. Sensitive values are stored in your system keychain.
endpointOpenAI-compatible endpoint for embeddings + enrichment. Leave empty to probe LM Studio :1234 and Ollama :11434.
${user_config.endpoint}embed_ggufAbsolute path to a BGE-M3 GGUF file. When set, the MCP server loads it in-process via m3_core_rs for 10-100x faster embeds. Optional — if empty, all embeds go through tier-2 HTTP. Common path: %USERPROFILE%/.lmstudio/models/deepsweet/bge-m3-GGUF-Q4_K_M/bge-m3-GGUF-Q4_K_M.gguf
${user_config.embed_gguf}capture_modeWhich Claude Code hooks fire chatlog ingest. One of: both | stop | precompact | none. 'both' = Stop + PreCompact (recommended). Validated by mcp-memory at runtime; invalid values fall back to 'both'.
${user_config.capture_mode}embed_fallback_urlAlways-on HTTP embedder URL — the m3-embed-server CPU service (install via `m3-embed-server.exe install` on Windows, systemd unit on Linux). Default :8082. Leave empty to fall back to llm_failover probes (much slower; not BGE-M3).
${user_config.embed_fallback_url}MeMesh — Local memory for Claude Code and MCP coding agents. One SQLite file, zero cloud required.
Long-term memory for Claude Code — slash commands, skills, opt-in session hooks, codebase indexing, and a deep-retrieval subagent over the @memsy-io/mcp server.
Deja Vu memory layer for AI applications. Add persistent memory, personalization, and semantic search to Claude workflows using the Deja Vu Platform MCP server.
Give your AI a memory — mine projects and conversations into a searchable palace. 19 MCP tools, auto-save hooks, and guided setup.
Long-term semantic memory for Claude Code, powered by OpenViking. Auto-recall relevant memories at session start and capture important information during conversations.
memX: local-first semantic memory for coding agents. Native Claude Code lifecycle hooks.
![M3 Memory]
Local-first Memory Framework for AI Agents · 99.2% LongMemEval-S retrieval @ k=10 · Supports Claude · Gemini · Antigravity · OpenCode · OpenClaw · Hermes · MCP-native and plugins · Hybrid search (FTS5 + vector + MMR) · GDPR · FIPS 140-3 ready · 100% local (fully offline) or cloud capable
"Wait, you remember that?" — Stop re-explaining your project to your AI. Give it a long-term brain that stays 100% on your machine.
🚀 New to M3? Start here with our 5-minute "Human-First" guide.
Works with Claude Code, Gemini CLI, Aider, Google Antigravity, OpenCode, Hermes Agent, and any MCP-compatible agent. Quick one-line command to have your agent install chat log sub-system which saves verbatim chat log info, before compaction, with zero lag/latency and 100% retrieval recall. Just tell your AI agent "install m3-memory chat log sub-system" and your agent will automatically install it with all the proper hooks with some minimal customization questions from you (you can accept the default answers).
👉 I've read enough, I just want to install it on Windows, macOS, or Linux.
curl -fsSL https://raw.githubusercontent.com/skynetcmd/m3-memory/main/install.sh | bash
Installs on macOS or Linux with the single command above. Use this to install on Windows. Use this link to install manually and this to examine the script and what it does.
Claude Code users can also install as a plugin instead — gets you 15 /m3:* slash commands, two curator subagents (m3:curate-memory, m3:curate-chatlog), and auto-wired hooks:
/plugin marketplace add skynetcmd/m3-memory
/plugin install m3@skynetcmd
Plugin reference · Claude.ai (web/desktop) connector
Google Antigravity users can install the plugin directly:
agy plugin install https://github.com/skynetcmd/m3-memory
Hermes Agent users can install the memory-provider plugin directly (supports optimal replacement of default memory or parallel coexistence for rich SOTA retrieval):
# Handled automatically via our setup wizard:
m3 setup
Add to your MCP config:
{
"mcpServers": {
"memory": { "command": "m3" }
}
}
pip install m3-memory
m3 setup
m3 setup is an interactive wizard. It detects every agent on PATH (Claude
Code, Gemini CLI, OpenCode, OpenClaw), asks a handful of questions, then
drives the full install end-to-end: system payload, sovereign CPU embedder
(BGE-M3 on port 8082), per-agent MCP wiring, chatlog hooks, and a doctor
verification. Restart your agent — that's it.
The embedder ships in the repo. Our own BGE-M3 CPU embedder runs as a
small always-on service on 127.0.0.1:8082 after m3 setup. No LM
Studio, no Ollama, no GPU, no internet required for embedding to work.