Files
mempalace/website/index.md
T
Igor Lins e Silva f20a1a30fe docs(website): align mempalaceofficial.com with honest benchmarks
Part of #875. Bring the VitePress site into line with the new README
and the reproducibility scorecard: drop category-error comparisons,
drop retracted claims, retain only metrics and caveats that survive
audit.

website/index.md
 - New tagline matches README (local-first, verbatim, pluggable backend,
   96.6% R@5 raw, zero API calls).
 - Replace the "MemPalace hybrid 100% / Supermemory ~99% / Mastra
   94.87% / Mem0 ~85%" comparison table with a single honest table
   showing MemPalace's own retrieval-recall numbers (raw 96.6%,
   hybrid v4 held-out 98.4%). Add an explicit sentence explaining why
   we no longer publish a cross-system table on the landing page
   (retrieval recall vs QA accuracy are different metrics).
 - Soften the "ChromaDB-powered vector search" feature blurb to be
   backend-agnostic, since the retrieval layer is pluggable.

website/reference/benchmarks.md
 - Full rewrite of the retrieval-recall tables. No more "100%"
   headline; honest held-out 98.4% R@5 replaces it. Added the
   model-agnostic rerank result (99.2% R@5 / 100% R@10 with
   minimax-m2.7 via Ollama) to show the pipeline is not Haiku-specific.
 - Drop the LoCoMo "Hybrid v5 + Sonnet rerank (top-50) 100%" row.
   With per-conversation session counts of 19-32 and top_k=50, the
   retrieval stage returns every session by construction — the number
   measures an LLM's reading comprehension, not retrieval.
 - Drop the cross-system comparison tables. Link out to each project's
   own research page (Mastra, Mem0, Supermemory) for their published
   numbers and metric definitions.
 - Rewrite reproduction commands to use the correct repository and
   demonstrate the new --llm-backend ollama flag.

website/concepts/the-palace.md
 - Remove the "+34%" row / paragraph. Wing/room filtering is standard
   metadata filtering in the vector store, not a novel retrieval
   mechanism — the April-7 note already retracted that framing; this
   finishes the retraction on the website where it had remained.

website/guide/searching.md
 - Same treatment for "34% retrieval improvement". Reframe as
   operational scoping, not a novel boost.

website/reference/contributing.md
 - Update the "palace structure matters" bullet to reflect the same
   framing: scoping-not-magic.

website/concepts/knowledge-graph.md
 - Replace the MemPalace-vs-Zep feature matrix with a short "related
   work" note that links to Zep's own documentation for authoritative
   details on their deployment model. Avoids claims we cannot verify
   at source.
2026-04-14 21:37:45 -03:00

3.2 KiB

layout, hero, features
layout hero features
home
name text tagline image actions
MemPalace Give your AI a memory. Local-first AI memory. Verbatim storage, pluggable backend, 96.6% R@5 raw on LongMemEval — zero API calls.
src alt
/mempalace_logo.png MemPalace
theme text link
brand Get Started /guide/getting-started
theme text link
alt Architecture → /concepts/the-palace
theme text link
alt GitHub ↗ https://github.com/MemPalace/mempalace
icon title details
src alt
/icons/file-text.svg Verbatim Storage
Verbatim Storage Store source text directly instead of extracting facts up front. The raw benchmark result comes from retrieving verbatim content.
icon title details
src alt
/icons/building-2.svg Palace Structure
Palace Structure Wings and rooms give retrieval useful structure. In the project benchmarks, narrowing search scope outperformed flat search.
icon title details
src alt
/icons/search.svg Semantic Search
Semantic Search Vector search over verbatim content lets the model retrieve past discussions by topic, project, or room. Backend is pluggable.
icon title details
src alt
/icons/git-merge.svg Knowledge Graph
Knowledge Graph Temporal entity-relationship triples in SQLite. Facts can be added, queried, and invalidated over time.
icon title details
src alt
/icons/wrench.svg 19 MCP Tools
19 MCP Tools MCP tools expose search, filing, knowledge graph, graph navigation, and diary operations to compatible clients.
icon title details
src alt
/icons/shield-check.svg Zero Cloud
Zero Cloud Core storage and retrieval run locally. Optional reranking features can add an API dependency but are not required for the benchmark path.
<style> :root { --vp-home-hero-name-color: transparent; --vp-home-hero-name-background: linear-gradient( 135deg, #4f46e5 0%, #06b6d4 50%, #8b5cf6 100% ); } </style>

Verbatim Retrieval First

MemPalace stores source text and retrieves it with semantic search. The benchmarked raw mode does not require an LLM at any stage — no extraction, no rerank, no summarisation.

LongMemEval retrieval recall (500 questions):

Mode R@5 LLM required
Raw (semantic search over verbatim text) 96.6% None
Hybrid v4, held-out 450q 98.4% None

The raw 96.6% reproduces on any machine with the committed dataset: result JSONLs, the seed=42 train/held-out split, and the --mode raw / --held-out runners are all in the benchmarks/ directory of the repo.

We deliberately do not publish a side-by-side comparison against other memory systems on this page. Retrieval recall (R@5) and end-to-end QA accuracy are different metrics and are not comparable; where MemPalace can be fairly compared on the same metric, we link to the other project's published source.