mempalace/website/index.md at 10cdd93cec0031da73c59bf0c0e55c19df098516

Files

T

Igor Lins e Silva f20a1a30fe docs(website): align mempalaceofficial.com with honest benchmarks

Part of #875. Bring the VitePress site into line with the new README
and the reproducibility scorecard: drop category-error comparisons,
drop retracted claims, retain only metrics and caveats that survive
audit.

website/index.md
 - New tagline matches README (local-first, verbatim, pluggable backend,
   96.6% R@5 raw, zero API calls).
 - Replace the "MemPalace hybrid 100% / Supermemory ~99% / Mastra
   94.87% / Mem0 ~85%" comparison table with a single honest table
   showing MemPalace's own retrieval-recall numbers (raw 96.6%,
   hybrid v4 held-out 98.4%). Add an explicit sentence explaining why
   we no longer publish a cross-system table on the landing page
   (retrieval recall vs QA accuracy are different metrics).
 - Soften the "ChromaDB-powered vector search" feature blurb to be
   backend-agnostic, since the retrieval layer is pluggable.

website/reference/benchmarks.md
 - Full rewrite of the retrieval-recall tables. No more "100%"
   headline; honest held-out 98.4% R@5 replaces it. Added the
   model-agnostic rerank result (99.2% R@5 / 100% R@10 with
   minimax-m2.7 via Ollama) to show the pipeline is not Haiku-specific.
 - Drop the LoCoMo "Hybrid v5 + Sonnet rerank (top-50) 100%" row.
   With per-conversation session counts of 19-32 and top_k=50, the
   retrieval stage returns every session by construction — the number
   measures an LLM's reading comprehension, not retrieval.
 - Drop the cross-system comparison tables. Link out to each project's
   own research page (Mastra, Mem0, Supermemory) for their published
   numbers and metric definitions.
 - Rewrite reproduction commands to use the correct repository and
   demonstrate the new --llm-backend ollama flag.

website/concepts/the-palace.md
 - Remove the "+34%" row / paragraph. Wing/room filtering is standard
   metadata filtering in the vector store, not a novel retrieval
   mechanism — the April-7 note already retracted that framing; this
   finishes the retraction on the website where it had remained.

website/guide/searching.md
 - Same treatment for "34% retrieval improvement". Reframe as
   operational scoping, not a novel boost.

website/reference/contributing.md
 - Update the "palace structure matters" bullet to reflect the same
   framing: scoping-not-magic.

website/concepts/knowledge-graph.md
 - Replace the MemPalace-vs-Zep feature matrix with a short "related
   work" note that links to Zep's own documentation for authoritative
   details on their deployment model. Avoids claims we cannot verify
   at source.

2026-04-14 21:37:45 -03:00

3.2 KiB

Raw Blame History

layout, hero, features

layout

hero

features

home

name

text

tagline

image

actions

MemPalace

Give your AI a memory.

Local-first AI memory. Verbatim storage, pluggable backend, 96.6% R@5 raw on LongMemEval — zero API calls.

src	alt
/mempalace_logo.png	MemPalace

theme	text	link
brand	Get Started	/guide/getting-started

theme	text	link
alt	Architecture →	/concepts/the-palace

theme	text	link
alt	GitHub ↗	https://github.com/MemPalace/mempalace

icon

title

details

src	alt
/icons/file-text.svg	Verbatim Storage

Verbatim Storage

Store source text directly instead of extracting facts up front. The raw benchmark result comes from retrieving verbatim content.

icon

title

details

src	alt
/icons/building-2.svg	Palace Structure

Palace Structure

Wings and rooms give retrieval useful structure. In the project benchmarks, narrowing search scope outperformed flat search.

icon

title

details

src	alt
/icons/search.svg	Semantic Search

Semantic Search

Vector search over verbatim content lets the model retrieve past discussions by topic, project, or room. Backend is pluggable.

icon

title

details

src	alt
/icons/git-merge.svg	Knowledge Graph

Knowledge Graph

Temporal entity-relationship triples in SQLite. Facts can be added, queried, and invalidated over time.

icon

title

details

src	alt
/icons/wrench.svg	19 MCP Tools

19 MCP Tools

MCP tools expose search, filing, knowledge graph, graph navigation, and diary operations to compatible clients.

icon

title

details

src	alt
/icons/shield-check.svg	Zero Cloud

Zero Cloud

Core storage and retrieval run locally. Optional reranking features can add an API dependency but are not required for the benchmark path.

Verbatim Retrieval First

MemPalace stores source text and retrieves it with semantic search. The benchmarked raw mode does not require an LLM at any stage — no extraction, no rerank, no summarisation.

LongMemEval retrieval recall (500 questions):

Mode	R@5	LLM required
Raw (semantic search over verbatim text)	96.6%	None
Hybrid v4, held-out 450q	98.4%	None

The raw 96.6% reproduces on any machine with the committed dataset: result JSONLs, the seed=42 train/held-out split, and the --mode raw / --held-out runners are all in the benchmarks/ directory of the repo.

We deliberately do not publish a side-by-side comparison against other memory systems on this page. Retrieval recall (R@5) and end-to-end QA accuracy are different metrics and are not comparable; where MemPalace can be fairly compared on the same metric, we link to the other project's published source.

Full benchmark methodology →

3.2 KiB Raw Blame History

Verbatim Retrieval First

3.2 KiB

Raw Blame History