T

Pim Messelink 4a0f330cc1 fix(repair): scale HNSW divergence floor with hnsw:sync_threshold

The capacity probe added in #1227 hardcoded a 2,000-row floor for the
"diverged" decision. The comment justifying that number explicitly tied
it to chromadb's *default* sync_threshold of 1,000 — "Two synchronization
windows worth (2 × sync_threshold = 2000) is a safe steady-state ceiling".

#1191 then bumped sync_threshold to 50,000 via _HNSW_BLOAT_GUARD without
updating the floor. Result: any palace created with the bloat guard
flips between OK and DIVERGED on every flush cycle. Steady-state
divergence sits at 0–50K (the natural queue depth), and the 2,000 floor
trips the guardrail the moment the queue exceeds 10% of sqlite_count.
The MCP server then routes search to BM25-only and disables duplicate
detection for ~80% of the write cycle on actively-mined ≥100K palaces,
even though chromadb is behaving correctly.

This change reads the configured `hnsw:sync_threshold` from
`collection_metadata` per palace and scales the floor to 2 × that value.
The 10% relative term and the original #1222 detection capability are
unchanged — a 91%-missing-of-192K palace (the actual #1222 reproducer)
still trips, regardless of whether the collection was created with
sync_threshold=1000 or 50000.

Behavior summary:

| Collection's sync_threshold | New floor | Old floor |
|---|---|---|
| Missing (legacy palace)     | 2000      | 2000 (unchanged)
| 1000 (chromadb default)     | 2000      | 2000 (unchanged)
| 50000 (#1191 bloat guard)   | 100000    | 2000 (the bug)

Tests:
- test_capacity_status_tolerates_lag_under_large_sync_threshold (regression
  for the #1191/#1227 conflict — 100K sqlite + 50K HNSW + sync=50K → OK)
- test_capacity_status_still_flags_real_corruption_under_large_sync (#1222
  shape with bloat-guard collection — still detects corruption)
- test_capacity_status_default_threshold_when_no_sync_metadata (legacy
  palaces without the metadata row use the 2000 fallback floor)
- test_unflushed_path_also_uses_dynamic_floor (the never-flushed branch
  scales too — 30K under sync_threshold=50000 is no longer flagged)

All 18 pre-existing tests in tests/test_hnsw_capacity.py and 45 tests
in tests/test_backends.py still pass.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-01 00:31:47 +00:00

.agents/plugins

feat: add Codex plugin support with hooks, commands, and documentation

2026-04-08 19:10:44 +03:00

.claude-plugin

release: v3.3.3

2026-04-23 16:44:22 -07:00

.codex-plugin

release: v3.3.3

2026-04-23 16:44:22 -07:00

.devcontainer

feat: add VSCode devcontainer matching CI environment

2026-04-14 15:10:23 -03:00

.github

ci: bump Windows and macOS jobs to Python 3.13

2026-04-25 19:59:05 -03:00

assets

MemPalace: palace architecture, AAAK compression, knowledge graph

2026-04-04 18:16:04 -07:00

benchmarks

perf(mining): batch per-chunk upserts and add optional GPU acceleration

2026-04-24 19:42:35 -03:00

docs

docs: RFC 002 — source adapter plugin specification

2026-04-17 23:42:46 -07:00

examples

docs: fix HOOKS_TUTORIAL.md paths, matcher, and missing timeout

2026-04-22 11:31:40 +02:00

hooks

fix(hooks): consolidate transcript ingest, harden shell parsers (#1231 review)

2026-04-27 02:26:53 -03:00

integrations/openclaw

release: v3.3.0 (#839 )

2026-04-13 18:25:01 -07:00

landing

new landing page

2026-04-16 21:46:03 -03:00

mempalace

fix(repair): scale HNSW divergence floor with hnsw:sync_threshold

2026-05-01 00:31:47 +00:00

tests

fix(repair): scale HNSW divergence floor with hnsw:sync_threshold

2026-05-01 00:31:47 +00:00

website

feat(website): apply Crystal Lattice brand — fonts, palette, hero copy

2026-04-22 20:24:39 -07:00

.gitignore

Merge branch 'main' into docs/vitepress-site

2026-04-10 13:22:46 -03:00

.pre-commit-config.yaml

docs+tests: fix CI after README slim (#875 )

2026-04-14 21:59:55 -03:00

AGENTS.md

docs: add CLAUDE.md + mission/principles to AGENTS.md (#720 )

2026-04-12 15:28:01 -07:00

CHANGELOG.md

Merge pull request #1135 from sha2fiddy/feature/max-seq-id-shim-fix

2026-04-27 03:21:49 -03:00

CLAUDE.md

feat(init): context-aware corpus detection

2026-04-26 12:37:26 -07:00

CONTRIBUTING.md

docs: #875 follow-up — repo surfaces + reproduction URLs + CHANGELOG

2026-04-14 21:38:00 -03:00

LICENSE

MemPalace: palace architecture, AAAK compression, knowledge graph

2026-04-04 18:16:04 -07:00

MISSION.md

docs: add CLAUDE.md + mission/principles to AGENTS.md (#720 )

2026-04-12 15:28:01 -07:00

openarena-claim.txt

chore: add OpenArena owner claim verification file

2026-04-24 23:19:29 -03:00

pyproject.toml

perf(mining): batch per-chunk upserts and add optional GPU acceleration

2026-04-24 19:42:35 -03:00

README.md

fix(hooks): consolidate transcript ingest, harden shell parsers (#1231 review)

2026-04-27 02:26:53 -03:00

ROADMAP.md

docs: add ROADMAP.md — v3.1.1 stability patch and v4.0.0-alpha plan

2026-04-11 22:05:00 -07:00

SECURITY.md

docs: tighten SECURITY.md with real version policy and GHPVR-only channel

2026-04-14 11:50:00 -03:00

uv.lock

perf(mining): batch per-chunk upserts and add optional GPU acceleration

2026-04-24 19:42:35 -03:00

README.md

Caution

Scam alert. The only official sources for MemPalace are this GitHub repository, the PyPI package, and the docs site at mempalaceofficial.com. Any other domain — including mempalace.tech — is an impostor and may distribute malware. Details and timeline: docs/HISTORY.md.

MemPalace

Local-first AI memory. Verbatim storage, pluggable backend, 96.6% R@5 raw on LongMemEval — zero API calls.

What it is

MemPalace stores your conversation history as verbatim text and retrieves it with semantic search. It does not summarize, extract, or paraphrase. The index is structured — people and projects become wings, topics become rooms, and original content lives in drawers — so searches can be scoped rather than run against a flat corpus.

The retrieval layer is pluggable. The current default is ChromaDB; the interface is defined in mempalace/backends/base.py and alternative backends can be dropped in without touching the rest of the system.

Nothing leaves your machine unless you opt in.

Architecture, concepts, and mining flows: mempalaceofficial.com/concepts/the-palace.

Install

pip install mempalace
mempalace init ~/projects/myapp

Quickstart

# Mine content into the palace
mempalace mine ~/projects/myapp                    # project files
mempalace mine ~/.claude/projects/ --mode convos   # Claude Code sessions (scope with --wing per project)

# Search
mempalace search "why did we switch to GraphQL"

# Load context for a new session
mempalace wake-up

For Claude Code, Gemini CLI, MCP-compatible tools, and local models, see mempalaceofficial.com/guide/getting-started.

Benchmarks

All numbers below are reproducible from this repository with the commands in benchmarks/BENCHMARKS.md. Full per-question result files are committed under benchmarks/results_*.

LongMemEval — retrieval recall (R@5, 500 questions):

Mode	R@5	LLM required
Raw (semantic search, no heuristics, no LLM)	96.6%	None
Hybrid v4, held-out 450q (tuned on 50 dev, not seen during training)	98.4%	None
Hybrid v4 + LLM rerank (full 500)	≥99%	Any capable model

The raw 96.6% requires no API key, no cloud, and no LLM at any stage. The hybrid pipeline adds keyword boosting, temporal-proximity boosting, and preference-pattern extraction; the held-out 98.4% is the honest generalisable figure.

The rerank pipeline promotes the best candidate out of the top-20 retrieved sessions using an LLM reader. It works with any reasonably capable model — we have reproduced it with Claude Haiku, Claude Sonnet, and minimax-m2.7 via Ollama Cloud (no Anthropic dependency). The gap between raw and reranked is model-agnostic; we do not headline a "100%" number because the last 0.6% was reached by inspecting specific wrong answers, which benchmarks/BENCHMARKS.md flags as teaching to the test.

Other benchmarks (full results in benchmarks/BENCHMARKS.md):

Benchmark	Metric	Score	Notes
LoCoMo (session, top-10, no rerank)	R@10	60.3%	1,986 questions
LoCoMo (hybrid v5, top-10, no rerank)	R@10	88.9%	Same set
ConvoMem (all categories, 250 items)	Avg recall	92.9%	50 per category
MemBench (ACL 2025, 8,500 items)	R@5	80.3%	All categories

We deliberately do not include a side-by-side comparison against Mem0, Mastra, Hindsight, Supermemory, or Zep. Those projects publish different metrics on different splits, and placing retrieval recall next to end-to-end QA accuracy is not an honest comparison. See each project's own research page for their published numbers.

Reproducing every result:

git clone https://github.com/MemPalace/mempalace.git
cd mempalace
pip install -e ".[dev]"
# see benchmarks/README.md for dataset download commands
python benchmarks/longmemeval_bench.py /path/to/longmemeval_s_cleaned.json

Knowledge graph

MemPalace includes a temporal entity-relationship graph with validity windows — add, query, invalidate, timeline — backed by local SQLite. Usage and tool reference: mempalaceofficial.com/concepts/knowledge-graph.

MCP server

29 MCP tools cover palace reads/writes, knowledge-graph operations, cross-wing navigation, drawer management, and agent diaries. Installation and the full tool list: mempalaceofficial.com/reference/mcp-tools.

Agents

Each specialist agent gets its own wing and diary in the palace. Discoverable at runtime via mempalace_list_agents — no bloat in your system prompt: mempalaceofficial.com/concepts/agents.

Auto-save hooks

Two Claude Code hooks save periodically and before context compression: mempalaceofficial.com/guide/hooks.

For per-message recall on top of the file-level chunks the hooks produce, run mempalace sweep <transcript-dir> periodically — it stores one verbatim drawer per user/assistant message, idempotent and resume-safe.

Requirements

Python 3.9+
A vector-store backend (ChromaDB by default)
~300 MB disk for the default embedding model

No API key is required for the core benchmark path.

Docs

Getting started → mempalaceofficial.com/guide/getting-started
CLI reference → mempalaceofficial.com/reference/cli
Python API → mempalaceofficial.com/reference/python-api
Full benchmark methodology → benchmarks/BENCHMARKS.md
Release notes → CHANGELOG.md
Corrections and public notices → docs/HISTORY.md

Contributing

PRs welcome. See CONTRIBUTING.md.

License

MIT — see LICENSE.