mempalace/tests at f5c8b095dd9e057d804ed23daa4e1b07a1c9d63b - mempalace - GIT

jason/mempalace

Files

T

History

eblander f5c8b095dd fix: narrow _fix_blob_seq_ids shim + add repair --mode max-seq-id

The BLOB-seq_id migration shim (PR #664) ran int.from_bytes(..., 'big')
over every BLOB in max_seq_id, including chromadb 1.5.x's own native
format (b'\x11\x11' + 6 ASCII digits). That conversion yields a ~1.23e18
integer that silently suppresses every subsequent embeddings_queue write
for the affected segment (queue filter is seq_id > start), causing
silent drawer-write drops after a 1.5.x upgrade.

Two-part fix:

1. Shim narrowing (mempalace/backends/chroma.py)
   - Drop max_seq_id from the shim loop. chromadb owns that column's
     format; we don't reinterpret it.
   - Defense-in-depth: skip rows in embeddings whose seq_id BLOB has the
     sysdb-10 b'\x11\x11' prefix rather than misconvert.

2. Recovery command (mempalace/repair.py, mempalace/cli.py)
   - mempalace repair --mode max-seq-id [--segment <uuid>]
     [--from-sidecar <path>] [--dry-run] [--yes] [--no-backup]
   - Detects poisoned rows via threshold (seq_id > 2**53).
   - Default heuristic: MAX(embeddings.seq_id) over the collection owning
     the poisoned segment. Matches METADATA max exactly; VECTOR segments
     get a few seq_ids ahead (queue skips an already-indexed window — an
     acceptable loss vs. resetting to 0 and re-processing everything).
   - --from-sidecar copies clean values from a pre-corruption sqlite db.
   - Backs up chroma.sqlite3, closes chroma handles, atomic UPDATEs,
     post-repair verification that raises MaxSeqIdVerificationError if
     any row is still above threshold.

Tests: 8 new in tests/test_repair.py (detection, heuristic, sidecar,
dry-run, segment filter, no-op, backup, rollback-on-verify-failure).
3 new in tests/test_backends.py (max_seq_id untouched by shim,
sysdb-10 prefix skipped in embeddings, legacy big-endian u64 BLOBs
still convert). Full suite: 1103 passed.

2026-04-27 02:57:01 -03:00

..

perf: optimize regex compilation in entity extraction

2026-04-14 17:43:26 +00:00

conftest.py

fix(hnsw): gate quarantine_stale_hnsw to cold-start, not every reconnect

2026-04-26 09:40:25 -07:00

test_backends.py

fix: narrow _fix_blob_seq_ids shim + add repair --mode max-seq-id

2026-04-27 02:57:01 -03:00

test_claude_plugin_hook_wrappers.py

test: normalize wrapper script path for bash on Windows

2026-04-19 10:34:11 +02:00

test_cli.py

feat(init): context-aware corpus detection

2026-04-26 12:37:26 -07:00

test_closet_llm.py

merge: pr/closet-llm-generic + harden LLM regen path for production

2026-04-13 18:40:36 -03:00

test_closets.py

test: verify mine_lock via disjoint critical-section intervals

2026-04-13 19:08:57 -03:00

test_collection_metric_invariant.py

fix(test): use tmp_path for full-stack invariant test (Windows CI)

2026-04-25 00:39:37 -03:00

test_config_extra.py

test(config): make palace_path tests portable across POSIX and Windows

2026-04-24 11:13:51 +02:00

test_config.py

test: isolate embedding device env override tests

2026-04-24 23:09:23 +00:00

test_convo_miner_size_cap.py

Raise convo_miner MAX_FILE_SIZE cap 10 MB → 500 MB

2026-04-18 12:52:01 -03:00

test_convo_miner_unit.py

test: tidy embedding follow-up imports

2026-04-24 23:10:20 +00:00

test_convo_miner.py

feat(normalize): auto-rebuild stale drawers via NORMALIZE_VERSION schema gate

2026-04-13 16:20:55 -03:00

test_convo_scanner.py

fix(llm): tighter refinement — word boundaries, JSON extraction, authoritative sources

2026-04-24 01:30:40 -03:00

test_corpus_origin_integration.py

chore(corpus-origin): address Copilot review on #1223

2026-04-26 19:18:57 -03:00

test_corpus_origin.py

feat(init): context-aware corpus detection

2026-04-26 12:37:26 -07:00

test_dedup.py

refactor: route all chromadb access through ChromaBackend

2026-04-14 00:31:16 -03:00

test_dialect.py

fix: align cmd_compress dict keys with compression_stats() return values (#569 )

2026-04-11 16:16:31 -07:00

test_embedding.py

test: isolate embedding module state with monkeypatch

2026-04-24 23:11:29 +00:00

test_empty_chromadb_results.py

fix(searcher): guard against empty ChromaDB query results (#195 ) (#865 )

2026-04-15 00:26:38 -07:00

test_entity_detector.py

feat(graph): cross-wing tunnels by shared topics (#1180 )

2026-04-24 23:06:26 -03:00

test_entity_registry.py

fix: make entity_registry.research() local-only by default (#811 )

2026-04-15 00:26:24 -07:00

test_exporter.py

feat: new MCP tools — get/list/update drawer, hook settings, export (resolves #635 ) (#667 )

2026-04-11 21:25:04 -07:00

test_fact_checker.py

merge: full hardened stack + rewrite fact_checker around actual KG API

2026-04-13 18:20:11 -03:00

test_general_extractor.py

style: format test files with ruff

2026-04-08 21:08:49 +03:00

test_hall_detection.py

fix: README audit — 42 TDD tests + hall detection + 7 claim fixes (#835 )

2026-04-13 17:11:11 -07:00

test_hnsw_capacity.py

fix(repair): address Copilot review on #1227

2026-04-26 21:53:56 -03:00

test_hooks_cli.py

fix(hooks): consolidate transcript ingest, harden shell parsers (#1231 review)

2026-04-27 02:26:53 -03:00

test_hooks_shell.py

fix(hooks): MEMPAL_PYTHON override for .sh hooks' internal python3 calls

2026-04-21 01:43:08 -03:00

test_hybrid_search.py

merge: pr/closet-llm-generic + harden LLM regen path for production

2026-04-13 18:40:36 -03:00

test_i18n_lang_case.py

fix(i18n): resolve language codes case-insensitively (#927 )

2026-04-15 23:33:42 +02:00

test_i18n.py

Merge pull request #1051 from itfarrier/feat/i18n-belarusian

2026-04-21 01:09:54 -03:00

test_init_gitignore_protection.py

fix(init): auto-add per-project files to .gitignore in git repos (#185 ) (#866 )

2026-04-15 00:26:41 -07:00

test_instructions_cli.py

fix: add explicit UTF-8 encoding to read_text() calls (#776 )

2026-04-16 16:00:29 +05:00

test_kg_thread_safety.py

fix: add missing self._lock to KnowledgeGraph.close()

2026-04-14 13:09:10 -07:00

test_knowledge_graph_extra.py

test: bring coverage to 85%, set threshold to 85, reset version to 3.0.11

2026-04-08 21:38:12 +03:00

test_knowledge_graph.py

fix: ruff format test_hooks_cli.py and test_knowledge_graph.py

2026-04-08 15:12:12 -03:00

test_known_entities_registry.py

feat(graph): cross-wing tunnels by shared topics (#1180 )

2026-04-24 23:06:26 -03:00

test_layers.py

Мempalace backend seam (#413 )

2026-04-11 16:16:49 -07:00

test_llm_client.py

feat(privacy): treat Tailscale CGNAT range (100.64.0.0/10) as local

2026-04-26 15:31:44 -07:00

test_llm_refine.py

feat(graph): cross-wing tunnels by shared topics (#1180 )

2026-04-24 23:06:26 -03:00

test_mcp_server.py

fix(mcp): diary_read(wing='') spans all wings for agent (#1145 )

2026-04-23 23:39:34 -03:00

test_mcp_stdio_protection.py

fix(mcp): redirect stdout to stderr during import to protect JSON-RPC channel (#225 ) (#864 )

2026-04-15 00:26:51 -07:00

test_migrate.py

fix(migrate): harden swap rollback against partial cross-device copy

2026-04-24 13:12:10 +09:00

test_miner_jsonl_visibility.py

Harden sweeper for production: verbatim tool blocks, full session_id, logged failures

2026-04-18 13:14:32 -03:00

test_miner.py

test: use shlex.quote in resume-hint assertions for Windows

2026-04-25 01:18:31 -03:00

test_normalize.py

fix: add provenance header and speaker IDs to Slack transcript imports (#815 )

2026-04-15 00:27:01 -07:00

test_onboarding.py

fix: add explicit UTF-8 encoding to read_text() calls (#776 )

2026-04-16 16:00:29 +05:00

test_palace_graph_tunnels.py

Merge pull request #1168 from arnoldwender/fix/security-tunnels-permissions

2026-04-25 04:21:44 -03:00

test_palace_graph.py

fix(palace_graph): skip None metadata in build_graph

2026-04-25 11:06:32 -07:00

test_palace_locks.py

fix: Windows CI compat for palace lock tests and path normalization

2026-04-25 04:34:30 -03:00

test_project_scanner.py

feat(graph): cross-wing tunnels by shared topics (#1180 )

2026-04-24 23:06:26 -03:00

test_query_sanitizer.py

fix: make quote trimming explicit

2026-04-12 22:19:58 -03:00

test_readme_claims.py

docs+tests: fix CI after README slim (#875 )

2026-04-14 21:59:55 -03:00

test_repair.py

fix: narrow _fix_blob_seq_ids shim + add repair --mode max-seq-id

2026-04-27 02:57:01 -03:00

test_room_detector_local.py

fix: skip unreachable reparse points in detect_rooms_from_folders (#558 )

2026-04-11 16:16:06 -07:00

test_save_hook_mines.py

test(hooks): skip bash subprocess validator test on Windows

2026-04-27 02:45:04 -03:00

test_save_hook_verbose.py

feat: add MEMPAL_VERBOSE toggle — developers see diaries in chat (#871 )

2026-04-14 10:55:56 -07:00

test_searcher.py

chore: ruff format tests/test_searcher.py

2026-04-25 07:22:53 -07:00

test_sources.py

fix(sources): address Copilot review on #1014

2026-04-18 17:17:50 -03:00

test_spellcheck_extra.py

test: bring coverage to 85%, set threshold to 85, reset version to 3.0.11

2026-04-08 21:38:12 +03:00

test_spellcheck.py

style: format test files with ruff

2026-04-08 21:08:49 +03:00

test_split_mega_files.py

test: expand coverage to 70%, fix mcp_server CI crash (threshold 60%)

2026-04-08 21:07:03 +03:00

test_sweeper.py

Address Copilot review: cursor tie-break, honest metrics, accurate comments

2026-04-18 13:22:18 -03:00

test_version_consistency.py

fix: unify package and MCP version reporting

2026-04-07 08:53:25 +01:00