mempalace/tests at fed69935d343aba8f96b222b0f1f23e21e86583d - mempalace - GIT

jason/mempalace

Files

T

History

MSL fed69935d3 Add tandem sweeper: message-level safety net for dropped transcripts

The primary miners (miner.py, convo_miner.py) operate at file
granularity and can drop data for several reasons: size caps, silent
OSError on read, dedup false positives, extensions the project miner
does not recognize. Even with tonight's hotfixes, any future bug in
the file-level path risks silent data loss.

The sweeper is a second, cooperating miner that works at MESSAGE
granularity:

  - Parses Claude Code .jsonl line by line, yielding only
    user/assistant records (filters progress, file-history-snapshot,
    etc. noise).
  - For each session_id, queries the palace for max(timestamp) and
    treats that as the cursor.
  - Ingests only messages newer than the cursor, as one small drawer
    per exchange (never hits a size cap — each drawer is 1-5 KB).
  - Deterministic drawer IDs from session_id + message UUID make
    reruns idempotent; crash mid-sweep is safe.

Tandem coordination is free: if the primary miner committed up to
timestamp T, the sweeper resumes from T. If the primary miner missed
everything, the sweeper catches it all. Neither duplicates the other.

Smoke test on a real Claude Code transcript:
  1st run: +39 drawers, 0 already present
  2nd run: +0 drawers, 39 already present  (perfect idempotence)

Opt-in via:
  mempalace sweep <file.jsonl>
  mempalace sweep <transcript-dir>

No changes to existing miners. No schema migration. Purely additive.

Tests: tests/test_sweeper.py (7 tests covering parsing, tandem
coordination, idempotency, resume-from-cursor, metadata correctness).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-18 12:52:06 -03:00

..

perf: optimize regex compilation in entity extraction

2026-04-14 17:43:26 +00:00

conftest.py

Fix: ruff format with CI-pinned version (0.4.x)

2026-04-13 18:29:48 -04:00

test_backends.py

Fix: set cosine distance metadata on all collection creation sites

2026-04-13 11:00:52 -04:00

test_cli.py

refactor: route all chromadb access through ChromaBackend

2026-04-14 00:31:16 -03:00

test_closet_llm.py

merge: pr/closet-llm-generic + harden LLM regen path for production

2026-04-13 18:40:36 -03:00

test_closets.py

test: verify mine_lock via disjoint critical-section intervals

2026-04-13 19:08:57 -03:00

test_config_extra.py

test: bring coverage to 85%, set threshold to 85, reset version to 3.0.11

2026-04-08 21:38:12 +03:00

test_config.py

fix: use permissive validator for KG entity values (closes #455 )

2026-04-14 09:26:47 -04:00

test_convo_miner_size_cap.py

Raise convo_miner MAX_FILE_SIZE cap 10 MB → 500 MB

2026-04-18 12:52:01 -03:00

test_convo_miner_unit.py

fix: store full AI response in convo_miner exchange chunking (#695 )

2026-04-12 14:23:52 -07:00

test_convo_miner.py

feat(normalize): auto-rebuild stale drawers via NORMALIZE_VERSION schema gate

2026-04-13 16:20:55 -03:00

test_dedup.py

refactor: route all chromadb access through ChromaBackend

2026-04-14 00:31:16 -03:00

test_dialect.py

fix: align cmd_compress dict keys with compression_stats() return values (#569 )

2026-04-11 16:16:31 -07:00

test_empty_chromadb_results.py

fix(searcher): guard against empty ChromaDB query results (#195 ) (#865 )

2026-04-15 00:26:38 -07:00

test_entity_detector.py

fix(entity_detector): script-aware word boundaries for combining-mark scripts

2026-04-15 22:18:52 -03:00

test_entity_registry.py

fix: make entity_registry.research() local-only by default (#811 )

2026-04-15 00:26:24 -07:00

test_exporter.py

feat: new MCP tools — get/list/update drawer, hook settings, export (resolves #635 ) (#667 )

2026-04-11 21:25:04 -07:00

test_fact_checker.py

merge: full hardened stack + rewrite fact_checker around actual KG API

2026-04-13 18:20:11 -03:00

test_general_extractor.py

style: format test files with ruff

2026-04-08 21:08:49 +03:00

test_hall_detection.py

fix: README audit — 42 TDD tests + hall detection + 7 claim fixes (#835 )

2026-04-13 17:11:11 -07:00

test_hooks_cli.py

fix(hooks): stop precompact hook from blocking compaction (#856 , #858 ) (#863 )

2026-04-15 00:26:54 -07:00

test_hybrid_search.py

merge: pr/closet-llm-generic + harden LLM regen path for production

2026-04-13 18:40:36 -03:00

test_i18n_lang_case.py

fix(i18n): resolve language codes case-insensitively (#927 )

2026-04-15 23:33:42 +02:00

test_i18n.py

Add Indonesian language support

2026-04-16 16:15:47 +08:00

test_init_gitignore_protection.py

fix(init): auto-add per-project files to .gitignore in git repos (#185 ) (#866 )

2026-04-15 00:26:41 -07:00

test_instructions_cli.py

fix: add explicit UTF-8 encoding to read_text() calls (#776 )

2026-04-16 16:00:29 +05:00

test_kg_thread_safety.py

fix: add missing self._lock to KnowledgeGraph.close()

2026-04-14 13:09:10 -07:00

test_knowledge_graph_extra.py

test: bring coverage to 85%, set threshold to 85, reset version to 3.0.11

2026-04-08 21:38:12 +03:00

test_knowledge_graph.py

fix: ruff format test_hooks_cli.py and test_knowledge_graph.py

2026-04-08 15:12:12 -03:00

test_layers.py

Мempalace backend seam (#413 )

2026-04-11 16:16:49 -07:00

test_mcp_server.py

fix: return empty status instead of error on cold-start palace (#830 ) (#831 )

2026-04-15 00:26:35 -07:00

test_mcp_stdio_protection.py

fix(mcp): redirect stdout to stderr during import to protect JSON-RPC channel (#225 ) (#864 )

2026-04-15 00:26:51 -07:00

test_migrate.py

chore: clarify security guardrails

2026-04-12 22:19:58 -03:00

test_miner_jsonl_visibility.py

Raise MAX_FILE_SIZE cap from 10 MB to 500 MB

2026-04-18 12:52:01 -03:00

test_miner.py

fix: use i18n candidate patterns for entity extraction in miner and palace

2026-04-16 10:35:40 +05:00

test_normalize.py

fix: add provenance header and speaker IDs to Slack transcript imports (#815 )

2026-04-15 00:27:01 -07:00

test_onboarding.py

fix: add explicit UTF-8 encoding to read_text() calls (#776 )

2026-04-16 16:00:29 +05:00

test_palace_graph_tunnels.py

test: add palace_graph tunnel helper coverage

2026-04-15 11:38:18 +02:00

test_palace_graph.py

style: format test files with ruff

2026-04-08 21:08:49 +03:00

test_query_sanitizer.py

fix: make quote trimming explicit

2026-04-12 22:19:58 -03:00

test_readme_claims.py

docs+tests: fix CI after README slim (#875 )

2026-04-14 21:59:55 -03:00

test_repair.py

refactor: route all chromadb access through ChromaBackend

2026-04-14 00:31:16 -03:00

test_room_detector_local.py

fix: skip unreachable reparse points in detect_rooms_from_folders (#558 )

2026-04-11 16:16:06 -07:00

test_save_hook_mines.py

fix: save hook auto-mines transcript without MEMPAL_DIR (#840 )

2026-04-13 18:09:59 -07:00

test_save_hook_verbose.py

feat: add MEMPAL_VERBOSE toggle — developers see diaries in chat (#871 )

2026-04-14 10:55:56 -07:00

test_searcher.py

feat: include created_at timestamp in search results (#846 )

2026-04-15 00:26:57 -07:00

test_spellcheck_extra.py

test: bring coverage to 85%, set threshold to 85, reset version to 3.0.11

2026-04-08 21:38:12 +03:00

test_spellcheck.py

style: format test files with ruff

2026-04-08 21:08:49 +03:00

test_split_mega_files.py

test: expand coverage to 70%, fix mcp_server CI crash (threshold 60%)

2026-04-08 21:07:03 +03:00

test_sweeper.py

Add tandem sweeper: message-level safety net for dropped transcripts

2026-04-18 12:52:06 -03:00

test_version_consistency.py

fix: unify package and MCP version reporting

2026-04-07 08:53:25 +01:00