mempalace/tests at 36a8f219c251f39e77637a81717281323fd1cd5c - mempalace - GIT

jason/mempalace

Igor Lins e Silva 10a743d5d8 feat(llm): interactive entity refinement with batching and cancellation

Takes the candidate set produced by phase-1 detection (manifests, git
authors, regex on prose) and asks an LLM to reclassify each candidate
as PERSON / PROJECT / TOPIC / COMMON_WORD / AMBIGUOUS.

Scale approach: never feed the raw corpus to the LLM. For each
candidate, collect up to 3 context lines from sampled prose, cap each
at 240 chars, batch 25 candidates per call. Keeps total input around
50-100K tokens even on large corpora and completes in a few minutes
on a 4B local model.
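
The limits above could be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the function names `collect_context` and `make_batches`, the case-insensitive substring match, and the constant names are all assumptions; only the numbers (3 lines, 240 chars, 25 per batch) come from the message.

```python
from typing import Iterable, Iterator, List

MAX_CONTEXT_LINES = 3   # context lines kept per candidate
MAX_LINE_CHARS = 240    # cap applied to each context line
BATCH_SIZE = 25         # candidates sent per LLM call


def collect_context(name: str, prose_lines: Iterable[str]) -> List[str]:
    """Return up to MAX_CONTEXT_LINES lines mentioning `name`, each truncated.

    Matching is a plain case-insensitive substring test (an assumption).
    """
    needle = name.lower()
    found: List[str] = []
    for line in prose_lines:
        if needle in line.lower():
            found.append(line.strip()[:MAX_LINE_CHARS])
            if len(found) == MAX_CONTEXT_LINES:
                break
    return found


def make_batches(items: List[str], size: int = BATCH_SIZE) -> Iterator[List[str]]:
    """Yield consecutive slices of at most `size` candidates."""
    for start in range(0, len(items), size):
        yield items[start:start + size]
```

Under these caps, each batch carries at most 25 × 3 × 240 = 18,000 characters of context, regardless of corpus size.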

Interactive UX:
- Stderr progress bar with the current candidate name, updates
  per-batch.
- Ctrl-C interrupts cleanly: returns a RefineResult with
  `cancelled=True` and whatever was classified before the interrupt.
  The partial result is safe to pass straight to confirm_entities.
- Per-batch errors (transport, parse) are recorded in `errors` and
  don't abort the whole run.
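
The cancellation and error-handling behavior described in this list might look something like the sketch below. `RefineResult`, `cancelled`, and `errors` are named in the message; the `labels` field, the `refine` signature, and the `classify_batch` callable are hypothetical stand-ins, and the progress bar is omitted.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List


@dataclass
class RefineResult:
    # `labels` is a hypothetical field name for classifications gathered so far.
    labels: Dict[str, str] = field(default_factory=dict)
    errors: List[str] = field(default_factory=list)
    cancelled: bool = False


def refine(
    batches: Iterable[List[str]],
    classify_batch: Callable[[List[str]], Dict[str, str]],
) -> RefineResult:
    """Classify batch by batch, keeping partial results on Ctrl-C."""
    result = RefineResult()
    try:
        for index, batch in enumerate(batches):
            try:
                result.labels.update(classify_batch(batch))
            except Exception as exc:
                # Transport or parse failure: record it and move on.
                result.errors.append(f"batch {index}: {exc}")
    except KeyboardInterrupt:
        # KeyboardInterrupt is not an Exception subclass, so it skips the
        # inner handler and lands here with earlier labels intact.
        result.cancelled = True
    return result
```

Because `KeyboardInterrupt` derives from `BaseException`, the per-batch handler cannot swallow it by accident.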

Refinement scope: only `uncertain` and low-confidence `projects`
entries are sent. Manifest-backed projects (conf >= 0.95) and git-
authored people are already authoritative and skip the LLM.
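
A rough sketch of that scope filter, assuming phase-1 output is a dict of buckets (`people`, `projects`, `uncertain`) holding entries with a `confidence` field. That data shape and the function name `select_for_refinement` are assumptions; only the 0.95 threshold and the bucket names `uncertain` and `projects` come from the message.

```python
from typing import Dict, List

AUTHORITATIVE_CONFIDENCE = 0.95  # manifest-backed projects sit at or above this


def select_for_refinement(detected: Dict[str, List[dict]]) -> List[dict]:
    """Pick the entries worth sending to the LLM.

    Everything in `uncertain` goes; `projects` entries go only when their
    confidence is below the authoritative threshold. The `people` bucket
    is never read, so git-authored people skip the LLM entirely.
    """
    to_refine = list(detected.get("uncertain", []))
    to_refine += [
        entry
        for entry in detected.get("projects", [])
        if entry.get("confidence", 0.0) < AUTHORITATIVE_CONFIDENCE
    ]
    return to_refine
```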

Response parser is defensive — accepts `label` or `type` keys,
lowercase/uppercase variants, top-level list or wrapped object, and
strips markdown code fences. Unknown labels become AMBIGUOUS so the
user reviews them rather than silently accepting a bad classification.
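
One way such a tolerant parser could be written is sketched below. It is illustrative only: the `name` key, the rule of taking the first list-valued field of a wrapped object, and the function names are assumptions; the accepted `label`/`type` keys, case folding, fence stripping, and the AMBIGUOUS fallback follow the description above.

```python
import json
from typing import Dict

VALID_LABELS = {"PERSON", "PROJECT", "TOPIC", "COMMON_WORD", "AMBIGUOUS"}
FENCE = "`" * 3  # markdown code fence, built this way to keep the example fence-safe


def strip_fences(text: str) -> str:
    """Remove a surrounding markdown code fence, if present."""
    text = text.strip()
    if text.startswith(FENCE):
        lines = text.splitlines()[1:]  # drop the opening fence (and language tag)
        if lines and lines[-1].strip() == FENCE:
            lines = lines[:-1]         # drop the closing fence
        text = "\n".join(lines)
    return text.strip()


def parse_response(raw: str) -> Dict[str, str]:
    """Map candidate name -> label, downgrading unknown labels to AMBIGUOUS."""
    data = json.loads(strip_fences(raw))
    if isinstance(data, dict):
        # Wrapped object: use the first list-valued field (assumed convention).
        data = next((v for v in data.values() if isinstance(v, list)), [])
    if not isinstance(data, list):
        return {}
    labels: Dict[str, str] = {}
    for item in data:
        if not isinstance(item, dict) or "name" not in item:
            continue
        label = str(item.get("label") or item.get("type") or "").upper()
        labels[item["name"]] = label if label in VALID_LABELS else "AMBIGUOUS"
    return labels
```

Malformed JSON still raises from `json.loads`; in a batched run that would be one of the per-batch parse errors recorded rather than a fatal failure.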

`collect_corpus_text` provides a simple stratified prose sampler
(recent first, capped per-file) so callers don't need to build their
own corpus window.
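
The sampling idea (most recent files first, each capped) can be illustrated with the sketch below. It does not reproduce `collect_corpus_text`, whose signature is not shown here; `sample_prose`, its parameters, and the default caps are hypothetical, and "recent" is taken to mean file modification time.

```python
from pathlib import Path
from typing import Iterable, List


def sample_prose(
    paths: Iterable[Path],
    per_file_chars: int = 4000,    # hypothetical per-file cap
    total_chars: int = 200_000,    # hypothetical overall budget
) -> str:
    """Concatenate capped excerpts, newest files first, within a total budget."""
    ordered = sorted(paths, key=lambda p: p.stat().st_mtime, reverse=True)
    chunks: List[str] = []
    remaining = total_chars
    for path in ordered:
        if remaining <= 0:
            break
        text = path.read_text(encoding="utf-8", errors="replace")
        excerpt = text[: min(per_file_chars, remaining)]
        chunks.append(excerpt)
        remaining -= len(excerpt)
    return "\n".join(chunks)
```

Capping per file keeps one very large document from crowding out the rest of the sample.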

28 tests with a FakeProvider (no network). Covers context collection,
prompt building, response parsing variants, classification apply,
end-to-end refine, and Ctrl-C partial-result behavior.

2026-04-24 00:46:59 -03:00

..

perf: optimize regex compilation in entity extraction

2026-04-14 17:43:26 +00:00

conftest.py

Fix: ruff format with CI-pinned version (0.4.x)

2026-04-13 18:29:48 -04:00

test_backends.py

feat(backends): quarantine_stale_hnsw — recover from HNSW/sqlite drift

2026-04-18 18:04:05 -07:00

test_claude_plugin_hook_wrappers.py

test: normalize wrapper script path for bash on Windows

2026-04-19 10:34:11 +02:00

test_cli.py

test: update test_cli assertions for mempalace-mcp entry point

2026-04-21 01:26:47 -03:00

test_closet_llm.py

merge: pr/closet-llm-generic + harden LLM regen path for production

2026-04-13 18:40:36 -03:00

test_closets.py

test: verify mine_lock via disjoint critical-section intervals

2026-04-13 19:08:57 -03:00

test_config_extra.py

test: bring coverage to 85%, set threshold to 85, reset version to 3.0.11

2026-04-08 21:38:12 +03:00

test_config.py

fix: use permissive validator for KG entity values (closes #455)

2026-04-14 09:26:47 -04:00

test_convo_miner_size_cap.py

Raise convo_miner MAX_FILE_SIZE cap 10 MB → 500 MB

2026-04-18 12:52:01 -03:00

test_convo_miner_unit.py

fix: store full AI response in convo_miner exchange chunking (#695)

2026-04-12 14:23:52 -07:00

test_convo_miner.py

feat(normalize): auto-rebuild stale drawers via NORMALIZE_VERSION schema gate

2026-04-13 16:20:55 -03:00

test_convo_scanner.py

feat(convo): parse Claude Code conversation dirs into project entities

2026-04-24 00:46:31 -03:00

test_dedup.py

refactor: route all chromadb access through ChromaBackend

2026-04-14 00:31:16 -03:00

test_dialect.py

fix: align cmd_compress dict keys with compression_stats() return values (#569)

2026-04-11 16:16:31 -07:00

test_empty_chromadb_results.py

fix(searcher): guard against empty ChromaDB query results (#195) (#865)

2026-04-15 00:26:38 -07:00

test_entity_detector.py

fix(entity): reduce noise in regex-based detection

2026-04-24 00:20:32 -03:00

test_entity_registry.py

fix: make entity_registry.research() local-only by default (#811)

2026-04-15 00:26:24 -07:00

test_exporter.py

feat: new MCP tools — get/list/update drawer, hook settings, export (resolves #635) (#667)

2026-04-11 21:25:04 -07:00

test_fact_checker.py

merge: full hardened stack + rewrite fact_checker around actual KG API

2026-04-13 18:20:11 -03:00

test_general_extractor.py

style: format test files with ruff

2026-04-08 21:08:49 +03:00

test_hall_detection.py

fix: README audit — 42 TDD tests + hall detection + 7 claim fixes (#835)

2026-04-13 17:11:11 -07:00

test_hooks_cli.py

fix: add wing param to diary_write/diary_read, derive from transcript path (#659)

2026-04-23 15:07:25 -07:00

test_hooks_shell.py

fix(hooks): MEMPAL_PYTHON override for .sh hooks' internal python3 calls

2026-04-21 01:43:08 -03:00

test_hybrid_search.py

merge: pr/closet-llm-generic + harden LLM regen path for production

2026-04-13 18:40:36 -03:00

test_i18n_lang_case.py

fix(i18n): resolve language codes case-insensitively (#927)

2026-04-15 23:33:42 +02:00

test_i18n.py

Merge pull request #1051 from itfarrier/feat/i18n-belarusian

2026-04-21 01:09:54 -03:00

test_init_gitignore_protection.py

fix(init): auto-add per-project files to .gitignore in git repos (#185) (#866)

2026-04-15 00:26:41 -07:00

test_instructions_cli.py

fix: add explicit UTF-8 encoding to read_text() calls (#776)

2026-04-16 16:00:29 +05:00

test_kg_thread_safety.py

fix: add missing self._lock to KnowledgeGraph.close()

2026-04-14 13:09:10 -07:00

test_knowledge_graph_extra.py

test: bring coverage to 85%, set threshold to 85, reset version to 3.0.11

2026-04-08 21:38:12 +03:00

test_knowledge_graph.py

fix: ruff format test_hooks_cli.py and test_knowledge_graph.py

2026-04-08 15:12:12 -03:00

test_layers.py

Mempalace backend seam (#413)

2026-04-11 16:16:49 -07:00

test_llm_client.py

feat(llm): pluggable provider abstraction for entity refinement

2026-04-24 00:46:43 -03:00

test_llm_refine.py

feat(llm): interactive entity refinement with batching and cancellation

2026-04-24 00:46:59 -03:00

test_mcp_server.py

fix(mcp): guard tool_status/list_wings/list_rooms/get_taxonomy against None metadata

2026-04-18 12:38:23 -07:00

test_mcp_stdio_protection.py

fix(mcp): redirect stdout to stderr during import to protect JSON-RPC channel (#225) (#864)

2026-04-15 00:26:51 -07:00

test_migrate.py

chore: clarify security guardrails

2026-04-12 22:19:58 -03:00

test_miner_jsonl_visibility.py

Harden sweeper for production: verbatim tool blocks, full session_id, logged failures

2026-04-18 13:14:32 -03:00

test_miner.py

fix(miner): same None-metadata guard for status() histogram loop

2026-04-18 10:26:11 -07:00

test_normalize.py

fix: add provenance header and speaker IDs to Slack transcript imports (#815)

2026-04-15 00:27:01 -07:00

test_onboarding.py

fix: add explicit UTF-8 encoding to read_text() calls (#776)

2026-04-16 16:00:29 +05:00

test_palace_graph_tunnels.py

test: add palace_graph tunnel helper coverage

2026-04-15 11:38:18 +02:00

test_palace_graph.py

fix: clarify cache docs, skip caching empty graphs

2026-04-16 09:00:27 -07:00

test_project_scanner.py

feat(init): scan manifests and git authors for real entity signal

2026-04-24 00:20:53 -03:00

test_query_sanitizer.py

fix: make quote trimming explicit

2026-04-12 22:19:58 -03:00

test_readme_claims.py

docs+tests: fix CI after README slim (#875)

2026-04-14 21:59:55 -03:00

test_repair.py

refactor: route all chromadb access through ChromaBackend

2026-04-14 00:31:16 -03:00

test_room_detector_local.py

fix: skip unreachable reparse points in detect_rooms_from_folders (#558)

2026-04-11 16:16:06 -07:00

test_save_hook_mines.py

fix: save hook auto-mines transcript without MEMPAL_DIR (#840)

2026-04-13 18:09:59 -07:00

test_save_hook_verbose.py

feat: add MEMPAL_VERBOSE toggle — developers see diaries in chat (#871)

2026-04-14 10:55:56 -07:00

test_searcher.py

fix(searcher): guard API path + closet loop against None metadata too

2026-04-18 10:37:05 -07:00

test_sources.py

fix(sources): address Copilot review on #1014

2026-04-18 17:17:50 -03:00

test_spellcheck_extra.py

test: bring coverage to 85%, set threshold to 85, reset version to 3.0.11

2026-04-08 21:38:12 +03:00

test_spellcheck.py

style: format test files with ruff

2026-04-08 21:08:49 +03:00

test_split_mega_files.py

test: expand coverage to 70%, fix mcp_server CI crash (threshold 60%)

2026-04-08 21:07:03 +03:00

test_sweeper.py

Address Copilot review: cursor tie-break, honest metrics, accurate comments

2026-04-18 13:22:18 -03:00

test_version_consistency.py

fix: unify package and MCP version reporting

2026-04-07 08:53:25 +01:00