mempalace

Author	SHA1	Message	Date
Igor Lins e Silva	c4e52954fe	Merge upstream/main into bench/scale-test-suite to resolve conflicts Merged both the PR's benchmark suite additions (psutil dep, pytest markers, --ignore=tests/benchmarks) and upstream's coverage changes (pytest-cov, --cov-fail-under=30, coverage config) so both coexist. Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2026-04-08 16:28:06 -03:00
Igor Lins e Silva	a0bcd0c836	fix: ruff format test_hooks_cli.py and test_knowledge_graph.py	2026-04-08 15:12:12 -03:00
Igor Lins e Silva	af42a850f6	fix: split semicolon statements onto two lines for ruff E702	2026-04-08 15:11:55 -03:00
Igor Lins e Silva	bf88daa649	fix: address review — re-mine modified files, idempotent add_drawer, cleanup ChromaDB handles	2026-04-08 15:11:55 -03:00
Igor Lins e Silva	a4149ab248	fix: use upsert and deterministic IDs to prevent data stagnation MCP tool_add_drawer: - Make drawer_id content-based: hash full content instead of content[:100] + timestamp. Same content → same ID, eliminating TOCTOU race conditions - Switch from col.add() to col.upsert() so re-filing with updated content updates the existing drawer miner.add_drawer: - Switch from collection.add() to collection.upsert() so re-mining a modified file updates instead of silently failing - Remove the try/except catching 'already exists' — upsert handles this naturally Findings: #11 (HIGH — add ignores updates), #6 (MEDIUM — TOCTOU), #13 (MEDIUM — non-deterministic IDs) Includes test infrastructure from PR #131. 92 tests pass.	2026-04-08 15:11:55 -03:00
Ben Sigman	59d011a23b	Merge pull request #270 from tmuskal/main Package MemPalace as standard Claude and Codex plugins with easy installation	2026-04-08 10:41:45 -07:00
Tal Muskal	9de302f881	feat: update README and CI configuration, add tests for hooks functionality	2026-04-08 20:40:03 +03:00
Igor Lins e Silva	ebc26f3960	fix: resolve formatting, regression logic, and pytest defaults - Run ruff format on all benchmark files (fixes CI lint job) - Fix check_regression() substring ambiguity: ordered keyword matching so "latency_improvement_pct" is correctly classified as higher-is-better - Update stale comments in conftest.py referencing wrong fixture - Add pytest addopts to skip benchmark/slow/stress markers by default	2026-04-08 10:56:39 -03:00
Igor Lins e Silva	7e4db33061	fix: resolve ruff lint errors in benchmark suite Remove unused imports (shutil, string, datetime, os, yaml, time, SCALE_CONFIGS) and unused variable assignments in timing-only calls.	2026-04-08 05:10:39 -03:00
Igor Lins e Silva	e8017ca2ec	bench: add per-room recall threshold test Concentrates all drawers into a single wing+room to isolate the embedding model's retrieval limit independent of palace filtering. Confirms recall degrades to ~0.4-0.5 at 5K drawers per room even with wing+room filters applied — the spatial structure helps by keeping buckets small, but can't fix the underlying embedding ceiling.	2026-04-08 05:06:31 -03:00
Igor Lins e Silva	7b89291334	bench: add scale benchmark suite (94 tests) Benchmark mempalace at configurable scale (1K–100K drawers) to find real-world performance limits. Tests cover MCP tool OOM thresholds, ChromaDB query degradation, search recall@k, mining throughput, knowledge graph concurrency, memory leak detection, palace boost quantification, and Layer1 unbounded fetch behavior. - tests/benchmarks/ with 8 test modules + data generator + report system - Deterministic data factory with planted needles for recall measurement - JSON report output with regression detection (--bench-report flag) - CI benchmark job on PRs at small scale - psutil added as dev dependency for RSS tracking	2026-04-08 05:06:31 -03:00
Igor Lins e Silva	47696bef8c	fix: address Copilot review — derive MCP version, improve test isolation and portability	2026-04-08 04:41:03 -03:00
Igor Lins e Silva	a67b00d7c7	perf: cache ChromaDB PersistentClient instead of re-instantiating per call The MCP server previously created a new PersistentClient on every tool call via _get_collection(). This incurs HNSW index loading overhead on each request. Cache the client and collection at module level. The cache resets naturally on process restart (MCP runs as a subprocess). Also adds a _reset_mcp_cache fixture to conftest.py for test isolation. Includes test infrastructure from PR #131. 92 tests pass.	2026-04-08 04:39:19 -03:00
Ben Sigman	a8de2911e5	Merge pull request #136 from igorls/fix/kg-hardening fix: enable SQLite WAL mode and add consistent LIMIT to KG timeline	2026-04-07 16:05:13 -07:00
Igor Lins e Silva	d3145e9a7b	fix: update dialect tests for PR #147 stats API and remove unused fixture param	2026-04-07 18:58:25 -03:00
Igor Lins e Silva	6fa985eac2	fix: update dialect tests for PR #147 stats API and remove unused fixture param	2026-04-07 18:58:20 -03:00
Igor Lins e Silva	b45bff9db1	test: add WAL mode and entity timeline limit assertions	2026-04-07 18:27:19 -03:00
Igor Lins e Silva	5ac4947d02	fix: preserve CLI exit codes, log tracebacks, sanitize search errors, validate fixture	2026-04-07 18:26:39 -03:00
Ben Sigman	27623a3b17	Merge pull request #131 from igorls/test/expand-coverage-and-uv-migration test: expand coverage from 20 to 92 tests, migrate to uv	2026-04-07 14:15:01 -07:00
Igor Lins e Silva	96de23cd97	fix: CI failures — update workflow for uv migration, fix lint and format - Switch CI install step from `pip install -r requirements.txt` to `pip install -e ".[dev]"` since requirements.txt was removed - Add noqa: E402 to intentionally-late imports in conftest.py (HOME must be isolated before mempalace imports) - Remove unused KnowledgeGraph import in test_knowledge_graph.py - Apply ruff formatting to test files	2026-04-07 17:59:21 -03:00
Ben Sigman	3068f75c2d	Merge pull request #22 from sheetsync/bugfix/split-known-names-loading refactor: consolidate split known-names config loading	2026-04-07 13:58:54 -07:00
Igor Lins e Silva	cd8b245fdc	fix: address Copilot review — remove unused imports, isolate HOME in tests, restore dev extra	2026-04-07 17:55:10 -03:00
Igor Lins e Silva	72c548b729	test: expand coverage from 20 to 92 tests, migrate to uv - Migrate from setuptools to hatchling build backend - Add dependency-groups (PEP 735) for dev tooling (pytest, ruff) - Remove redundant requirements.txt in favor of uv.lock - Fix __version__ mismatch (2.0.0 -> 3.0.0 to match pyproject.toml) New test files: - conftest.py: shared fixtures (isolated palace, KG, ChromaDB collection) - test_knowledge_graph.py: 17 tests (entity CRUD, temporal queries, timeline) - test_mcp_server.py: 25 tests (protocol dispatch, read/write/KG/diary tools) - test_searcher.py: 7 tests (search_memories API, filters, error handling) - test_dialect.py: 13 tests (AAAK compression, entity/emotion detection, zettel encoding) All 92 tests pass on Python 3.13 with chromadb 0.6.3.	2026-04-07 17:55:10 -03:00
Ben Sigman	e8f9b47e31	Merge pull request #16 from sheetsync/bugfix/version-consistency fix: unify package and MCP version reporting	2026-04-07 13:54:03 -07:00
ac-opensource	c8c220d789	fix: support nested .gitignore rules during mining	2026-04-08 00:02:21 +08:00
ac-opensource	9b9daa9b4b	fix: respect .gitignore during project mining	2026-04-07 22:26:06 +08:00
James Cane	0808ad96c2	refactor: consolidate split known-names config loading	2026-04-07 09:16:07 +01:00
James Cane	55152ce476	fix: unify package and MCP version reporting	2026-04-07 08:53:25 +01:00
bensig	0f8fa8c7d5	bench: add benchmark runners, results docs, and test suite Benchmarks: LongMemEval, LoCoMo, ConvoMem, MemBench runners with methodology docs and hybrid retrieval analysis. Tests: config, miner, convo_miner, normalize — 9 tests, all passing.	2026-04-04 18:33:42 -07:00

29 Commits