# Searching Memories

MemPalace uses ChromaDB's semantic vector search to find relevant memories. When you search, you get **verbatim text** — the exact words, never summaries.

## CLI Search

```bash
# Search everything
mempalace search "why did we switch to GraphQL"

# Filter by wing (project)
mempalace search "database decision" --wing myapp

# Filter by room (topic)
mempalace search "auth decisions" --room auth-migration

# Filter by both
mempalace search "pricing" --wing driftwood --room costs

# More results
mempalace search "deploy process" --results 10
```

## How Search Works

1. Your query is embedded using the vector store's default model (`all-MiniLM-L6-v2` with the default ChromaDB backend).
2. The query embedding is compared against the stored drawer embeddings using cosine similarity.
3. Optional wing/room filters narrow the search scope, so only drawers inside that scope are scored. This is standard metadata filtering in the underlying vector store.
4. Results are returned with similarity scores and source metadata.
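
The steps above can be illustrated with a minimal, brute-force sketch. It is not MemPalace's implementation: `toy_search`, the hand-written three-dimensional "embeddings", and the sample drawers are all hypothetical, and a real vector store would typically use an index rather than scoring every candidate in a loop.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as "not similar to anything"
    return dot / (norm_a * norm_b)


def toy_search(query_vec, drawers, wing=None, room=None, n_results=5):
    """Hypothetical stand-in for the search steps: filter, score, rank."""
    # Metadata filter: keep only drawers inside the requested scope.
    candidates = [
        d for d in drawers
        if (wing is None or d["wing"] == wing)
        and (room is None or d["room"] == room)
    ]
    # Score each remaining candidate against the query embedding.
    scored = [
        {
            "text": d["text"],
            "wing": d["wing"],
            "room": d["room"],
            "similarity": round(cosine_similarity(query_vec, d["embedding"]), 3),
        }
        for d in candidates
    ]
    # Highest similarity first, truncated to the requested count.
    scored.sort(key=lambda hit: hit["similarity"], reverse=True)
    return scored[:n_results]


# Made-up drawers with tiny fake embeddings, for illustration only.
drawers = [
    {"text": "Moved auth to Clerk", "wing": "myapp", "room": "auth",
     "embedding": [0.9, 0.1, 0.0]},
    {"text": "Pricing tiers draft", "wing": "driftwood", "room": "costs",
     "embedding": [0.1, 0.9, 0.2]},
    {"text": "Postgres over Mongo", "wing": "myapp", "room": "database",
     "embedding": [0.6, 0.3, 0.4]},
]

hits = toy_search([1.0, 0.0, 0.0], drawers, wing="myapp")
for hit in hits:
    print(hit["similarity"], hit["wing"], hit["room"], hit["text"])
```

With the `wing="myapp"` filter, the `driftwood` drawer is never scored, which is the behavior the filter step describes.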

### Why Scoping Matters

Wing/room filtering is useful when a single palace contains many unrelated projects or people. Narrowing the search to a specific wing (or wing + room) means the vector store only scores candidates inside that scope, which keeps retrieval predictable as the palace grows.

This is a metadata-filter feature of the vector store, not a novel retrieval mechanism. Treat it as an operational convenience: clear scoping rules that a human or an agent can apply predictably.
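
As a rough sketch of what such a metadata filter can look like with the default ChromaDB backend: ChromaDB accepts a `where` dictionary, and combining more than one condition requires the `$and` operator. The helper name `build_where` and the assumption that drawers carry metadata keys named `wing` and `room` are illustrative guesses, not confirmed MemPalace internals.

```python
def build_where(wing=None, room=None):
    """Build a ChromaDB-style `where` filter from optional scope values.

    Hypothetical helper: assumes metadata keys are literally "wing" and "room".
    """
    conditions = []
    if wing is not None:
        conditions.append({"wing": wing})
    if room is not None:
        conditions.append({"room": room})

    if not conditions:
        return None  # no scoping: search the whole collection
    if len(conditions) == 1:
        return conditions[0]  # a single condition needs no operator
    return {"$and": conditions}  # several conditions must be combined explicitly


print(build_where())
print(build_where(wing="driftwood"))
print(build_where(wing="driftwood", room="costs"))
```

A filter built this way would be passed as the `where` argument of a ChromaDB `collection.query(...)` call, alongside the query text and result count.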

## Programmatic Search

Use the Python API for integration:

```python
from mempalace.searcher import search_memories

results = search_memories(
    query="auth decisions",
    palace_path="~/.mempalace/palace",
    wing="myapp",
    room="auth",
    n_results=5,
)

for hit in results["results"]:
    print(f"[{hit['similarity']}] {hit['wing']}/{hit['room']}")
    print(f"  {hit['text'][:200]}")
```

The `search_memories()` function returns a dict:

```python
{
    "query": "auth decisions",
    "filters": {"wing": "myapp", "room": "auth"},
    "results": [
        {
            "text": "We decided to migrate auth to Clerk because...",
            "wing": "myapp",
            "room": "auth",
            "source_file": "session_2026-01-15.md",
            "similarity": 0.892,
        },
        # ...
    ],
}
```
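
Given that shape, post-processing is plain dictionary work. The sketch below assumes only the keys shown above; `filter_hits`, `format_hit`, the `0.8` threshold, and the second sample hit are hypothetical and exist purely to show one way of consuming the response.

```python
def filter_hits(response, min_similarity=0.5):
    """Keep only hits at or above a similarity threshold (threshold is arbitrary)."""
    return [h for h in response["results"] if h["similarity"] >= min_similarity]


def format_hit(hit, width=80):
    """Render one hit as a single line with score, scope, source, and a snippet."""
    snippet = hit["text"][:width]
    return (
        f"[{hit['similarity']:.3f}] {hit['wing']}/{hit['room']} "
        f"({hit['source_file']}): {snippet}"
    )


# Sample response in the documented shape; the second hit is made up.
response = {
    "query": "auth decisions",
    "filters": {"wing": "myapp", "room": "auth"},
    "results": [
        {
            "text": "We decided to migrate auth to Clerk because...",
            "wing": "myapp",
            "room": "auth",
            "source_file": "session_2026-01-15.md",
            "similarity": 0.892,
        },
        {
            "text": "Unrelated note that happened to mention login.",
            "wing": "myapp",
            "room": "auth",
            "source_file": "session_2026-01-20.md",
            "similarity": 0.41,
        },
    ],
}

strong_hits = filter_hits(response, min_similarity=0.8)
for hit in strong_hits:
    print(format_hit(hit))
```

Because results are verbatim text, truncating to a snippet is a display choice only; the full `text` value remains available on each hit.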

## MCP Search

When connected via MCP, your AI searches automatically:

> *"What did we decide about auth last month?"*

The AI calls `mempalace_search` behind the scenes. You never type a search command.

See [MCP Integration](/guide/mcp-integration) for setup.

## Wake-Up Context

Instead of searching, you can load a compact context of your world:

```bash
# Load identity + top memories (~600-900 tokens in typical use)
mempalace wake-up

# Project-specific context
mempalace wake-up --wing driftwood
```

This loads Layer 0 (identity) and Layer 1 (essential story) as bounded startup context before the first retrieval call.

See [Memory Stack](/concepts/memory-stack) for details on the 4-layer architecture.