pi-memory

Pi coding agent extension for memory with qmd-powered semantic search across daily logs, long-term memory, and scratchpad

Packages

Package details

extension

Install pi-memory from npm and Pi will load the resources declared by the package manifest.

npm repo home report

$ pi install npm:pi-memory

Package: pi-memory
Version: 0.3.14
Published: Jun 10, 2026
Downloads: 5,016/mo · 1,419/wk
Author: jayzeng
License: MIT
Types: extension
Size: 81.5 KB
Dependencies: 0 dependencies · 3 peers

Pi manifest JSON

{
  "extensions": [
    "./index.ts"
  ]
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

pi-memory

Memory extension for pi with semantic search powered by qmd.

Thanks to https://github.com/skyfallsin/pi-mem for inspiration.

Your coding agent forgets everything between sessions. pi-memory gives it a memory: durable facts and decisions, a running daily log, and a scratchpad of things to come back to — all as plain markdown files you can read, edit, and commit. With optional qmd it also gets keyword, semantic, and hybrid search across everything it has ever remembered.

What it feels like

# Session 1
you ▸ I always use pnpm in this repo, never npm. Remember that.
pi  ▸ Got it — saved to long-term memory.   (writes MEMORY.md)

# …days later, brand new session…
you ▸ add prettier as a dev dependency
pi  ▸ pnpm add -D prettier
      (recalled your package-manager preference from memory — no reminder needed)

Everything lives in ~/.pi/agent/memory/ as markdown, so you can also just cat it:

$ cat ~/.pi/agent/memory/MEMORY.md
<!-- 2026-06-07 10:12:03 [a1b2c3d4] -->
#preference [[package-manager]] Always use pnpm in this repo, never npm.

Installation

# Install from npm (recommended)
pi install npm:pi-memory

# …or from a local checkout
pi install ./pi-memory

That's it — the four core tools (memory_write, memory_read, scratchpad, memory_status) work immediately with no other setup. Search is opt-in below.

Optional: enable search with qmd

memory_search (and selective injection) need qmd. Either install method works:

npm install -g @tobilu/qmd                      # no Bun required
bun install -g https://github.com/tobi/qmd      # ensure ~/.bun/bin is on PATH

When qmd is present, the extension automatically creates the pi-memory collection and path contexts on the next session start — no manual step. Run memory_status any time to confirm qmd, the collection, and embeddings are ready.

Semantic/deep modes need vector embeddings; the extension keeps them current automatically (qmd embed runs in the background at session start and after writes). The very first embed downloads the embedding model, so semantic search may take a minute to come online on a fresh install. To set the collection up by hand:

qmd collection add ~/.pi/agent/memory --name pi-memory
qmd context add /daily "Daily append-only work logs organized by date" -c pi-memory
qmd context add / "Curated long-term memory: decisions, preferences, facts, lessons" -c pi-memory
qmd embed

Without qmd, the core tools still work fully — only memory_search and selective injection require it.

Tools

Tool	Description
`memory_write`	Write to MEMORY.md (long-term) or daily log
`memory_read`	Read any memory file or list daily logs
`scratchpad`	Add/done/undo/clear/list checklist items
`memory_search`	Search across all memory files (requires qmd)
`memory_status`	Health check: where files live, qmd/collection/embeddings state, active config

memory_search modes

Mode	Speed	Method	Best for
`keyword`	~30ms	BM25	Specific terms, dates, names, #tags, [[links]]
`semantic`	~2s	Vector search	Related concepts, different wording
`deep`	~10s	Hybrid + reranking	When other modes miss

If the first search doesn't find what you need, try rephrasing or switching modes.

File layout

~/.pi/agent/memory/
  MEMORY.md              # Curated long-term memory
  SCRATCHPAD.md           # Checklist of things to fix/remember
  daily/
    2026-02-15.md         # Daily append-only log
    2026-02-14.md
    ...

How it works

Context injection

Before every agent turn, the following are injected into the system prompt (in priority order):

Open scratchpad items (up to 2K chars)
Today's daily log (up to 3K chars, tail)
MEMORY.md (up to 4K chars, middle-truncated)
Yesterday's daily log (up to 3K chars, tail — lowest priority, trimmed first)

Total injection is capped at 16K chars.

KV cache-stable snapshot (default)

Local prefix-caching runtimes (llama.cpp, vLLM, MLX) invalidate from the first divergent token onward. If the injected memory block changes turn-to-turn, every subsequent user / assistant / tool token gets reprocessed — effectively the entire conversation history each turn.

To keep the prefix byte-stable, the extension snapshots the memory context at deliberate checkpoints and emits the same bytes for every turn in between. Snapshots refresh on:

session_start — fresh snapshot per session
session_before_compact — handoff is written then snapshot refreshes (one intentional cache boundary at compaction)
memory_write with target: long_term — marks the snapshot dirty so the next turn refreshes (long-term writes are rare, intentional, and the user expects them to stick as ambient context)
Day rollover — snapshot's captured date no longer matches today

memory_write with target: daily and scratchpad writes do not mark dirty — they're high-frequency and the write content is already echoed via tool-call args. The model can always call memory_read / memory_search for the authoritative latest state.

Set PI_MEMORY_SNAPSHOT=per-turn to opt out and restore the old per-turn rebuild behavior, including automatic per-prompt qmd search injection.

Selective injection (opt-in via `per-turn` mode)

When PI_MEMORY_SNAPSHOT=per-turn is set and qmd is available, the extension automatically searches memory using the user's prompt before each turn. The top 3 keyword results are injected alongside the standard context. This surfaces relevant past decisions without an explicit memory_search call, at the cost of busting the KV cache every turn (the search is prompt-dependent and cannot be cached).

The search has a 3-second timeout and fails silently. In the default stable mode, the model gets the same capability by calling memory_search on demand.

Tags and links

Use #tags and [[wiki-links]] in memory content to improve searchability:

#decision [[database-choice]] Chose PostgreSQL for all backend services.
#preference [[editor]] User prefers Neovim with LazyVim config.
#lesson [[api-versioning]] URL prefix versioning (/v1/) avoids CDN cache issues.

These are content conventions, not enforced metadata. qmd's full-text indexing makes them searchable for free.

Session handoff

When the context window compacts, the extension automatically captures a handoff entry in today's daily log:

<!-- HANDOFF 2026-02-15 14:30:00 [a1b2c3d4] -->
## Session Handoff
**Open scratchpad items:**
- [ ] Fix auth bug
- [ ] Review PR #42
**Recent daily log context:**
...last 15 lines of today's log...

This ensures in-progress context survives compaction and is visible in the next turn (via today's daily log injection).

Other behavior

Persistence: Memory files are plain markdown on disk — readable, editable, and git-friendly.
Tool response previews: Write/scratchpad tools return size-capped previews instead of full file contents.
qmd auto-setup: On first session start with qmd available, the extension creates the collection and path contexts automatically.
qmd re-indexing: After every write, a debounced qmd update runs in the background (fire-and-forget, non-blocking) unless disabled via PI_MEMORY_QMD_UPDATE.
qmd embeddings: Vector embeddings for semantic/deep search are kept current automatically — qmd embed (incremental) runs in the background after each re-index and as a catch-up at session start. Disabled along with re-indexing via PI_MEMORY_QMD_UPDATE.
Graceful degradation: If qmd is not installed, core tools work fine. memory_search returns install instructions.

Configuration

Variable	Values	Default	Description
`PI_MEMORY_DIR`	path	`~/.pi/agent/memory`	Override the memory storage directory
`PI_MEMORY_SNAPSHOT`	`stable`, `per-turn`	`stable`	`stable` snapshots memory at checkpoints for KV cache stability; `per-turn` rebuilds every turn (legacy behavior)
`PI_MEMORY_QMD_UPDATE`	`background`, `manual`, `off`	`background`	Controls automatic `qmd update` + `qmd embed` after writes
`PI_MEMORY_NO_SEARCH`	`1`	unset	Disable selective injection in `per-turn` mode (no effect in `stable` mode)
`PI_MEMORY_SUMMARIZE_TRANSITIONS`	`1`, `true`, `yes`, `on`	unset	Also write exit summaries during lifecycle transitions (`/reload`, `/new`, `/resume`, `/fork`). By default these transitions skip summaries for speed.

Troubleshooting

Run the memory_status tool first — it reports most of these at a glance.

Symptom	Cause	Fix
`memory_search` says qmd is required	qmd not installed or not on `PATH`	Install qmd (`npm install -g @tobilu/qmd`); if installed via Bun, ensure `~/.bun/bin` is on `PATH`
Search returns nothing for terms you know exist	Index is stale	A background `qmd update` runs after writes; if disabled (`PI_MEMORY_QMD_UPDATE=off`), run `qmd update` manually
“need embeddings” on semantic/deep search	Vectors not built yet	Embedding starts automatically in the background — retry shortly. If `PI_MEMORY_QMD_UPDATE` is `manual`/`off`, run `qmd embed` yourself
Collection `pi-memory` missing	Auto-setup didn't run (qmd installed mid-session)	Run any `memory_search` (auto-creates it) or `qmd collection add ~/.pi/agent/memory --name pi-memory`
qmd works in the shell but not from pi on Windows	Broken `.cmd`/`.ps1` shims	The extension bypasses them by invoking qmd's JS entry with `node`; make sure the npm global `node_modules` dir is on `PATH`
Memory isn't being injected after a write	Cache-stable snapshot only refreshes at checkpoints	Long-term writes refresh next turn; for daily/scratchpad use `memory_read`, or set `PI_MEMORY_SNAPSHOT=per-turn`

Running tests

# Unit tests (no LLM, no qmd — fast, deterministic). Requires Bun.
npm test

# End-to-end tests (requires pi + API key, optionally qmd)
npm run test:e2e

# Recall effectiveness eval (requires pi + API key + qmd)
npm run test:eval

# Pin provider/model for cheaper eval runs
PI_E2E_PROVIDER=openai PI_E2E_MODEL=gpt-4o-mini npm run test:eval

# Multiple runs for statistical robustness
EVAL_RUNS=3 npm run test:eval

All tests back up and restore existing memory files.

Test levels

Level	Command	Requirements	What it tests
Unit	`npm test` (`test/unit.test.ts`)	Bun	Context builder, truncation, handoff, scratchpad parsing, qmd plumbing
E2E	`npm run test:e2e` (`test/e2e.ts`)	pi + API key	Tool registration, write/recall, scratchpad lifecycle, search
Eval	`npm run test:eval` (`test/eval-recall.ts`)	pi + API key + qmd	Recall accuracy with vs without selective injection

Development

This is a single-file extension (index.ts). No build step required — pi loads TypeScript directly.

# Test with pi directly
pi -p -e ./index.ts "remember: I prefer dark mode"

# Verify memory was written
cat ~/.pi/agent/memory/MEMORY.md

Publishing (maintainers)

Releases are tag-driven. Pushing a v* tag runs the publish workflow, which lints, builds, runs the unit tests, verifies the tag matches package.json, and then publishes to npm.

# Bump version + create the matching git tag (updates package.json)
npm version patch   # or minor / major

# Push the commit and tag — this triggers .github/workflows/publish-npm.yml
git push --follow-tags

# Verify the published install
pi install npm:pi-memory

Changelog

See CHANGELOG.md.

pi-memory

What it feels like

Installation

Optional: enable search with qmd

Tools

memory_search modes

File layout

How it works

Context injection

KV cache-stable snapshot (default)

Selective injection (opt-in via per-turn mode)

Tags and links

Session handoff

Other behavior

Configuration

Troubleshooting

Running tests

Test levels

Development

Publishing (maintainers)

Changelog

Selective injection (opt-in via `per-turn` mode)