@adamjen/pi-compact-fast

/compact-fast command for Pi — compacts sessions with a fast local model instead of your main conversation model.

Package details

Install @adamjen/pi-compact-fast from npm and Pi will load the resources declared by the package manifest.

$ pi install npm:@adamjen/pi-compact-fast
Package: @adamjen/pi-compact-fast
Version: 2.0.0
Published: May 7, 2026
Downloads: not available
Author: adamjen
License: MIT
Types: extension
Size: 17.1 KB
Dependencies: 0 dependencies · 2 peers
Pi manifest JSON
{
  "extensions": [
    "./extensions/index.ts"
  ]
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

@adamjen/pi-compact-fast

/compact-fast command for Pi — compact your session using a fast local model instead of your main conversation model.

What it does

Pi normally uses your current conversation model for compaction (summarizing the session). This extension adds /compact-fast which:

  1. Looks up qwen-35b-moe from your configured models (models.json)
  2. Uses that model directly via a separate API call to generate the summary
  3. Returns the structured summary to Pi's built-in compaction system

This saves tokens on expensive models and speeds up compaction when using a smaller/faster local model.
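The lookup in step 1 can be sketched as follows. This is a minimal illustration, not the extension's actual code: the `ModelEntry` shape and the `resolveCompactModel` helper are assumptions, since the real `models.json` schema and Pi's model-registry API aren't shown here.

```typescript
// Hypothetical shape of an entry in ~/.pi/agent/models.json —
// the real schema may differ.
interface ModelEntry {
  id: string;
  baseUrl: string;
  api: string;
}

// Find the compaction model among the configured models, failing
// loudly if it isn't defined so the user knows to add it.
function resolveCompactModel(models: ModelEntry[], id: string): ModelEntry {
  const model = models.find((m) => m.id === id);
  if (!model) {
    throw new Error(`Model "${id}" not found in models.json`);
  }
  return model;
}
```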

Install

pi install npm:@adamjen/pi-compact-fast@latest

Or try without installing:

pi -e npm:@adamjen/pi-compact-fast

Usage

/compact-fast

That's it — Pi will compact the session using qwen-35b-moe instead of your current model.

Configure a different model

Edit extensions/index.ts and change line 24:

const COMPACT_MODEL_ID = "your-model-id";

The model must be defined in your ~/.pi/agent/models.json or provider config.

How it works

  1. Intercepts session_before_compact event
  2. Makes a direct API call to the target model via completeSimple() from @mariozechner/pi-ai
  3. Returns { compaction: { summary, firstKeptEntryId, tokensBefore, details } } which Pi uses for compaction
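The steps above can be sketched as a condensed handler. The `CompactEvent`/`CompactResult` types below are stub interfaces inferred from this description, not Pi's actual definitions, and `summarize` stands in for the `completeSimple()` call to the fast model:

```typescript
// Stub types inferred from the README — Pi's real extension API will differ.
interface CompactEvent {
  entries: { id: string; text: string; tokens: number }[];
  keepRecentTokens: number;
}

interface CompactResult {
  compaction: {
    summary: string;
    firstKeptEntryId: string;
    tokensBefore: number;
    details: { readFiles: string[]; modifiedFiles: string[] };
  };
}

// summarize() stands in for the direct completeSimple() call to the
// fast local model.
async function onBeforeCompact(
  event: CompactEvent,
  summarize: (text: string) => Promise<string>,
): Promise<CompactResult | undefined> {
  const tokensBefore = event.entries.reduce((n, e) => n + e.tokens, 0);

  // Walk backwards until the recent tail would exceed keepRecentTokens;
  // everything before that boundary gets summarized.
  let kept = 0;
  let boundary = event.entries.length;
  while (boundary > 0 && kept + event.entries[boundary - 1].tokens <= event.keepRecentTokens) {
    kept += event.entries[--boundary].tokens;
  }

  const toSummarize = event.entries.slice(0, boundary);
  if (toSummarize.length === 0) return undefined; // guard against empty content

  const summary = await summarize(toSummarize.map((e) => e.text).join("\n"));
  return {
    compaction: {
      summary,
      firstKeptEntryId: event.entries[boundary]?.id ?? "",
      tokensBefore,
      details: { readFiles: [], modifiedFiles: [] },
    },
  };
}
```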

Prompt structure (mirrors pi's native compaction)

  • System prompt tells model to ONLY summarize, not continue conversation
  • Conversation comes BEFORE instructions — the model follows whatever instructions it saw last
  • Previous summary in <previous-summary> tags between conversation and instructions (for incremental updates)
  • Split turns generate two summaries merged with separator (mirrors pi's native behavior)
  • File operations tracked in both summary text AND details object
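The ordering rules above can be sketched as a small prompt builder. Only the `<previous-summary>` tag is named in the source; the function name and parameters are illustrative:

```typescript
// Assemble the compaction prompt: conversation first, then the optional
// previous summary in <previous-summary> tags, then the instructions
// last — so the instructions are the final pattern the model sees.
function buildCompactPrompt(
  conversation: string,
  instructions: string,
  previousSummary?: string,
): string {
  const parts = [conversation];
  if (previousSummary) {
    parts.push(`<previous-summary>\n${previousSummary}\n</previous-summary>`);
  }
  parts.push(instructions);
  return parts.join("\n\n");
}
```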

Changelog

2.0.0 — Major rewrite

  • ✅ Guard against empty content — aborts cleanly instead of writing garbage "empty conversation" summary
  • ✅ File operations tracking — <read-files> and <modified-files> tags appended to summary, plus details object for pi's internal tracking
  • ✅ Split turn handling — parallel dual summaries with proper merge separator when a single turn exceeds keepRecentTokens
  • ✅ Correct token budgets — derived dynamically from settings.reserveTokens (× 0.8 and × 0.5) instead of a hardcoded 8192
  • ✅ Added TURN_PREFIX_SUMMARIZATION_PROMPT for split-turn prefix summaries (different format: "Original Request", "Early Progress")
  • ✅ Switched from complete() to completeSimple() — matches pi's internal API usage
  • ✅ Added stopReason === "error" check — handles API failures gracefully instead of producing empty strings
  • ✅ Removed debug logging (console.error)
  • ⚠️ Default model changed from qwen3.6-35b to qwen-35b-moe (configure your own in models.json)

1.x — Initial release

  • Basic compaction using a fast local model via direct API call

License

MIT