pi-token-burden

Pi extension that shows a token-budget breakdown of the assembled system prompt

Package details

← Back

extension

Install pi-token-burden from npm and Pi will load the resources declared by the package manifest.

npm report

$ pi install npm:pi-token-burden

Package: pi-token-burden
Version: 0.6.3
Published: Apr 28, 2026
Downloads: 710/mo · 193/wk
Author: whamp
License: MIT
Types: extension
Size: 1.4 MB
Dependencies: 1 dependency · 3 peers

Pi manifest JSON

{
  "extensions": [
    "./src/index.ts"
  ]
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

pi-token-burden

See where your system prompt tokens go.

A pi extension that parses the assembled system prompt and shows a token-budget breakdown by section. Run /token-burden to see how much of your context window is consumed by the base prompt, AGENTS.md files, skills, SYSTEM.md overrides, and metadata.

Install

pi install npm:pi-token-burden

Or from git:

pi install git:github.com/Whamp/pi-token-burden

To try it for a single session without installing, use pi -e npm:pi-token-burden.

Requirements

pi v0.55.1 or later

Usage

Type /token-burden in any pi session. An overlay appears with a stacked bar and a drill-down table:

The table is sorted by token count (descending). Use arrow keys to navigate, Enter to drill down into children (e.g., individual skills or AGENTS.md files), and / to fuzzy-search items.

Drill-down views:

Keyboard shortcuts

Key	Context	Action
`↑` / `↓`	All modes	Navigate rows
`Enter`	Sections	Drill into children or enter skill-toggle
`/`	Sections/skills	Fuzzy search
`e`	Sections	Open the selected section in `$EDITOR`
`e`	Drilldown (AGENTS)	Open the AGENTS.md file in `$EDITOR`
`Enter`	Tools view	Expand/collapse the Inactive group
`e`	Tools view	Open tool JSON definition in `$EDITOR`
`t`	Sections	Trace Base prompt sources (attribution view)
`s`	Sections	Enter skill-toggle mode
`Enter`	Skill-toggle	Cycle skill state (on → hidden → disabled)
`Ctrl+S`	Skill-toggle	Save pending skill changes
`Enter`	Trace view	Drill into bucket (line-level evidence)
`r`	Trace view	Refresh trace
`Esc`	Any	Go back / close overlay

Base prompt source tracing

Press t when the cursor is on the Base prompt row to run an on-demand attribution trace. This analyzes extension tool registrations and matches their prompt snippets and guidelines against the lines in the Base prompt, showing:

Built-in/core — tools and guidelines hardcoded in pi
Extension buckets — lines contributed by specific extensions
Shared — lines registered by multiple extensions
Unattributed — lines that couldn't be matched to any source

Press Enter on any bucket to see line-level evidence with per-line token counts.

Tool definitions

Tool definitions are the function schemas (name, description, parameter JSON schema) sent to the LLM alongside the system prompt. They are not part of the system prompt text, but they still consume context window tokens through the tool-calling API.

/token-burden compares Pi's full registered tool catalog with the current active tool set:

The top-level Tool definitions row counts only active tool schemas and shows active/total inventory, for example Tool definitions (4 active, 11 total).
Selecting Tool definitions opens a dedicated read-only Tools view.
Active is expanded by default. Active rows show plain token costs such as 182 tok, sorted by token cost descending.
Inactive is collapsed by default. Inactive tools remain visible as counterfactual costs such as +182 tok if enabled, but they do not affect the stacked bar, section totals, or percentages.
Press e on any tool row to see its full JSON definition in your editor.

Tool-related guideline text remains accounted under Base prompt. The Tool definitions section is limited to schema payload.

What each section measures

Section	Content
Base prompt	pi's built-in instructions, tool descriptions, guidelines
SYSTEM.md / APPEND_SYSTEM.md	Your custom system prompt overrides
AGENTS.md files	Each AGENTS.md file, listed individually
Skills	The `<available_skills>` block, with per-skill breakdown
Tool definitions	Active LLM function schemas; inactive schemas shown as counterfactual `if enabled` costs
Metadata	The `Current date and time` / `Current working directory` footer

Token estimation

Tokens are counted using gpt-tokenizer with the o200k_base encoding (used by GPT-4o, o1, o3, and other modern models). This gives exact BPE token counts rather than a character-based approximation.

Development

git clone https://github.com/Whamp/pi-token-burden.git
cd pi-token-burden
pnpm install
pnpm run test     # 111 unit tests
pnpm run test:e2e # 29 e2e tests (requires tmux)
pnpm run check    # lint, typecheck, format, dead code, duplicates, tests

Test locally: pi -e ./src/index.ts, then type /token-burden.

Contributing

Contributions are welcome. Please open an issue before starting work on larger changes.

Changelog

See CHANGELOG.md for release history.

License

MIT