@gotgenes/pi-subagents

A focused, in-process sub-agent core for pi — autonomous agents plus a typed API and lifecycle events other extensions build on. Friendly fork of @tintinweb/pi-subagents.

Packages

Package details

extension

Install @gotgenes/pi-subagents from npm and Pi will load the resources declared by the package manifest.

npm repo home report

$ pi install npm:@gotgenes/pi-subagents

Package: @gotgenes/pi-subagents
Version: 17.0.0
Published: Jun 18, 2026
Downloads: 21.5K/mo · 3,566/wk
Author: gotgenes
License: MIT
Types: extension
Size: 2.7 MB
Dependencies: 1 dependency · 3 peers

Pi manifest JSON

{
  "extensions": [
    "./src/index.ts"
  ],
  "video": "https://github.com/gotgenes/pi-subagents/raw/main/media/demo.mp4",
  "image": "https://github.com/gotgenes/pi-subagents/raw/main/media/screenshot.png"
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

@gotgenes/pi-subagents

A pi extension that gives pi a focused, in-process sub-agent core — autonomous agents that run inside the same pi runtime (no spawned subprocesses), plus a typed API and lifecycle events other extensions build on. Spawn specialized agents that run in isolated sessions — each with its own tools, system prompt, model, and thinking level. Run them in foreground or background, steer them mid-run, resume completed sessions, and define your own custom agent types.

Originally forked from tintinweb/pi-subagents by @tintinweb, now an independently maintained hard fork. See Comparison with upstream for a feature-by-feature comparison and guidance on which to choose.

https://github.com/user-attachments/assets/8685261b-9338-4fea-8dfe-1c590d5df543

Features

In-process & native — agents run inside the same pi runtime (no spawned subprocesses), sharing tool names, calling conventions, and UI patterns (subagent, get_subagent_result, steer_subagent) — feels native
Parallel background agents — spawn multiple agents that run concurrently with automatic queuing (configurable concurrency limit, default 4) and individual completion notifications
Live widget UI — persistent above-editor widget with animated spinners, live tool activity, token counts, and colored status icons
Conversation viewer — select any agent in /agents to open a live-scrolling overlay of its full conversation (auto-follows new content, scroll up to pause)
Custom agent types — define agents in .pi/agents/<name>.md with YAML frontmatter: custom system prompts, model selection, thinking levels, tool restrictions
Mid-run steering — inject messages into running agents to redirect their work without restarting
Session resume — pick up where an agent left off, preserving full conversation context
Graceful turn limits — agents get a "wrap up" warning before hard abort, producing clean partial results instead of cut-off output
Case-insensitive agent types — "explore", "Explore", "EXPLORE" all work. Unknown types fall back to general-purpose with a note
Fuzzy model selection — specify models by name ("haiku", "sonnet") instead of full IDs, with automatic filtering to only available/configured models
Context inheritance — optionally fork the parent conversation into a sub-agent so it knows what's been discussed
Styled completion notifications — background agent results render as themed, compact notification boxes (icon, stats, result preview) instead of raw XML. Expandable to show full output
Event bus — lifecycle events (subagents:created, started, completed, failed, steered, compacted) emitted via pi.events, enabling other extensions to react to sub-agent activity

Install

pi install npm:@gotgenes/pi-subagents

Or load directly for development:

pi -e ./src/index.ts

Quick Start

The parent agent spawns sub-agents using the subagent tool:

subagent({
  subagent_type: "Explore",
  prompt: "Find all files that handle authentication",
  description: "Find auth files",
  run_in_background: true,
})

Foreground agents block until complete and return results inline. Background agents return an ID immediately and notify you on completion.

UI

The extension renders a persistent widget above the editor showing all active agents:

● Agents
├─ ⠹ Agent  Refactor auth module · ⟳5≤30 · 5 tool uses · 33.8k token (62%) · 12.3s
│    ⎿  editing 2 files…
├─ ⠹ Explore  Find auth files · ⟳3 · 3 tool uses · 12.4k token (8%) · 4.1s
│    ⎿  searching…
├─ ⠹ Agent  Long-running task · ⟳42 · 38 tool uses · 91.0k token (84% · ↻2) · 2m17s
│    ⎿  reading…
└─ 2 queued

The token field is annotated with two optional signals inside parens:

NN% — context-window utilization (color-coded: <70% dim, 70–85% warning, ≥85% error). Omitted when the model has no declared contextWindow, or briefly right after compaction.
↻N — number of times the session has compacted, when > 0. Stays dim; the percent's color carries urgency.

Individual agent results render inline in the conversation:

State	Example
Running	`⠹ ⟳3≤30 · 3 tool uses · 12.4k token (8%)` / `⎿ searching, reading 3 files…`
Completed	`✓ ⟳8 · 5 tool uses · 33.8k token (62%) · 12.3s` / `⎿ Done`
Wrapped up	`✓ ⟳50≤50 · 50 tool uses · 89.1k token (84% · ↻2) · 45.2s` / `⎿ Wrapped up (turn limit)`
Stopped	`■ ⟳3 · 3 tool uses · 12.4k token (8%)` / `⎿ Stopped`
Error	`✗ ⟳3 · 3 tool uses · 12.4k token (8%)` / `⎿ Error: timeout`
Aborted	`✗ ⟳55≤50 · 55 tool uses · 102.3k token (95% · ↻3)` / `⎿ Aborted (max turns exceeded)`

Completed results can be expanded (ctrl+o in pi) to show the full agent output inline.

Background agent completion notifications render as styled boxes:

✓ Find auth files completed
  ⟳3 · 3 tool uses · 12.4k token · 4.1s
  ⎿  Found 5 files related to authentication...
  transcript: .pi/output/agent-abc123.jsonl

The LLM receives structured <task-notification> XML for parsing, while the user sees the themed visual.

Default Agent Types

Type	Tools	Model	Prompt Mode	Description
`general-purpose`	all 7	inherit	`append` (parent twin)	Inherits the parent's full system prompt — same rules, CLAUDE.md, project conventions
`Explore`	read, bash, grep, find, ls	haiku (falls back to inherit)	`replace`	Fast codebase exploration (read-only); inherits the parent prompt as a base
`Plan`	read, bash, grep, find, ls	inherit	`replace`	Software architect for implementation planning (read-only); inherits the parent prompt as a base

The general-purpose agent is a parent twin — it receives the parent's entire system prompt plus a sub-agent context bridge, so it follows the same rules the parent does. Explore and Plan use replace mode: the parent prompt is the cacheable base and their specialist read-only instructions are appended last, giving them the final say.

Default agents can be ejected (/agents → select agent → Eject) to export them as .md files for customization, overridden by creating a .md file with the same name (e.g. .pi/agents/general-purpose.md), or disabled per-project with enabled: false frontmatter.

Custom Agents

Define custom agent types by creating .md files. The filename becomes the agent type name. Any name is allowed — using a default agent's name overrides it.

Agents are discovered from two locations (higher priority wins):

Priority	Location	Scope
1 (highest)	`.pi/agents/<name>.md`	Project — per-repo agents
2	`$PI_CODING_AGENT_DIR/agents/<name>.md` (default `~/.pi/agent/agents/<name>.md`)	Global — available everywhere

Project-level agents override global ones with the same name, so you can customize a global agent for a specific project. The global location follows the upstream PI_CODING_AGENT_DIR env var — set it to relocate all pi-coding-agent state (agents, skills, settings) to a custom directory.

Example: `.pi/agents/auditor.md`

---
description: Security Code Reviewer
tools: read, grep, find, bash
model: anthropic/claude-opus-4-6
thinking: high
max_turns: 30
---

You are a security auditor.
Review code for vulnerabilities including:

- Injection flaws (SQL, command, XSS)
- Authentication and authorization issues
- Sensitive data exposure
- Insecure configurations

Report findings with file paths, line numbers, severity, and remediation advice.

Then spawn it like any built-in type:

subagent({ subagent_type: "auditor", prompt: "Review the auth module", description: "Security audit" })

Frontmatter Fields

All fields are optional — sensible defaults for everything.

Field	Default	Description
`description`	filename	Agent description shown in tool listings
`display_name`	—	Display name for UI (e.g. widget, agent list)
`tools`	all 7	Comma-separated built-in tools: read, bash, edit, write, grep, find, ls. `none` for no tools
`model`	inherit parent	Model — `provider/modelId` or fuzzy name (`"haiku"`, `"sonnet"`)
`thinking`	inherit	off, minimal, low, medium, high, xhigh
`max_turns`	unlimited	Max agentic turns before graceful shutdown. `0` or omit for unlimited
`prompt_mode`	`append`	`replace`: parent prompt is the cacheable base; body is appended last with full control (no `<sub_agent_context>` bridge, no `<agent_instructions>` wrapper). `append`: parent prompt is the base; body is wrapped in `<agent_instructions>` and a sub-agent context bridge is injected (agent acts as a "parent twin")
`inherit_context`	`false`	Fork parent conversation into agent
`run_in_background`	`false`	Run in background by default
`enabled`	`true`	Set to `false` to disable an agent (useful for hiding a default agent per-project)

Frontmatter is authoritative. If an agent file sets model, thinking, max_turns, inherit_context, or run_in_background, those values are locked for that agent. subagent tool parameters only fill fields the agent config leaves unspecified.

Tools

`subagent`

Launch a sub-agent.

Parameter	Type	Required	Description
`prompt`	string	yes	The task for the agent
`description`	string	yes	Short 3-5 word summary (shown in UI)
`subagent_type`	string	yes	Agent type (built-in or custom)
`model`	string	no	Model — `provider/modelId` or fuzzy name (`"haiku"`, `"sonnet"`)
`thinking`	string	no	Thinking level: off, minimal, low, medium, high, xhigh
`max_turns`	number	no	Max agentic turns. Omit for unlimited (default)
`run_in_background`	boolean	no	Run without blocking
`resume`	string	no	Agent ID to resume a previous session
`inherit_context`	boolean	no	Fork parent conversation into agent

`get_subagent_result`

Check status and retrieve results from a background agent.

Parameter	Type	Required	Description
`agent_id`	string	yes	Agent ID to check
`wait`	boolean	no	Wait for completion
`verbose`	boolean	no	Include full conversation log

`steer_subagent`

Send a steering message to a running agent. The message interrupts after the current tool execution.

Parameter	Type	Required	Description
`agent_id`	string	yes	Agent ID to steer
`message`	string	yes	Message to inject into agent conversation

Commands

Command	Description
`/agents`	Interactive agent management menu

The /agents command opens an interactive menu:

Running agents (2) — 1 running, 1 done     ← only shown when agents exist
Agent types (6)                             ← unified list: defaults + custom
Create new agent                            ← manual wizard or AI-generated
Settings                                    ← max concurrency, max turns, grace turns

Agent types — unified list with source indicators: • (project), ◦ (global), ✕ (disabled). Select an agent to manage it:
- Default agents (no override): Eject (export as .md), Disable
- Default agents (ejected/overridden): Edit, Disable, Reset to default, Delete
- Custom agents: Edit, Disable, Delete
- Disabled agents: Enable, Edit, Delete
Eject — writes the embedded default config as a .md file to project or personal location, so you can customize it
Disable/Enable — toggle agent availability. Disabled agents stay visible in the list (marked ✕) and can be re-enabled
Create new agent — choose project/personal location, then manual wizard (step-by-step prompts for name, tools, model, thinking, system prompt) or AI-generated (describe what the agent should do and a sub-agent writes the .md file). Any name is allowed, including default agent names (overrides them)
Settings — configure max concurrency, default max turns, and grace turns at runtime

Graceful Max Turns

Instead of hard-aborting at the turn limit, agents get a graceful shutdown:

At max_turns — steering message: "Wrap up immediately — provide your final answer now."
Up to 5 grace turns to finish cleanly
Hard abort only after the grace period

Status	Meaning	Icon
`completed`	Finished naturally	`✓` green
`steered`	Hit limit, wrapped up in time	`✓` yellow
`aborted`	Grace period exceeded	`✗` red
`stopped`	User-initiated abort	`■` dim

Concurrency

Background agents are subject to a configurable concurrency limit (default: 4). Excess agents are automatically queued and start as running agents complete. The widget shows queued agents as a collapsed count.

Foreground agents bypass the queue — they block the parent anyway.

Persistent Settings

Runtime tuning values set via /agents → Settings (max concurrency, default max turns, grace turns) persist across pi restarts. Two files, merged on load:

Global: ~/.pi/agent/subagents.json — your machine-wide defaults. Edit by hand; the /agents menu never writes here.
Project: <cwd>/.pi/subagents.json — per-project overrides. Written by /agents → Settings.

Precedence: project overrides global on any field present in both. Missing fields fall back to the hardcoded defaults (max concurrency 4, default max turns unlimited, grace turns 5).

Example — global defaults for a beefy machine:

mkdir -p ~/.pi/agent
cat > ~/.pi/agent/subagents.json <<'EOF'
{
  "maxConcurrent": 16,
  "graceTurns": 10
}
EOF

Every project now starts with concurrency 16 and grace 10, without ever touching the menu. Individual projects can still override via /agents → Settings.

Failure behavior: missing file is silent; malformed JSON logs a [pi-subagents] Ignoring malformed settings at … warning to stderr; invalid/out-of-range field values are dropped per-field; write failures downgrade the /agents toast to a warning with (session only; failed to persist).

Events

Agent lifecycle events are emitted via pi.events.emit() so other extensions can react:

Event	When	Key fields
`subagents:created`	Background agent registered	`id`, `type`, `description`, `isBackground`
`subagents:started`	Agent transitions to running (including queued→running)	`id`, `type`, `description`
`subagents:completed`	Agent finished successfully	`id`, `type`, `durationMs`, `tokens` (lifetime `{ input, output, total }`), `toolUses`, `result`
`subagents:failed`	Agent errored, stopped, or aborted	same as completed + `error`, `status`
`subagents:steered`	Steering message sent	`id`, `message`
`subagents:compacted`	Agent's session successfully compacted	`id`, `type`, `description`, `reason` (`"manual"` / `"threshold"` / `"overflow"`), `tokensBefore`, `compactionCount`
`subagents:settings_loaded`	Persisted settings applied at extension init	`settings` (merged global + project)
`subagents:settings_changed`	`/agents` → Settings mutation was applied	`settings`, `persisted` (`boolean` — `false` on write failure)

tokens.total = input + output + cacheWrite. cacheRead is excluded — each turn's cacheRead is the cumulative cached prefix re-read on that one API call, so summing per-message would over-count it. Use contextUsage.percent (surfaced as (NN%) in the widget) for current context size.

Worktree Isolation

Worktree isolation lives in a companion package, not this core. Install @gotgenes/pi-subagents-worktrees and list the agent types you want isolated in its worktreeAgents config — opted-in agents run in a temporary git worktree, and their changes are saved to a branch on completion. The earlier isolation: "worktree" spawn flag and isolation: frontmatter key were removed from the core.

Removed: agent memory and skill preloading

Persistent agent memory (the memory: frontmatter key) and skill preloading (the skills: frontmatter key) were removed when the core was slimmed down. Children now always inherit the parent's skills and extensions, so the isolated, extensions, and skills frontmatter keys no longer exist.

Migrating from `disallowed_tools`

The disallowed_tools frontmatter field has been removed. Use @gotgenes/pi-permission-system's permission: frontmatter instead — it provides richer semantics (allow/ask/deny vs. binary hide):

# Before (no longer supported)
disallowed_tools: bash

# After
permission:
  bash: deny

Permission System Integration

When @gotgenes/pi-permission-system is installed, this extension integrates automatically:

Per-agent permission policies — define permission: in agent YAML frontmatter to set allow/ask/deny rules per agent type. The permission system resolves the agent name from the <active_agent> tag in the child system prompt.
Tool filtering — the permission system's before_agent_start handler removes denied tools from the child session before the agent starts.
ask-state forwarding — when a child session triggers an ask permission, the prompt forwards to the parent session's UI. The parent approves or denies, and the child resumes.
Deterministic child detection — this extension publishes subagents:child:session-created before bindExtensions() fires; the permission system subscribes and registers the child session synchronously, so detection does not rely on env vars or filesystem heuristics.

No configuration is required. When @gotgenes/pi-permission-system is not installed, the lifecycle events have no subscriber — a harmless no-op.

For Extension Authors

This package exposes two public subpath exports for companion extensions to import from the published tarball.

`@gotgenes/pi-subagents` — cross-extension service contract

Access the subagent service from another extension at runtime:

const { getSubagentsService } = await import("@gotgenes/pi-subagents");
const svc = getSubagentsService();
svc?.spawn("Explore", "Check for stale TODOs");

Declare this package as an optional peer dependency. See src/service/service.ts for the full SubagentsService interface and the WorkspaceProvider seam.

`@gotgenes/pi-subagents/settings` — layered config loader

Extensions that store configuration in JSON files can use the shared layered loader, which reads a global file (<agentDir>/<filename>) and a project file (<cwd>/.pi/<filename>) and merges them — project wins on conflicts, missing files are silent, malformed files warn and fall back:

import { loadLayeredSettings, type LayeredSettingsSource } from "@gotgenes/pi-subagents/settings";

interface MyConfig { enabled?: boolean; limit?: number }

function sanitize(raw: unknown): Partial<MyConfig> {
  if (!raw || typeof raw !== "object") return {};
  const r = raw as Record<string, unknown>;
  const out: Partial<MyConfig> = {};
  if (typeof r.enabled === "boolean") out.enabled = r.enabled;
  if (typeof r.limit === "number") out.limit = r.limit;
  return out;
}

const config = loadLayeredSettings<MyConfig>({
  agentDir,          // Pi runtime agent home directory
  cwd,               // project root — project file lives at <cwd>/.pi/<filename>
  filename: "my-extension.json",
  sanitize,
  warnLabel: "my-extension",  // prefix for the malformed-file stderr warning
});

loadLayeredSettings returns Partial<T> (all fields optional); apply your defaults after the call. It never throws — all error conditions produce a console.warn and return {}.

Architecture

This extension is a minimal, composable core: it owns agent spawning, execution, and result retrieval, and exposes a typed SubagentsService plus lifecycle events that other extensions build on.

See docs/architecture/architecture.md for the full architecture document — design principles, domain decomposition, module dependency flow, Mermaid diagrams, and the improvement roadmap.

Relationship to upstream

This package is an independently maintained hard fork of tintinweb/pi-subagents by @tintinweb. It has diverged substantially in scope and architecture: a minimal core with a typed service API and lifecycle events, with tool-restriction policy and worktree isolation delegated to companion packages. Upstream remains the batteries-included option, keeping scheduling, cross-extension RPC, model-scope enforcement, and a built-in tool denylist in a single package.

See Comparison with upstream for a full feature-by-feature comparison against the current upstream release and guidance on which to choose.

License

MIT — tintinweb (upstream) and Chris Lasher (fork)

@gotgenes/pi-subagents

Features

Install

Quick Start

UI

Default Agent Types

Custom Agents

Example: .pi/agents/auditor.md

Frontmatter Fields

Tools

subagent

get_subagent_result

steer_subagent

Commands

Graceful Max Turns

Concurrency

Persistent Settings

Events

Worktree Isolation

Removed: agent memory and skill preloading

Migrating from disallowed_tools

Permission System Integration

For Extension Authors

@gotgenes/pi-subagents — cross-extension service contract

@gotgenes/pi-subagents/settings — layered config loader

Architecture

Relationship to upstream

License

Example: `.pi/agents/auditor.md`

`subagent`

`get_subagent_result`

`steer_subagent`

Migrating from `disallowed_tools`

`@gotgenes/pi-subagents` — cross-extension service contract

`@gotgenes/pi-subagents/settings` — layered config loader