@monotykamary/pi-retry

Extension suite for pi coding agent that handles 400/413 errors and connection errors with automatic retry

Packages

Package details

extension

Install @monotykamary/pi-retry from npm and Pi will load the resources declared by the package manifest.

npm repo home report

$ pi install npm:@monotykamary/pi-retry

Package: @monotykamary/pi-retry
Version: 0.3.7
Published: Jun 8, 2026
Downloads: not available
Author: monotykamary
License: MIT
Types: extension
Size: 33.5 KB
Dependencies: 0 dependencies · 0 peers

Pi manifest JSON

{
  "extensions": [
    "./retry.ts"
  ]
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

🔄 pi-retry

Automatic retry for every error in pi

400/413, connection errors, credit errors, stream exhaustion — retry them all.

Overview

This extension automatically detects and retries all errors by default, with only a tiny blacklist of known permanent failures (invalid API key, missing model, etc.).

Error Type	Retry Behavior	Use Case
Any retryable error (catch-all)	Indefinite with capped backoff	Everything else — provider hiccups, stream exhaustion, credit issues, unknown errors
HTTP 400/413	Indefinite with capped backoff, NO compaction	Transient context overflow that might resolve
Credit / payment errors	Indefinite with capped backoff	"Not Enough Credits", insufficient balance, 402
Connection errors	Indefinite with capped backoff	Network hiccups, connection drops, socket errors, stream exhaustion
Max tokens (`stopReason: "length"`)	Auto-continue indefinitely (invisible — no prompt pollution)	Model hits output token limit mid-generation

The Problem

By default, pi has built-in retry for some errors (rate limits, 5xx, overloaded), but:

400/413 errors are treated as context overflow → triggers compaction but NO retry
Connection errors sometimes get only limited retries before giving up
Credit errors ("Not Enough Credits") are never retried
Stream exhaustion ("Max outbound streams") and other provider-specific errors are never retried
Any unknown error from a new provider is silently ignored

The Solution

This extension provides automatic infinite retry with sensible exponential backoff (2s → 4s → 8s → ... → 60s max).

Philosophy: retry EVERYTHING by default. The only things we skip are a tiny blacklist of known permanent failures (invalid API key, model not found, unsupported model, etc.).

Features:

Catch-all retry — Any stopReason: "error" is retried, regardless of error message
Automatic detection of 400/413, connection, credit, and stream exhaustion errors
Auto-continuation when the model hits its max output tokens (stopReason: "length") — indefinite, no cap, invisible to the LLM
Indefinite retry — Keeps retrying until success
Exponential backoff with cap: max 60s between retries
ALL triggers are invisible — custom messages with display: false, stripped by context handler (no TUI clutter, no conversation pollution)
Manual controls via unified /retry command
Non-retryable errors are explicitly logged so you know why we didn't retry

Installation

Option 1: Install via pi package (Recommended)

Install directly from GitHub as a pi package:

pi install https://github.com/monotykamary/pi-retry

Or add to your settings.json:

{
  "packages": [
    "https://github.com/monotykamary/pi-retry"
  ]
}

Option 2: Global Installation

Copy the extension to pi's global extensions directory:

cp retry.ts ~/.pi/agent/extensions/

Option 3: Project-Local Installation

Copy to your project's .pi/extensions/ directory:

mkdir -p .pi/extensions
cp retry.ts .pi/extensions/

Option 4: Quick Test

pi -e ./retry.ts

Usage

Once loaded, the extension automatically detects and retries all errors.

Manual Controls

Command	Description
`/retry`	Manually trigger immediate retry (auto-detects: 400/413, credit, connection, max_tokens, or any other error)
`/retry status`	Show current retry diagnostics for all error types + continuation state
`/retry reset`	Reset all retry counters and state

Configuration

Edit the constants at the top of retry.ts:

const BASE_DELAY_MS = 2000;        // Start with 2 seconds
const MAX_DELAY_MS = 60000;        // Cap at 60 seconds
const BACKOFF_MULTIPLIER = 2;      // Double each time
// Continuation is now invisible — no CONTINUATION_PROMPT needed

How It Works

Listen to agent_end event — Fires after each agent turn completes
Check for any error — Examine the last assistant message for stopReason === "error"
Blacklist check — Skip known permanent failures (invalid API key, model not found, etc.)
Categorize for messaging — Classify into 400/413, credit, connection, or other for nice UI notifications
Retry or continue (both invisible) — Wait (exponential backoff for errors), then trigger a new turn via pi.sendMessage() with customType, display: false, and triggerTurn: true
Context cleanup — The context event strips all custom-type triggers before the LLM sees them (insurance against custom convertToLlm overrides)
Indefinite continuation — Max_tokens auto-continues are uncapped; each continuation produces valid output and the model naturally terminates when done

The pi's built-in transform-messages already strips aborted/errored assistant messages from the LLM context, so the model never sees the failed attempts.

Detected Error Patterns

Catch-All (Any Error)

Any assistant message with stopReason === "error" is retried by default
Unknown provider errors, stream errors, unexpected failures — all handled automatically
Only skipped if it matches a known permanent failure (invalid API key, missing model, etc.)

Non-Retryable (Permanent Failures)

These are explicitly not retried:

Invalid API key / invalid authentication
API key not found / missing / revoked
Model not found / unknown model / no such model / model does not exist
Unsupported model

Max Tokens (stopReason: "length")

The model hit its max_tokens / output token limit
The model's response was truncated mid-generation
Auto-continuation sends an invisible custom message — no visible "Continue" prompt in the conversation

400/413 Errors

HTTP 400 Bad Request
HTTP 413 Payload Too Large
"bad request" messages
"payload too large" messages

Credit / Payment Errors

"Not Enough Credits"
"insufficient credits"
"insufficient balance"
"out of credits"
"Payment Required"
HTTP 402 status code

Connection Errors

Connection / network errors
Fetch failures
Socket hang up / socket errors
ECONNRESET, ECONNREFUSED, ETIMEDOUT, ENOTFOUND
DNS lookup failures
"Request ended without sending any chunks"
Upstream connect errors
TLS handshake errors
Timeouts awaiting response
Stream exhaustion ("Max outbound streams is 100, 100 open")
Stream limit errors

Development

Running Tests

# Run all tests
npm test

# Run tests in watch mode
npm run test:watch

# Run tests with coverage
npm run test:coverage

# Type check
npm run typecheck

# Dead code detection
npm run lint:dead

Project Structure

.
├── retry.ts                   # Main unified extension
├── src/                       # Shared utilities (testable, DRY)
│   ├── error-patterns.ts      # Error pattern matching, custom types, hasMaxTokensStop
│   ├── retry-logic.ts         # Retry utilities (calculateDelay, RetryState, ContinuationState, etc.)
│   └── index.ts               # Barrel exports
├── __tests__/                 # Unit tests
│   └── unit/
│       ├── error-patterns.test.ts
│       └── retry-logic.test.ts
├── vitest.config.ts           # Test configuration
└── knip.json                  # Dead code detection config

Code Quality

# Run all quality checks
npm test              # 99 unit tests
npm run typecheck     # TypeScript type checking
npm run lint:dead     # Dead code detection with knip

Troubleshooting

Extension not working?

Check that it's loaded in the startup header:

Loaded extensions: retry.ts

Retry not triggering?

Use the status command to diagnose:

/retry status

Want to see what's happening?

The extensions send notifications on retry attempts. Look at the footer status line for retry status updates. Non-retryable errors are logged as errors so you know why we stopped.

Too many retries?

Use /retry reset to clear the counters, or press Ctrl+C to abort the session.

Comparison with @georgebashi/pi-retry

The npm package @georgebashi/pi-retry handles "aborted" streaming errors but explicitly excludes "connection error" (assuming pi's built-in retry handles it). This extension:

Handles ALL errors via a catch-all — no more playing whack-a-mole with new error patterns
Handles connection errors that pi might not retry sufficiently
Handles 400/413 errors without compaction
Handles credit errors and stream exhaustion

They can work together for maximum coverage:

pi install npm:@georgebashi/pi-retry
# Plus install this extension

Limitations

Extensions cannot override pi's internal isRetryableError() check — they run after pi decides not to auto-retry
Error messages remain in the session history (but are invisible to the LLM)
May hit the same error repeatedly if the issue is persistent (use Ctrl+C to abort)
Warning: Retrying 400/413 without reducing context may fail repeatedly if the payload is genuinely too large
Non-retryable errors (invalid API key, missing model) are logged but not retried — you'll need to fix the underlying issue

Pi Coding Agent Extensions Docs
@georgebashi/pi-retry — Handles "aborted" streaming errors
Issue #252: Connection error with no retry

License

MIT