pi-weighted-model-router

Pi extension that balances model/provider usage across weighted model pools.

Packages

Package details

extension

Install pi-weighted-model-router from npm and Pi will load the resources declared by the package manifest.

npm repo home report

$ pi install npm:pi-weighted-model-router

Package: pi-weighted-model-router
Version: 0.4.0
Published: Jun 15, 2026
Downloads: 775/mo · 187/wk
Author: eiei114
License: MIT
Types: extension
Size: 54.5 KB
Dependencies: 0 dependencies · 4 peers

Pi manifest JSON

{
  "extensions": [
    "./src/index.ts"
  ]
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

pi-weighted-model-router

Pi extension that selects a model from weighted pools at session start, then keeps the session on that model unless a provider error or input capability requires fallback.

What It Does

Picks one model from a named pool when a pi session starts.
Uses a daily balanced weighted strategy, so 7 / 2 / 1 stays close to that ratio across sessions in the same day.
Restores the same selected model when a session is resumed.
Falls back to another pool candidate on provider failure statuses such as 400, 429, 500, 502, 503, and 504.
Switches to a compatible model before image prompts when the selected model does not support image input.
Exposes one tool, model_router_config, so the agent can help update config after confirmation.
Adds colon flat commands such as /model-router:status, /model-router:next, and /model-router:configure for status, reselection, and guided weight setup.

Install

From npm:

pi install npm:pi-weighted-model-router

Project-local install:

pi install -l npm:pi-weighted-model-router

To pin a specific version:

pi install npm:pi-weighted-model-router@0.3.0

From a local checkout:

pi install /absolute/path/to/pi-weighted-model-router

For this repository only, .pi/settings.json loads the local package from ../. Start pi from this repository root and run /reload if an existing pi session is already open.

For temporary testing:

pi -e npm:pi-weighted-model-router
pi -e /absolute/path/to/pi-weighted-model-router

Config

When this repository is loaded through its project-local .pi/settings.json, config is stored at:

.pi/weighted-model-router/config.json

When installed globally, config is stored at:

~/.pi/agent/weighted-model-router/config.json

Example config. Replace provider and model IDs with entries that exist in your pi model registry:

{
  "version": 1,
  "defaultPool": "main",
  "strategy": "smooth-weighted-daily",
  "runtimeFallback": {
    "enabled": true,
    "statuses": [400, 429, 500, 502, 503, 504]
  },
  "sessionBoundary": {
    "restoreOn": ["startup", "resume"],
    "reselectOn": ["new", "reload", "fork"]
  },
  "pools": {
    "main": {
      "entries": [
        {
          "provider": "openai-codex",
          "model": "gpt-5.5",
          "weight": 7,
          "label": "Primary GPT-5.5"
        },
        {
          "provider": "cursor",
          "model": "gpt-5.5",
          "weight": 2,
          "label": "Secondary GPT-5.5"
        },
        {
          "provider": "another-provider",
          "model": "gpt-5.5",
          "weight": 1,
          "label": "Tertiary GPT-5.5"
        }
      ]
    }
  }
}

Provider and model IDs must exist in pi's model registry. If a model is registered but lacks credentials, the router skips it during selection. Some providers can also return 400 when a registered model is temporarily unavailable, disabled for the account, or unsupported by the upstream backend; by default that response is treated as a runtime fallback signal. The sample values are placeholders, not endorsements or guarantees that a provider exposes a specific model name.

sessionBoundary is optional. Defaults restore the saved model for startup and resume, but reselect on new, reload, and fork even when the previous session contains a saved router selection.

Session Boundary Behavior

The router decides whether to restore or reselect based on the session start reason. Defaults are:

Session start reason	Default action	Notes
`startup`	Restore	Attempts to reuse the saved router selection from the prior session.
`resume`	Restore	Continues the last session with the same router-selected model.
`new`	Reselect	Chooses a fresh weighted entry and records reason `new`.
`reload`	Reselect	Reloading the extension picks a new weighted entry.
`fork`	Reselect	Forked sessions can diverge from the parent selection.

Manual boundaries that trigger a reselect without starting a new session:

Trigger	Action	Notes
`/model-router:next`	Reselect	Keeps the same session, excludes the previous selection, reason `next`.
Config save	Reselect	`model_router_config` save (or `/model-router:configure`) uses reason `config`.
Manual `/model` or Ctrl+P	Outside router	Manual picks persist until the next router boundary (`new`, `reload`, `fork`, `next`, `config`).

Manual Model Changes

Manual model selection through pi (for example /model or the Ctrl+P model picker) is outside the router's control. The manual choice remains active until the next router boundary that reselects a model, such as new, reload, fork, /model-router:next, or a confirmed config save.

Usage

Start guided setup from the command:

/model-router:configure

The command sends a normal agent prompt that asks you about model candidates and desired weights one question at a time, then saves through model_router_config after confirmation.

You can also ask the agent in normal language:

Configure the model router so my primary GPT-5.5 provider has weight 7, my secondary GPT-5.5 provider has weight 2, and my tertiary GPT-5.5 provider has weight 1.

The agent should call model_router_config, show the change, and ask for confirmation before saving.

Show current status:

/model-router:status

Status includes current pool, current model, today's success counts, and config path.

Reselect a model at the current session boundary without starting a new session or reloading:

/model-router:next

The conversation history stays in the same session; the router appends a new weighted-model-router-selection entry with reason next and commits ledger usage only after the first successful provider response.

Legacy /model-router with a selection menu and /model-router next remain available for one release; prefer the colon commands above.

Privacy

The README uses placeholder provider and model IDs. Do not publish local config files, API keys, account identifiers, or provider-specific contract details.

Concurrency

Selection, ledger commit, and session-boundary handlers share in-memory state inside one extension instance. Overlapping async handlers can interleave unless serialized. See docs/RACE_CONDITIONS.md for risks, mitigations, and follow-up ideas.

Security

Pi packages can execute code with your local permissions. Review extensions before installing third-party packages.

For vulnerability reporting, see SECURITY.md.

Development

npm install
npm run check

The core selection, ledger, and config logic is testable without starting pi.