pi-weighted-model-router

Pi extension that balances model/provider usage across weighted model pools.

Install pi-weighted-model-router from npm and Pi will load the resources declared by the package manifest.

$ pi install npm:pi-weighted-model-router

Package: pi-weighted-model-router
Version: 0.3.0
Published: May 31, 2026
Downloads: 263/mo · 263/wk
Author: eiei114
License: MIT
Types: extension
Size: 48.9 KB
Dependencies: 0 dependencies · 4 peers

Pi manifest JSON

{
  "extensions": [
    "./src/index.ts"
  ]
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

Pi extension that selects a model from weighted pools at session start, then keeps the session on that model unless a provider error or input capability requires fallback.

What It Does

Picks one model from a named pool when a pi session starts.
Uses a daily-balanced weighted strategy (smooth-weighted-daily), so weights of 7 / 2 / 1 keep selections close to that ratio across sessions on the same day.
Restores the same selected model when a session is resumed.
Falls back to another pool candidate on provider failure statuses such as 400, 429, 500, 502, 503, and 504.
Switches to a compatible model before image prompts when the selected model does not support image input.
Exposes one tool, model_router_config, so the agent can help update config after confirmation.
Adds /model-router for a short status view or command-initiated weight setup.
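
To make the daily balancing idea concrete, here is a minimal sketch of one way a balanced weighted pick could work. It is not taken from the package source: the `PoolEntry` shape, `keyOf`, and `pickBalanced` are hypothetical names, and the scoring rule (pick the entry furthest below its weighted share of today's usage) is an assumption about what "daily balanced" means.

```typescript
// Hypothetical shape; the package's real types may differ.
interface PoolEntry {
  provider: string;
  model: string;
  weight: number;
}

// Key used to look up today's usage count for an entry.
const keyOf = (e: PoolEntry): string => `${e.provider}/${e.model}`;

// Pick the entry that is furthest below its weighted share of today's usage.
// Integer arithmetic avoids floating-point ties; ties go to the earlier entry.
function pickBalanced(
  entries: PoolEntry[],
  todayCounts: Map<string, number>,
): PoolEntry {
  const totalWeight = entries.reduce((sum, e) => sum + e.weight, 0);
  const totalCount = entries.reduce(
    (sum, e) => sum + (todayCounts.get(keyOf(e)) ?? 0),
    0,
  );
  let best: PoolEntry | undefined;
  let bestScore = -Infinity;
  for (const e of entries) {
    const count = todayCounts.get(keyOf(e)) ?? 0;
    // Expected share after the next pick minus actual usage, scaled by totalWeight.
    const score = e.weight * (totalCount + 1) - count * totalWeight;
    if (score > bestScore) {
      best = e;
      bestScore = score;
    }
  }
  if (best === undefined) throw new Error("pool has no entries");
  return best;
}

// Simulate ten session starts on one day with placeholder IDs and weights 7 / 2 / 1.
const pool: PoolEntry[] = [
  { provider: "provider-a", model: "model-x", weight: 7 },
  { provider: "provider-b", model: "model-x", weight: 2 },
  { provider: "provider-c", model: "model-x", weight: 1 },
];
const counts = new Map<string, number>();
for (let i = 0; i < 10; i++) {
  const chosen = pickBalanced(pool, counts);
  counts.set(keyOf(chosen), (counts.get(keyOf(chosen)) ?? 0) + 1);
}
// After ten picks the counts are 7, 2, and 1.
```

Unlike a purely random weighted draw, a deterministic rule like this cannot drift far from the configured ratio within a day, which matches the behavior described in the list above.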

Install

From npm:

pi install npm:pi-weighted-model-router

Project-local install:

pi install -l npm:pi-weighted-model-router

To pin a specific version:

pi install npm:pi-weighted-model-router@0.3.0

From a local checkout:

pi install /absolute/path/to/pi-weighted-model-router

For this repository only, .pi/settings.json loads the local package from ../. Start pi from this repository root and run /reload if an existing pi session is already open.

For temporary testing:

pi -e npm:pi-weighted-model-router
pi -e /absolute/path/to/pi-weighted-model-router

Config

When this repository is loaded through its project-local .pi/settings.json, config is stored at:

.pi/weighted-model-router/config.json

When installed globally, config is stored at:

~/.pi/agent/weighted-model-router/config.json

Example config. Replace provider and model IDs with entries that exist in your pi model registry:

{
  "version": 1,
  "defaultPool": "main",
  "strategy": "smooth-weighted-daily",
  "runtimeFallback": {
    "enabled": true,
    "statuses": [400, 429, 500, 502, 503, 504]
  },
  "sessionBoundary": {
    "restoreOn": ["startup", "resume"],
    "reselectOn": ["new", "reload", "fork"]
  },
  "pools": {
    "main": {
      "entries": [
        {
          "provider": "openai-codex",
          "model": "gpt-5.5",
          "weight": 7,
          "label": "Primary GPT-5.5"
        },
        {
          "provider": "cursor",
          "model": "gpt-5.5",
          "weight": 2,
          "label": "Secondary GPT-5.5"
        },
        {
          "provider": "another-provider",
          "model": "gpt-5.5",
          "weight": 1,
          "label": "Tertiary GPT-5.5"
        }
      ]
    }
  }
}

Provider and model IDs must exist in pi's model registry. If a model is registered but lacks credentials, the router skips it during selection. Some providers can also return 400 when a registered model is temporarily unavailable, disabled for the account, or unsupported by the upstream backend; by default that response is treated as a runtime fallback signal. The sample values are placeholders, not endorsements or guarantees that a provider exposes a specific model name.
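
The fallback rules in the paragraph above can be sketched as follows. This is an illustration under stated assumptions, not the extension's code: `Candidate`, `hasCredentials`, `shouldFallback`, and `nextCandidate` are hypothetical names, and choosing the first remaining candidate in pool order is a simplification (the real router may apply weights when falling back).

```typescript
// Hypothetical types for illustration only.
interface Candidate {
  provider: string;
  model: string;
  hasCredentials: boolean; // assumed to be derived from pi's model registry
}

interface RuntimeFallback {
  enabled: boolean;
  statuses: number[];
}

// True when a provider's HTTP status should trigger a fallback.
function shouldFallback(status: number, cfg: RuntimeFallback): boolean {
  return cfg.enabled && cfg.statuses.includes(status);
}

// Return the first candidate that has credentials and has not already failed.
function nextCandidate(
  pool: Candidate[],
  failed: Set<string>,
): Candidate | undefined {
  return pool.find(
    (c) => c.hasCredentials && !failed.has(`${c.provider}/${c.model}`),
  );
}

const fallbackCfg: RuntimeFallback = {
  enabled: true,
  statuses: [400, 429, 500, 502, 503, 504],
};
const candidates: Candidate[] = [
  { provider: "provider-a", model: "model-x", hasCredentials: true },
  { provider: "provider-b", model: "model-x", hasCredentials: false },
  { provider: "provider-c", model: "model-x", hasCredentials: true },
];

// provider-a returned 429, so it is marked as failed for this session.
const failedKeys = new Set<string>(["provider-a/model-x"]);
const fallback = shouldFallback(429, fallbackCfg)
  ? nextCandidate(candidates, failedKeys)
  : undefined;
// provider-b is skipped for missing credentials, so provider-c is chosen.
```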

sessionBoundary is optional. Defaults restore the saved model for startup and resume, but reselect on new, reload, and fork even when the previous session contains a saved router selection.
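
For readers who want to check a config before saving it, the example above can be described with types and a small validator. The field names mirror the JSON example; the type names, the optionality of `label`, and the validation rules (default pool must exist, pools must be non-empty, weights must be positive) are assumptions rather than documented behavior.

```typescript
// Types inferred from the example config; names and rules are assumptions.
interface PoolEntryConfig {
  provider: string;
  model: string;
  weight: number;
  label?: string;
}

interface RouterConfig {
  version: number;
  defaultPool: string;
  strategy: string;
  runtimeFallback: { enabled: boolean; statuses: number[] };
  sessionBoundary?: { restoreOn: string[]; reselectOn: string[] };
  pools: Record<string, { entries: PoolEntryConfig[] }>;
}

// Collect human-readable problems instead of throwing on the first one.
function validateConfig(cfg: RouterConfig): string[] {
  const problems: string[] = [];
  if (!(cfg.defaultPool in cfg.pools)) {
    problems.push(`defaultPool "${cfg.defaultPool}" is not defined in pools`);
  }
  for (const [name, pool] of Object.entries(cfg.pools)) {
    if (pool.entries.length === 0) {
      problems.push(`pool "${name}" has no entries`);
    }
    for (const e of pool.entries) {
      if (!Number.isFinite(e.weight) || e.weight <= 0) {
        problems.push(
          `pool "${name}": ${e.provider}/${e.model} needs a positive weight`,
        );
      }
    }
  }
  return problems;
}

const goodConfig: RouterConfig = {
  version: 1,
  defaultPool: "main",
  strategy: "smooth-weighted-daily",
  runtimeFallback: { enabled: true, statuses: [400, 429, 500, 502, 503, 504] },
  pools: {
    main: {
      entries: [{ provider: "provider-a", model: "model-x", weight: 7 }],
    },
  },
};

// Same config, but pointing at a pool that does not exist.
const badConfig: RouterConfig = { ...goodConfig, defaultPool: "missing" };
```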

Session Boundary Behavior

The router decides whether to restore or reselect based on the session start reason. Defaults are:

| Session start reason | Default action | Notes |
| --- | --- | --- |
| `startup` | Restore | Attempts to reuse the saved router selection from the prior session. |
| `resume` | Restore | Continues the last session with the same router-selected model. |
| `new` | Reselect | Chooses a fresh weighted entry and records reason `new`. |
| `reload` | Reselect | Reloading the extension picks a new weighted entry. |
| `fork` | Reselect | Forked sessions can diverge from the parent selection. |
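
The default decisions in the table above could be expressed roughly like this. The reason names and defaults come from this README; `boundaryAction` and its `hasSavedSelection` parameter are hypothetical, and reselecting when no saved selection exists is an assumption about what "attempts to reuse" implies.

```typescript
type StartReason = "startup" | "resume" | "new" | "reload" | "fork";
type BoundaryAction = "restore" | "reselect";

interface SessionBoundary {
  restoreOn: StartReason[];
  reselectOn: StartReason[];
}

// Defaults as documented in this README.
const DEFAULT_BOUNDARY: SessionBoundary = {
  restoreOn: ["startup", "resume"],
  reselectOn: ["new", "reload", "fork"],
};

// Decide what to do at session start. Restoring is only possible when a saved
// selection exists; otherwise this sketch reselects (an assumption).
function boundaryAction(
  reason: StartReason,
  hasSavedSelection: boolean,
  boundary: SessionBoundary = DEFAULT_BOUNDARY,
): BoundaryAction {
  if (boundary.restoreOn.includes(reason) && hasSavedSelection) {
    return "restore";
  }
  return "reselect";
}

const onResume = boundaryAction("resume", true); // "restore"
const onFork = boundaryAction("fork", true); // "reselect"
const onFirstStartup = boundaryAction("startup", false); // "reselect"
```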

Manual boundaries that trigger a reselect without starting a new session:

| Trigger | Action | Notes |
| --- | --- | --- |
| `/model-router next` | Reselect | Keeps the same session, excludes the previous selection, reason `next`. |
| Config save | Reselect | `model_router_config` save (or `/model-router` configure) uses reason `config`. |
| Manual `/model` or Ctrl+P | Outside router | Manual picks persist until the next router boundary (`new`, `reload`, `fork`, `next`, `config`). |

Manual Model Changes

Manual model selection through pi (for example /model or the Ctrl+P model picker) is outside the router's control. The manual choice remains active until the next router boundary that reselects a model, such as new, reload, fork, /model-router next, or a confirmed config save.
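
One way to picture this rule is as a small state transition: a manual pick replaces the active model, ordinary prompts leave it alone, and a router boundary hands control back to the router. The sketch below is purely illustrative; `SessionEvent`, `ActiveModel`, and `applyEvent` are hypothetical names, and runtime fallback is deliberately left out.

```typescript
type RouterBoundary = "new" | "reload" | "fork" | "next" | "config";

type SessionEvent =
  | { kind: "manual"; model: string }
  | { kind: "boundary"; reason: RouterBoundary; routerPick: string }
  | { kind: "prompt" };

interface ActiveModel {
  model: string;
  source: "router" | "manual";
}

// Apply one event to the active-model state.
function applyEvent(state: ActiveModel, event: SessionEvent): ActiveModel {
  switch (event.kind) {
    case "manual":
      // The user picked a model directly; the router does not interfere.
      return { model: event.model, source: "manual" };
    case "boundary":
      // A router boundary reselects and overrides any manual pick.
      return { model: event.routerPick, source: "router" };
    case "prompt":
      // Ordinary prompts never change the active model.
      return state;
  }
}

const initial: ActiveModel = { model: "router-model-a", source: "router" };
const events: SessionEvent[] = [
  { kind: "manual", model: "manual-model" },
  { kind: "prompt" },
  { kind: "boundary", reason: "next", routerPick: "router-model-b" },
];

// State after the manual pick and one prompt: still the manual model.
const afterPrompt = events.slice(0, 2).reduce(applyEvent, initial);
// State after the router boundary: back on a router-selected model.
const afterBoundary = events.reduce(applyEvent, initial);
```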

Usage

Start guided setup from the command:

/model-router

Choose Configure model weights. The command sends a normal agent prompt that asks you about model candidates and desired weights one question at a time, then saves through model_router_config after confirmation.

You can also ask the agent in natural language:

Configure the model router so my primary GPT-5.5 provider has weight 7, my secondary GPT-5.5 provider has weight 2, and my tertiary GPT-5.5 provider has weight 1.

The agent should call model_router_config, show the change, and ask for confirmation before saving.

Show current status from the same command:

/model-router

Choose Show status. The status view includes the current pool, the current model, today's success counts, and the config path.

Reselect a model within the current session, without starting a new session or reloading:

/model-router next

You can also choose Next model from /model-router. The conversation history stays in the same session; the router appends a new weighted-model-router-selection entry with reason next and commits ledger usage only after the first successful provider response.
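
The "commit only after the first successful response" behavior can be sketched as a two-step ledger: a selection is staged when it is made and counted only once the provider answers successfully. The entry type name `weighted-model-router-selection` and the reason `next` come from this README; the `DailyLedger` class, its methods, and the other field names are hypothetical.

```typescript
interface SelectionEntry {
  type: "weighted-model-router-selection";
  reason: string;
  provider: string;
  model: string;
}

class DailyLedger {
  private counts = new Map<string, number>();
  private pending: SelectionEntry | undefined;

  // Record a selection without counting it yet.
  stage(entry: SelectionEntry): void {
    this.pending = entry;
  }

  // Call after the first successful provider response for the staged selection.
  commit(): void {
    if (this.pending === undefined) return;
    const key = `${this.pending.provider}/${this.pending.model}`;
    this.counts.set(key, (this.counts.get(key) ?? 0) + 1);
    this.pending = undefined;
  }

  // Call when the provider fails before any success; nothing is counted.
  discard(): void {
    this.pending = undefined;
  }

  countFor(provider: string, model: string): number {
    return this.counts.get(`${provider}/${model}`) ?? 0;
  }
}

const ledger = new DailyLedger();

// First pick fails at the provider, so it never reaches the ledger counts.
ledger.stage({
  type: "weighted-model-router-selection",
  reason: "next",
  provider: "provider-a",
  model: "model-x",
});
ledger.discard();

// Second pick succeeds and is counted exactly once.
ledger.stage({
  type: "weighted-model-router-selection",
  reason: "next",
  provider: "provider-b",
  model: "model-x",
});
ledger.commit();
ledger.commit(); // a repeated commit is a no-op
```

Deferring the count this way keeps failed attempts from skewing the day's ratio toward providers that never actually served a response.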

Privacy

The README uses placeholder provider and model IDs. Do not publish local config files, API keys, account identifiers, or provider-specific contract details.

Development

npm install
npm run check

The core selection, ledger, and config logic is testable without starting pi.