vision-handoff

Vision handoff extension for pi - send images to a vision-capable model for analysis


Install vision-handoff from npm and Pi will load the resources declared by the package manifest.

$ pi install npm:vision-handoff

Pi manifest JSON

{
  "extensions": [
    "./extensions/vision-handoff"
  ]
}

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

A pi extension that sends images to a separate vision-capable model for analysis when the current model cannot see images.

Install as a pi package from a local checkout (substitute the path to your own copy):

pi install /home/dwi/Project/vision-handoff-package

Once the package has been published to a git repository, you can install it from there instead.

Create a vision.json configuration file in your project root or .pi/ directory:

{
  "provider": "anthropic",
  "model": "claude-3-5-sonnet-20241022",
  "apiKey": "your-api-key-here"
}

Or place it in ~/.pi/vision.json for global configuration.
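
The three locations above suggest a simple lookup. The following is a minimal sketch of how such a search could work, not the extension's actual code: the helper names `candidateConfigPaths` and `resolveConfigPath` are hypothetical, and the search order (project root, then .pi/, then ~/.pi/) is an assumption, since the README does not say which file wins when several exist.

```typescript
// Hypothetical helper: lists the places where vision.json may live.
// ASSUMPTION: project-level files are checked before the global one.
function candidateConfigPaths(projectRoot: string, homeDir: string): string[] {
  // Drop trailing slashes so the joined paths never contain "//".
  const trim = (dir: string): string => dir.replace(/\/+$/, "");
  return [
    `${trim(projectRoot)}/vision.json`,
    `${trim(projectRoot)}/.pi/vision.json`,
    `${trim(homeDir)}/.pi/vision.json`,
  ];
}

// Returns the first candidate that exists, or undefined when none does.
// `exists` is injected so the sketch does not depend on a filesystem API.
function resolveConfigPath(
  projectRoot: string,
  homeDir: string,
  exists: (path: string) => boolean,
): string | undefined {
  return candidateConfigPaths(projectRoot, homeDir).find(exists);
}
```

In a real Node.js program, `exists` could be `fs.existsSync`; it is passed in here so the lookup order can be exercised without touching the disk.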

Configuration options:

provider - Provider name (e.g., "anthropic", "openai", "google")
model - ID of a vision-capable model (its input capabilities must include "image")
baseUrl - Optional custom base URL for the provider
api - Optional API type override
apiKey - Optional API key (can also come from environment variables or auth storage)
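
The option list above can be written down as a type. This is a sketch under stated assumptions rather than the extension's own definition: the names `VisionConfig` and `parseVisionConfig` are hypothetical, and treating provider and model as required is an inference from the fact that only the other three options are marked optional.

```typescript
// Shape of vision.json as described by the option list.
interface VisionConfig {
  provider: string;
  model: string;
  baseUrl?: string;
  api?: string;
  apiKey?: string;
}

// Hypothetical parser: turns the file contents into a VisionConfig,
// throwing a descriptive error when a field is missing or mistyped.
function parseVisionConfig(json: string): VisionConfig {
  const raw: unknown = JSON.parse(json);
  if (typeof raw !== "object" || raw === null || Array.isArray(raw)) {
    throw new Error("vision.json must contain a JSON object");
  }
  const cfg = raw as Record<string, unknown>;
  // ASSUMPTION: provider and model are the only required fields.
  for (const key of ["provider", "model"]) {
    if (typeof cfg[key] !== "string" || cfg[key] === "") {
      throw new Error(`"${key}" is required and must be a non-empty string`);
    }
  }
  for (const key of ["baseUrl", "api", "apiKey"]) {
    if (cfg[key] !== undefined && typeof cfg[key] !== "string") {
      throw new Error(`"${key}" must be a string when present`);
    }
  }
  return cfg as unknown as VisionConfig;
}
```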

The extension registers a vision_handoff tool that the LLM can use when it needs to analyze images but cannot see them itself.

Single image:

vision_handoff({
  prompt: "What's in this image?",
  imagePath: "/path/to/image.png"
})

Multiple images:

vision_handoff({
  prompt: "Compare these images and describe the differences",
  imagePaths: ["/path/to/image1.png", "/path/to/image2.jpg"]
})

With existing base64 data:

vision_handoff({
  prompt: "Analyze this image",
  images: ["data:image/png;base64,iVBORw0KGgo..."]
})
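
The three calls above imply an argument object with a prompt plus one of several image sources. The sketch below models that shape and shows how the inputs could be normalized. `VisionHandoffArgs`, `collectImagePaths`, and `parseImageDataUrl` are hypothetical names, and whether the real tool accepts imagePath and imagePaths in the same call is an assumption.

```typescript
// Hypothetical argument shape inferred from the three example calls.
interface VisionHandoffArgs {
  prompt: string;
  imagePath?: string;
  imagePaths?: string[];
  images?: string[]; // base64 data URLs
}

// Gathers every file path from the arguments into one list.
// ASSUMPTION: imagePath and imagePaths may be combined.
function collectImagePaths(args: VisionHandoffArgs): string[] {
  return [
    ...(args.imagePath ? [args.imagePath] : []),
    ...(args.imagePaths ?? []),
  ];
}

// Splits a base64 image data URL into its MIME type and payload.
// Returns undefined when the string is not a well-formed image data URL.
function parseImageDataUrl(
  url: string,
): { mimeType: string; base64: string } | undefined {
  const match = /^data:(image\/[a-z0-9.+-]+);base64,([A-Za-z0-9+/]+=*)$/i.exec(url);
  return match ? { mimeType: match[1], base64: match[2] } : undefined;
}
```

Validating a data URL before sending it lets a caller reject truncated or non-image input early instead of passing it on to the vision model.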

The extension shows a notification on session start if a vision model is configured.
If no config is found, the tool returns an error.
The tool checks that the configured model supports image input.
API keys are resolved through the same auth system as pi (environment variables, auth.json, etc.).
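
The checks described above (missing config, model without image input) can be sketched as a single pre-flight step. This is an illustration only: `ModelInfo`, `Preflight`, and `preflight` are hypothetical names, the model lookup is injected because the README does not describe pi's model registry, and the error messages are invented for the example.

```typescript
// Hypothetical model descriptor. The README only states that a vision
// model must list "image" among its input capabilities.
interface ModelInfo {
  id: string;
  input: string[];
}

type Preflight = { ok: true } | { ok: false; error: string };

// Runs the two documented checks before any image is sent.
// `lookup` stands in for whatever registry resolves a model ID (assumption).
function preflight(
  config: { model: string } | undefined,
  lookup: (id: string) => ModelInfo | undefined,
): Preflight {
  if (!config) {
    return { ok: false, error: "No vision config found (vision.json)" };
  }
  const model = lookup(config.model);
  if (!model) {
    return { ok: false, error: `Unknown model: ${config.model}` };
  }
  if (!model.input.includes("image")) {
    return { ok: false, error: `Model ${model.id} does not accept image input` };
  }
  return { ok: true };
}
```

Returning a result object rather than throwing matches the note that the tool returns an error, which lets the calling LLM read the message and react.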

License: MIT