glm-vision
Pi extension that gives non-vision GLM models (z.ai) image understanding via GLM-4.6V
$ pi install npm:glm-visionExtensions, skills, prompt templates, and themes published to npm. Install with pi install npm:<package>. See the package docs for details.
Pi extension that gives non-vision GLM models (z.ai) image understanding via GLM-4.6V
$ pi install npm:glm-visionAccurate spatial reasoning over images — vision model extracts bounding boxes, LLM calculates exact distances
$ pi install npm:pi-accurate-visionOpenRouter multimodal tools for Pi — search, fetch, image gen, vision, video, PDF, TTS, STT
$ pi install npm:@dtmirizzi/pi-openrouter-multimodalAutomatic image, video and audio description for any model in Pi. Routes media to a multimodal model and injects descriptions into context.
$ pi install npm:pi-multimodal-proxyPi extension that registers an inspect_image tool — analyzes local image files using a configurable vision-capable model via OpenAI-compatible API
$ pi install npm:pi-inspect-imageImage analysis tool using multimodal vision models with memory sessions.
$ pi install npm:@zhushanwen/pi-visionSimple image understanding for Pi Coding Agent. Install it, set your API key, and ask normal questions about images.
$ pi install npm:@lokiyou/modelscope-visionPi Agent extension that adds a describe_image tool, letting non-multimodal models delegate image analysis to a vision-capable model (like Qwen VL)
$ pi install npm:pi-vision-tool