@0xkobold/pi-web

Web search and content extraction for pi agents — DuckDuckGo/SearX search, cascade fetching (fast → readability → Playwright), deep research

Package details

← Back

extension

Install @0xkobold/pi-web from npm and Pi will load the resources declared by the package manifest.

npm repo home report

$ pi install npm:@0xkobold/pi-web

Package: @0xkobold/pi-web
Version: 0.2.0
Published: Apr 10, 2026
Downloads: 373/mo · 92/wk
Author: moikapy
License: MIT
Types: extension
Size: 75.9 KB
Dependencies: 0 dependencies · 2 peers

Pi manifest JSON

{
  "extensions": [
    "./dist/index.js"
  ]
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

pi-web

Web search and content extraction for pi-coding-agent — cascade fetching, multi-engine search, deep research.

Installation

pi install npm:@0xkobold/pi-web

Or as part of the meta-extension:

pi install npm:@0xkobold/pi-kobold

Features

Cascade Fetching — Fast HTML → Readability → Playwright for JS sites
Multi-Engine Search — DuckDuckGo (default) + SearXNG (fallback)
Deep Research — Search + fetch + synthesize from multiple sources
Playwright Pool — Browser reuse with concurrency limits and retries
Optional Dependency — Playwright is optional; fast fetch works without it

Tools

Tool	Description
`web_fetch`	Fetch and extract content from a URL
`web_search`	Search web, optionally fetch content from results
`web_research`	Deep research: search + fetch + multi-source synthesis

Commands

Command	Description
`/deep-fetch <url>`	Fetch JS-rendered content using Playwright
`/web-search-deep <query>`	Search + fetch from top results

Parameters

web_fetch

Parameter	Type	Default	Description
`url`	string	required	Full URL to fetch
`max_length`	number	5000	Maximum characters to retrieve
`use_playwright`	boolean	false	Force Playwright for JS content
`timeout_ms`	number	15000	Timeout in ms (max: 60000)

web_search

Parameter	Type	Default	Description
`query`	string	required	Search query
`limit`	number	5	Number of results (1-10)
`fetch_content`	boolean	false	Fetch full content from top results
`fetch_sources`	number	3	How many sources to fetch

web_research

Parameter	Type	Default	Description
`question`	string	required	Research question
`sources`	number	5	Number of sources to analyze (1-10)

Architecture

┌────────────────────────────────────────┐
│              pi-web                     │
├─────────────┬──────────────────────────┤
│  Search     │  Content Extraction      │
│  ─────────  │  ──────────────────────  │
│  DuckDuckGo │  1. Fast fetch (HTML)    │
│  SearXNG    │  2. Readability (regex)  │
│             │  3. Playwright (JS)       │
├─────────────┴──────────────────────────┤
│        Playwright Browser Pool          │
│        (concurrency: 2, pool TTL: 2m)  │
└────────────────────────────────────────┘

Development

cd packages/pi-web
bun install
bun run build

Optional: Playwright

For JavaScript-rendered content, install Playwright:

npm install playwright
npx playwright install chromium

Without Playwright, web_fetch still works using fast HTML fetch and readability extraction.