@artale/pi-eval

Agent evaluation harness. Judge sessions on success, tool usage, efficiency, methodology. Inspired by opencc.

Package details

extension

Install @artale/pi-eval from npm and Pi will load the resources declared by the package manifest.

$ pi install npm:@artale/pi-eval
Package
@artale/pi-eval
Version
1.3.2
Published
Apr 21, 2026
Downloads
198/mo · 18/wk
Author
artale
License
MIT
Types
extension
Size
20.9 KB
Dependencies
0 dependencies · 0 peers
Pi manifest JSON
{
  "commands": [
    "eval"
  ],
  "tools": [
    "eval_judge",
    "eval_handoff"
  ]
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

@artale/pi-eval

Agent evaluation harness. Judge coding sessions on methodology, efficiency, and success.

Install

npm install -g @artale/pi-eval

Tools

  • eval_judge — Score a session's tool calls, errors, completion
  • eval_handoff — Validate agent handoff confidence calibration

Commands

  • /eval — Evaluation utilities