@artale/pi-eval
Agent evaluation harness. Judge sessions on success, tool usage, efficiency, methodology. Inspired by opencc.
Package details
Install @artale/pi-eval from npm and Pi will load the resources declared by the package manifest.
$ pi install npm:@artale/pi-eval- Package
@artale/pi-eval- Version
1.3.2- Published
- Apr 21, 2026
- Downloads
- 198/mo · 18/wk
- Author
- artale
- License
- MIT
- Types
- extension
- Size
- 20.9 KB
- Dependencies
- 0 dependencies · 0 peers
Pi manifest JSON
{
"commands": [
"eval"
],
"tools": [
"eval_judge",
"eval_handoff"
]
}Security note
Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.
README
@artale/pi-eval
Agent evaluation harness. Judge coding sessions on methodology, efficiency, and success.
Install
npm install -g @artale/pi-eval
Tools
- eval_judge — Score a session's tool calls, errors, completion
- eval_handoff — Validate agent handoff confidence calibration
Commands
/eval— Evaluation utilities