@artale/pi-voice

Voice input for Pi. Multi-provider STT with Deepgram streaming, Groq Whisper, OpenAI Whisper. 56+ languages.

Package details

← Back

extension

Install @artale/pi-voice from npm and Pi will load the resources declared by the package manifest.

npm repo home report

$ pi install npm:@artale/pi-voice

Package: @artale/pi-voice
Version: 2.0.0
Published: Mar 14, 2026
Downloads: 104/mo · 18/wk
Author: artale
License: MIT
Types: extension
Size: 24.1 KB
Dependencies: 0 dependencies · 0 peers

Pi manifest JSON

{
  "commands": [
    "voice"
  ],
  "tools": [
    "voice_capture"
  ]
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

pi-voice

Voice input for Pi. Multi-provider STT with Deepgram streaming, Groq Whisper, and OpenAI Whisper. 56+ languages.

Install

pi install npm:@artale/pi-voice

Setup

Set at least one API key:

# Best quality — Deepgram Nova 3 with streaming ($200 free credit)
export DEEPGRAM_API_KEY="your-key"

# Fast and free — Groq Whisper
export GROQ_API_KEY="your-key"

# Reliable — OpenAI Whisper
export OPENAI_API_KEY="your-key"

Install an audio capture tool:

brew install sox        # macOS/Linux (recommended)
brew install ffmpeg     # alternative
# arecord is pre-installed on most Linux distros

Verify: /voice test

Commands

/voice [seconds]        — record and transcribe (default 10s)
/voice dictate          — continuous dictation (30s)
/voice test             — check audio + provider setup
/voice config [prov]    — set provider (auto/deepgram/groq/openai)
/voice lang [code]      — set language (en, es, fr, de, ja, zh, ko, ...)
/voice history          — recent transcriptions
/voice stop             — stop active recording

Provider fallback chain

Deepgram Nova 3 — streaming via WebSocket, best quality, 56+ languages
Groq Whisper Large v3 Turbo — fast, free tier
OpenAI Whisper-1 — reliable, paid

Auto-detection: uses the first available provider. Override with /voice config deepgram.

56+ Languages

/voice lang es          — Spanish
/voice lang ja          — Japanese
/voice lang zh          — Chinese (auto-switches to Nova 2)
/voice lang de          — German
/voice lang fr          — French

Full list: /voice lang

What's new in v2.0

Deepgram streaming via WebSocket + REST fallback
Multi-provider fallback chain (Deepgram → Groq → OpenAI)
56+ languages with auto model selection
Continuous dictation mode (/voice dictate)
Provider diagnostics (/voice test)
Better Windows support (ffmpeg dshow)
History with provider and language tracking

License

MIT