@senad-d/micme

Local voice-to-text for pi, optimized for fast Whisper-style backends.

Packages

Package details

extension

Install @senad-d/micme from npm and Pi will load the resources declared by the package manifest.

npm repo home report

$ pi install npm:@senad-d/micme

Package: @senad-d/micme
Version: 0.2.0
Published: Jun 19, 2026
Downloads: 195/mo · 195/wk
Author: senad-d
License: MIT
Types: extension
Size: 1,011.8 KB
Dependencies: 0 dependencies · 2 peers

Pi manifest JSON

{
  "extensions": [
    "./src/extension.ts"
  ]
}

Security note

Pi packages can execute code and influence agent behavior. Review the source before installing third-party packages.

README

Micme is a pi extension for short coding prompts. It records your microphone with ffmpeg, transcribes locally with whisper.cpp or another local backend, and inserts the transcript into pi.

Local-first: no telemetry and no cloud STT service by default.
Review-first: transcripts paste into the editor unless you enable auto-submit.
Pi-native: install globally, project-locally, from git, or from a source checkout.
Configurable: use /micme conf, ~/.pi/agent/micme.json, or shell variables.

Security: pi packages run with your full system permissions. Micme can use your microphone, spawn local commands, write MICME_* settings to ~/.pi/agent/micme.json, and optionally download Whisper model files. Read SECURITY.md.

Quick Start
Installation
Backend Setup
Configuration
Commands
Models and Backends
Troubleshooting
Diagnostics
Update and Uninstall
Development
Publishing
License

Quick Start

pi install npm:@senad-d/micme

Install a local backend. On macOS:

brew install ffmpeg whisper-cpp

Start pi and configure Micme:

pi

/micme conf
/micme devices

Use it:

Press alt+m / Option+m to start recording.
Speak your prompt.
Press alt+m again to stop.
Review the pasted transcript and press Enter.

Micme is toggle-based. Press-and-hold recording is not used because terminal key-up events are not portable.

Installation

Scope	Command	Notes
Global	`pi install npm:@senad-d/micme`	Loads in every trusted pi project.
Project-local	`pi install npm:@senad-d/micme -l`	Writes to `.pi/settings.json` in the current project.
One run	`pi -e npm:@senad-d/micme`	Try without changing settings.
Git	`pi install git:github.com/senad-d/micme@<tag>`	Pin a tag or commit.
Local checkout	`pi -e .`	Develop or test this repository.

Source checkout:

git clone https://github.com/senad-d/micme.git
cd micme
npm install --ignore-scripts
npm run doctor
pi -e .

Use the checkout globally while developing:

pi install /absolute/path/to/micme

Backend Setup

Micme does not bundle recorder, transcriber, or model binaries.

OS	Install	Device setting
macOS	`brew install ffmpeg whisper-cpp`	`MICME_AUDIO_DEVICE=0` or `MICME_AVFOUNDATION_INPUT=:0`
Linux	Install `ffmpeg` and `whisper.cpp` with your package manager, Nix, Homebrew, or source build.	`MICME_PULSE_SOURCE=default`
Windows	`winget install Gyan.FFmpeg`, then install/build `whisper.cpp`.	`MICME_DSHOW_AUDIO_DEVICE=Microphone Name`

List devices inside pi:

/micme devices

macOS device listing outside pi:

ffmpeg -hide_banner -f avfoundation -list_devices true -i ""

If whisper-cli is not on PATH, set it explicitly:

MICME_WHISPER_CPP_BIN=/path/to/whisper-cli
MICME_WHISPER_CPP_MODEL=/path/to/ggml-small.en.bin

Configuration

Micme reads settings from shell environment variables and from the global Micme config file at ~/.pi/agent/micme.json. Shell variables win. /micme conf writes only MICME_* keys to that JSON file, so settings follow you across pi projects on the same machine.

The usual setup path is to run:

/micme conf

You can also edit the JSON manually. Minimal ~/.pi/agent/micme.json:

{
  "$schema": "https://raw.githubusercontent.com/senad-d/micme/main/micme.schema.json",
  "MICME_LANGUAGE": "en",
  "MICME_AUTO_DOWNLOAD_MODEL": "1",
  "MICME_DEFAULT_WHISPER_CPP_MODEL": "small.en",
  "MICME_AUTO_SUBMIT": "0",
  "MICME_KEEP_AUDIO": "0"
}

For temporary one-off overrides, set shell variables when starting pi, for example MICME_KEEP_AUDIO=1 pi.

Common settings:

Variable	Meaning
`MICME_TRANSCRIPTION_MODE=clip`	Stable default: transcribe after recording stops.
`MICME_TRANSCRIPTION_MODE=stream`	Experimental append-only live dictation with `whisper-stream`.
`MICME_STREAM_KEEP_CONTEXT=0`	Stream default: avoid Whisper prompt carry-over so live chunks are less likely to rewrite each other.
`MICME_STREAM_FLUSH_MS=650`	Stream profile quiet interval before tentative words are committed append-only.
`MICME_STREAM_FINALIZE_WITH_CLIP=0`	Keep the append-only live transcript on stop. Set `1` to opt in to final clip replacement.
`MICME_AUTO_SUBMIT=0`	Paste for review. Set `1` to send automatically.
`MICME_SHORTCUT=alt+m`	Toggle shortcut. Restart or `/reload` after changing.
`MICME_PRINTABLE_SHORTCUTS=§`	macOS Option-key fallback.
`MICME_VALIDATE_AUDIO=1`	Reject near-silent recordings.
`MICME_KEEP_AUDIO=0`	Delete successful temp audio. Set `1` for debugging.
`MICME_MODEL_DIR=~/.cache/whisper.cpp`	Model cache/discovery directory.

See micme.example.json for the full JSON template.

Streaming mode treats whisper-stream output as repeated, overlapping hypotheses rather than committed word deltas. Micme only appends committed words to the editor during live streaming; tentative words are held internally and committed after agreement, overlap, stop, or the quiet interval configured by MICME_STREAM_FLUSH_MS. If MICME_STREAM_FINALIZE_WITH_CLIP=1, the stop-time clip transcript can intentionally replace the live append-only text.

Commands

Command	Description
`/micme`	Toggle recording.
`/micme conf`	Open the TUI configuration screen.
`/micme devices`	Show compact audio/video capture device inventory.
`/micme last`	Paste the previous transcript again.
`/micme audio`	Show the last kept audio directory.
`/micme help`	Show short help.

Models and Backends

Default backend: whisper.cpp via whisper-cli.

With MICME_AUTO_DOWNLOAD_MODEL=1, Micme downloads missing standard models into MICME_MODEL_DIR. Disable downloads with:

MICME_AUTO_DOWNLOAD_MODEL=0

Recommended model progression: base.en for speed, small.en for a stronger default, medium.en for accuracy.

Advanced users can replace the recorder or transcriber:

MICME_RECORD_COMMAND=ffmpeg -hide_banner -loglevel error -f avfoundation -i :0 -ac 1 -ar 16000 -y {audio}
MICME_TRANSCRIBE_COMMAND=whisper-cli -m /path/to/model.bin -f {audio} -otxt -of {tempDirRaw}/out -nt -np && cat {tempDirRaw}/out.txt

{audio}, {tempDir}, and {transcript} are shell-quoted. *Raw placeholders bypass quoting and should only be used when you fully control the command.

Troubleshooting

Problem	Try
No backend found	Install `whisper.cpp`, put `whisper-cli` on `PATH`, or set `MICME_WHISPER_CPP_BIN`.
Wrong microphone	Run `/micme devices`, then set the OS-specific device variable.
Unrelated transcript	You probably recorded silence. Set `MICME_KEEP_AUDIO=1` and check `/micme audio`.
Slow transcription	Use `whisper.cpp`, a smaller model, and shorter recordings.
Option/Alt inserts `§` or `µ`	Set `MICME_PRINTABLE_SHORTCUTS=§` or change `MICME_SHORTCUT`, then `/reload`.
Need automatic sending	Set `MICME_AUTO_SUBMIT=1`.

Diagnostics

npx -p @senad-d/micme micme-doctor

From a source checkout:

npm run doctor

The doctor checks Node, pi, ffmpeg, whisper.cpp, optional whisper-stream, model paths, and macOS devices when available. Custom command values are redacted.

For stream state investigation, start pi with MICME_STREAM_DIAGNOSTICS=1. Diagnostics are opt-in and include sanitized frames, extraction mode, committed words, and tentative candidate words.

Update and Uninstall

pi update --extensions                # update installed pi packages
pi update npm:@senad-d/micme          # update Micme only
pi remove npm:@senad-d/micme          # remove global install
pi remove npm:@senad-d/micme -l       # remove project-local install

Development

npm ci
npm run validate

Publishing

Micme publishes to npm as @senad-d/micme. You need an npm account with publish access to the @senad-d scope.

npm login
npm whoami
npm run publish:npm

The publish script asks for the version number, validates the package, runs npm version <version> to update package.json and package-lock.json, creates the v<version> git tag, publishes with npm publish --access public, and then offers to push the release commit and tag.

Run it only from a clean working tree after updating CHANGELOG.md.

License

MIT

Table of Contents