Voxtype
Voxtype is a push-to-talk voice dictation tool. Hold a key, speak, release — the transcription is typed into whatever is focused. Built on OpenAI Whisper, compiled with Vulkan GPU acceleration for real-time inference on FreeBSD.
Install
Section titled “Install”voxtype-installBuilds from source using cargo with the gpu-vulkan feature flag. Requires wtype and pipewire. Downloads the base.en Whisper model automatically on first install.
| Action | Command |
|---|---|
| Start recording | voxtype-start |
| Stop and transcribe | voxtype-stop |
| Check status | voxtype-status |
The keybinding Super + Z is configured as hold-to-record: press to start, release to transcribe and type the result.
Keybinding
Section titled “Keybinding”| Shortcut | Action |
|---|---|
| Super + Z (hold) | Record audio — release to transcribe and type |
GPU Acceleration
Section titled “GPU Acceleration”If vulkaninfo is available at install time, Voxtype is configured to use Vulkan for Whisper inference. This keeps transcription fast without requiring CUDA or a Linux environment.
voxtype setup gpu --enable # enable Vulkan manuallyvoxtype status # confirm GPU is activeConfig
Section titled “Config”Default config is installed to ~/.config/voxtype/config.toml. Edit to change model, language, or output method.