Numen is voice control for desktop computing without a keyboard or mouse. It works system-wide on your Linux machine.
A short demonstration can be found on: https://numenvoice.com
go (>=1.19) is required.
The speech recognition library and an english
language model (about 40MB) can be installed with
If this throws a 404 error, the Vosk team hasn't provided a binary for
The standard mode requires the dotool command,
which can be installed with
numen itself can be installed with
The standard mode requires permission to /dev/uinput to create the virtual input device. This permission is granted to users in group input, which your user is likely in already.
If numen does complain about permission, you could give your user permission
Once you've got a microphone, you can run it with:
You should be able to type "hey" by saying "hoof eve yank", and transcribe a sentence after saying "scribe". You can terminate it by pressing Ctrl+c or saying "troll cap".
If nothing happened, you might need to specify the right microphone with the
--mic option. See
numen --list-mics for what's available.
The default phrases are in the
/etc/numen/phrases/ directory and I'd
start by looking at
character.phrases with the alphabet and symbols, and
control.phrases with the modifiers, backspace and friends. Have a go in
your text editor.
Voice control makes an efficient keyboard but a wack mouse. At first I thought I'd need something like eye tracking, but now I just use keyboard based programs, which are thankfully the most productive kind of program. The main two are Neovim, my text editor, and qutebrowser, my web browser.
I've also made a desktop environment that works well with voice control, called tiles.
GPLv3 only, see LICENSE.
Copyright (c) 2022-2023 John Gebbie