Transcription for your desktop.
A software that records what you say when you press a button down, and types what you said when you release it.
[!IMPORTANT] ⚠️ This video contains sound and is intended to be listened to with audio on. ⚠️
https://github.com/sloganking/desk-talk/assets/16965931/e5da605b-3a9d-4394-b4ec-a3de65605a65
Make sure ffmpeg is installed and added to your PATH
Assign your OpenAI API key to the OPENAI_API_KEY
environment variable and run:
desk-talk --ptt-key scroll-lock
Or pass your OpenAI API key as a flag like so:
desk-talk --ptt-key scroll-lock --api-key [YOUR_API_KEY]
desk-talk
will now record every time you hold down the ptt-key, and type what you spoke every time you release it.
[!NOTE]
You can manage your OpenAI API keys at https://platform.openai.com/api-keys
Other Usage
To find the name of a key by pressing it, run:
desk-talk show-key-presses
If your key shows as Unknown(number)
, pass number
to the --special-ptt-key
flag like so:
desk-talk --special-ptt-key 125
To use a microphone other than the system default, run
desk-talk list-devices
to get a list of system microphone names. And pass the desired microphone name to --device
like so:
desk-talk --ptt-key scroll-lock --device "Microphone (3- USB Audio Device)"