https://github.com/stts-se/chromedictator
Requires Go >= 1.12.
If you do not already have Go installed, download and install the most recent stable version from https://golang.org/dl/, then:
git clone https://github.com/stts-se/chromedictator
cd chromedictator
go get
To start the server:
go run chromedictator.go
or
go build
./chromedictator
Go to the following URL in Google Chrome:
http://localhost:7654
The server will create a audio_files
sub-directory in the corrent directory if it does not already exist.
The server will create a abbrevs.gob
file, containing mappings from abbreviations to expanded forms, if it does not already exist.
Download the latest zip file from releases, unzip, and run the binary for your OS.
The make
command will generate a zip file containing everything needed to run the server, including default executables for the following operating systems:
Here's a neat trick to record from your audio output of your computer, using PulseAudio (for Linux): https://unix.stackexchange.com/questions/130774/creating-a-virtual-microphone/153528#153528
Audio (media) file used by Google Chrome. Can be converted into .wav or other formats using e.g. ffmpeg
.
Metadata file accompanying the .webm file with the following fields:
Sample JSON can be found in audio_files/default/audiotst.json:
{
"session_id": "default",
"start_time": "2018-11-16T15:38:00.606Z",
"end_time": "2018-11-16T15:38:03.305Z",
"time_code_start": 12593,
"time_code_end": 15292
}
Text file containing the original recognition result.
Text file containing manually edited recognition result. May be identical to the contents of the .rec file.