transcribe4all
transcribe4all is a self-hosted web application for painless speech-to-text transcription of audio files.
Install
$ go get github.com/hack4impact/transcribe4all
$ cd $GOPATH/src/github.com/hack4impact/transcribe4all
To set up Sphinx for transcription, read the following instructions.
Dependency management
If you add new dependencies to the app, run
$ godep save ./...
Configuration
The app looks for a file named config.toml in the current directory. The file should look something like this:
BackblazeAccountID = ""
BackblazeApplicationKey = ""
BackblazeBucket = ""
Debug = true
EmailUsername = "user@gmail.com"
EmailPassword = ""
EmailSMTPServer = "smtp.gmail.com"
EmailPort = 587
IBMUsername = ""
IBMPassword = ""
MongoURL = ""
Port = 8080
SecretKey = ""
- Supply your Backblaze credentials to store audio files in the cloud after transcription is complete, or leave them empty.
- Set Debug to true if you want extra verbose log messages.
- Supply email credentials so that the app can email users when transcription is complete, or leave them empty.
- Supply your IBM Speech-To-Text credentials in order to transcribe audio files using the IBM Watson Speech-To-Text API.
- Supply your MongoDB instance URL to store transcription information (such as timestamps, confidence, and keywords).
- Set SecretKey to a random string. You can generate one here.
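If you would rather generate the SecretKey value locally, the following is a minimal sketch using only the Go standard library. The function name generateSecretKey, the 32-byte length, and the hex encoding are assumptions made for illustration; the app itself only asks for a random string.

```go
package main

import (
	"crypto/rand"
	"encoding/hex"
	"fmt"
)

// generateSecretKey returns a hex-encoded string built from n random bytes
// read from the operating system's cryptographically secure source.
// The name and the choice of hex encoding are illustrative assumptions,
// not part of transcribe4all.
func generateSecretKey(n int) (string, error) {
	buf := make([]byte, n)
	if _, err := rand.Read(buf); err != nil {
		return "", err
	}
	return hex.EncodeToString(buf), nil
}

func main() {
	// 32 random bytes become a 64-character hex string.
	key, err := generateSecretKey(32)
	if err != nil {
		panic(err)
	}
	// Paste the printed value into config.toml as SecretKey = "...".
	fmt.Println(key)
}
```

Run it with go run and copy the printed value between the quotes on the SecretKey line of config.toml.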
Run the app
$ go build
$ ./transcribe4all
How to use the app
- Navigate to the app's index page at http://localhost:8080 (replace 8080 with the port you set).
- Enter the URL of the audio file.
- Enter a comma-separated list of all the email addresses which should be notified when transcription is complete.
- Enter a comma-separated list of all keywords to listen for in the audio.
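To illustrate the comma-separated format expected for email addresses and keywords, here is a small Go sketch of how such a list can be split into individual entries. The helper name splitList is hypothetical, and trimming whitespace and dropping empty entries are assumptions; this is not necessarily how the app parses the fields.

```go
package main

import (
	"fmt"
	"strings"
)

// splitList splits a comma-separated string into its entries, trimming
// surrounding whitespace and dropping empty entries.
// Hypothetical helper for illustration; the app's own parsing may differ.
func splitList(s string) []string {
	var out []string
	for _, part := range strings.Split(s, ",") {
		if p := strings.TrimSpace(part); p != "" {
			out = append(out, p)
		}
	}
	return out
}

func main() {
	fmt.Println(splitList("alice@example.com, bob@example.com"))
	// prints: [alice@example.com bob@example.com]
	fmt.Println(splitList("budget, deadline , ,meeting"))
	// prints: [budget deadline meeting]
}
```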
License
MIT License