transcribe4all

transcribe4all is a self-hosted web application for painless speech-to-text transcription of audio files.

Install

$ go get github.com/hack4impact/transcribe4all
$ cd $GOPATH/src/github.com/hack4impact/transcribe4all

To set up Sphinx for transcription, read the following instructions.

Dependency management

If you add new dependencies to the app, run

$ godep save ./...

Configuration

The app looks for a file named config.toml in the current directory. The file should look something like this:

BackblazeAccountID = ""
BackblazeApplicationKey = ""
BackblazeBucket = ""
Debug = true
EmailUsername = "user@gmail.com"
EmailPassword = ""
EmailSMTPServer = "smtp.gmail.com"
EmailPort = 587
IBMUsername = ""
IBMPassword = ""
MongoURL = ""
Port = 8080
SecretKey = ""

Supply your Backblaze credentials to store audio files in the cloud after transcription is complete, or leave them empty to skip cloud storage.
Set Debug to true for verbose log messages.
Supply email credentials so that the app can email users when transcription is complete, or leave them empty to skip email notifications.
Supply your IBM Speech-To-Text credentials to transcribe audio files with the IBM Watson Speech-To-Text API.
Supply your MongoDB instance URL to store transcription information (such as timestamps, confidence, and keywords).
Set SecretKey to a random string. You can generate one here.
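
If you would rather generate the SecretKey value locally, a small Go program along these lines could work. This is a minimal sketch, not part of transcribe4all: the function name generateSecretKey, the 32-byte length, and the hex encoding are all assumptions chosen for illustration.

```go
package main

import (
	"crypto/rand"
	"encoding/hex"
	"fmt"
	"log"
)

// generateSecretKey returns a hex-encoded string built from n random bytes
// read from the operating system's cryptographically secure generator.
// The name and the hex encoding are illustrative choices, not the app's own.
func generateSecretKey(n int) (string, error) {
	b := make([]byte, n)
	if _, err := rand.Read(b); err != nil {
		return "", err
	}
	return hex.EncodeToString(b), nil
}

func main() {
	// 32 bytes is an assumed length; it yields 64 hex characters.
	key, err := generateSecretKey(32)
	if err != nil {
		log.Fatal(err)
	}
	// Prints a line that can be pasted into config.toml.
	fmt.Printf("SecretKey = %q\n", key)
}
```

Run it with go run and copy the printed line into config.toml.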

Run the app

$ go build
$ ./transcribe4all

How to use the app

Navigate to the app's index page at http://localhost:8080 (replace 8080 with the port you set).
Enter the URL of the audio file.
Enter a comma-separated list of the email addresses to notify when transcription is complete.
Enter a comma-separated list of the keywords to listen for in the audio.
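
To illustrate what a comma-separated list means for the last two fields, the sketch below shows one way such input could be split into individual items. It is a hypothetical helper (splitList is not a function from the app), and it assumes that surrounding whitespace and empty entries should be ignored; the app's actual parsing may differ.

```go
package main

import (
	"fmt"
	"strings"
)

// splitList splits a comma-separated string into trimmed, non-empty items.
// Hypothetical helper for illustration only.
func splitList(s string) []string {
	var items []string
	for _, part := range strings.Split(s, ",") {
		if item := strings.TrimSpace(part); item != "" {
			items = append(items, item)
		}
	}
	return items
}

func main() {
	emails := splitList("alice@example.com, bob@example.com")
	keywords := splitList("budget, deadline,,  launch ")
	fmt.Println(emails)   // [alice@example.com bob@example.com]
	fmt.Println(keywords) // [budget deadline launch]
}
```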

License

MIT License
