A modern web application for transcribing audiofiles on your own server.
Powered by models like whisper-v3 and pyannote/speaker-diarization. Try it yourself!

Why Shoutout?

Shoutout is a state of the art web application for transcribing audiofiles including speaker diarization and time stamps. Due to its high accuracy level, ideal data privacy and easy navigation, shoutout is perfectly suited for transcriptions of interviews in qualitative research or sensitive corporate recordings.

Shoutout provides:

a simple to use web-interface.
highly accurate transcriptions leveraging the open source transcription model whisper-v3.
automatic speaker detection including timestamps.
perfect data privacy: Shoutout runs 100% local and does not share any of your data with external services.
highly efficient and fast transcriptions using GPU acceleration and a scalable architecture.
easy deployment due to a completely dockerized build.

Screenshots

Creating a new job

.assets/Shoutout_1.png

Web-interface with an overview of all jobs

.assets/Shoutout_2.png

Downloading transcripts after finishing a job

.assets/Shoutout_3.png

Results example:

SPEAKER_00 00:00:00
Sure, okay, so for documentation purposes. Are you okay with me recording the interview?

SPEAKER_01 00:00:12
Yes, I agree to the audio recording.

SPEAKER_00 00:00:17
Okay, then let's start. Could you first briefly introduce yourself, describe your background and what your doing...

SPEAKER_01 00:00:28
Well, my name is ...

Architecture

.assets/arch.png

quickstart

Its recommended to use docker and docker-compose

Docker

To setup all services just run following command in the root directory.

Before running the following command, please update the environment variable MINIO_ENDPOINT inside the docker-compose.yml to an external reachable hostname! This container is called directly from the frontend.

If you want the worker to support your gpu you have to install the nvidia-container-toolkit on the host

docker compose -f docker-compose.prod.yml up -d

It will setup 6 containers:

Build of the dashboard at localhost:8000
PostgresDB on port 5433
MinIO S3 Bucket at localhost:9001
A MinIO client initializing the s3 bucket permissions
RabbitMQ at localhost:15672
Worker-Container (gpu support)

Development

Make sure that all services (postgres, minio, rabbitmq) are running

Dashboard

To start developing the dashboard run following commands, it will start the dev.

cd dashboard

npm i

npm run dev

Database

When making any changes to the database, be aware to migrate them!

npx prisma migrate dev --name {MigrationName}

Open http://localhost:3000

Worker

First activate and install all requirements into your virtualenv.

cd worker

pip install -r requirements.txt

To develop and test the worker just run the script without a container.

Be aware to stop the worker-container if it's running!

python3 main.py

Environment variables

Dashboard

NAME	DEFAULT VALUE	DESCRIPTION
`DATABASE_URL`	`postgresql://admin:admin@localhost:5433/postgres?schema=public`	It is required for prisma to connect with the postgres database.
`RABBITMQ_URL`	`amqp://rabbit:rabbit@localhost`	URL of the rabbitmq
`QUEUE_NAME`	`jobs`	The name of the job-queue
`MINIO_ENDPOINT`	`localhost`	This is the endpoint of minio server. It will be the IP address of the server.
`MINIO_PORT`	`9000`	Minio port for communication from dashboard.
`MINIO_ACCESS_KEY`	`shoutoutdevuser`	Access key for minio dev user.
`MINIO_SECRET_KEY`	`shoutoutdevuser`	Secret key for minio dev user.
`MINIO_JOB_BUCKET`	`shoutout-job-bucket`	Bucket name to store all audio files.
`DOWNLOAD_FILE_TARGET_DIR`	`finished-files/`	Folder on S3-Bucket containing transcribed files
`FINISHED_FILE_FORMAT`	`.txt`	The download format of the finished file
`UPLOAD_FILE_TARGET_DIR`	`to-transcribe/`	Folder on S3-Bucket to upload mp3 files to
`MINIO_SSL_ENABLED`	`false`	SSL setting for S3 Bucket

Worker

NAME	DEFAULT VALUE	DESCRIPTION
`DATABASE_HOST`	`localhost`	PostgresDB host
`DATABASE_NAME`	`postgres`	Database name
`DATABASE_USER`	`admin`	PostgresDB username
`DATABASE_PASSWORD`	`admin`	PostgresDB password
`DATABASE_PORT`	`5433`	PostgresDB port
`RABBITMQ_HOST`	`localhost`	Rabbitmq host
`RABBITMQ_USER`	`rabbit`	Username for rabbitmq
`RABBITMQ_PASSWORD`	`rabbit`	Password for rabbitmq
`RABBITMQ_QUEUE`	`jobs`	The name of the job queue
`MINIO_JOB_BUCKET`	`shoutout-job-bucket`	Bucket name to store all audio files.
`MINIO_SECRET_KEY`	`shoutoutdevuser`	Secret key for minio dev user.
`MINIO_ACCESS_KEY`	`shoutoutdevuser`	Access key for minio dev user.
`MINIO_URL`	`http://localhost:9000`	URL of S3 Bucket
`TMP_FILE_DIR`	`tmp_downloads`	Local directory where all temporary files which are needed for transcription are stored
`UPLOAD_FILE_TARGET_DIR`	`finished-files/`	Folder on S3-Bucket to upload finished transcription to
`DOWNLOAD_FILE_DIR`	`to-transcribe/`	Folder on S3-Bucket containing mp3 files to transcribe
`WHISPER_MODEL`	`large-v3`	openai whisper model size
`FINISHED_FILE_FORMAT`	`.txt`	File format of the transcribed file

RWTH-TIME / shoutout

readme

Why Shoutout?

Screenshots

Creating a new job

Web-interface with an overview of all jobs

Downloading transcripts after finishing a job

Results example:

Architecture

quickstart

Docker

Development

Dashboard

Database

Worker

Environment variables

Dashboard

Worker