G5-EIT-VR-Landsby / STT

MIT License
0 stars 0 forks source link

EiT - Speech to Text Module

This module uses WhisperLive (https://github.com/collabora/WhisperLive) package to achieve a nearly real-time transcription module including VAD (Voice Activity Detection). Modifications are made on the WhisperLive source code to fit this project.

Specifications

Requirements

Running server

With Docker

GPU (recommended):

  docker build . -t whisper-live -f docker/Dockerfile.gpu
  docker run -it --gpus all -p 9090:9090 whisper-live:latest

CPU:

docker build . -t whisper-live -f docker/Dockerfile.cpu
docker run -it -p 9090:9090 whisper-live:latest

Locally

python run_server.py

Running client

python run_client.py