Sign Language Translator

This project provides a sign language translation service using MediaPipe for keypoint extraction and a fine-tuned Whisper model for translation. The service is deployed on Google Cloud Run and managed with Terraform.

Documentation

Project Structure

sign-language-translator/
├── app/                  # Application code
├── terraform/           # Infrastructure as Code
│   ├── environments/    # Environment-specific configs
│   └── modules/         # Reusable Terraform modules
├── build/              # Build configurations
└── setup/              # Setup scripts

Prerequisites

Google Cloud SDK
Terraform
Python 3.9+
Docker
(Optional) Hugging Face account and access token for the real model
Build essentials (for Linux users):

sudo apt-get update
sudo apt-get install python3-dev build-essential

Getting Started

Initialize the GCP project:

./setup/init-gcp-project.sh

Set up infrastructure:

cd terraform/environments/dev
terraform init
terraform plan
terraform apply

Development Options

Local Development

Install Requirements

# Option A: Recommended - Using Virtual Environment
# This keeps your project dependencies isolated and prevents conflicts
python -m venv venv
source venv/bin/activate  # On Windows: .\venv\Scripts\activate
pip install -r app/requirements.txt

# Option B: Direct Installation
# Not recommended but faster for quick testing
pip install -r app/requirements.txt

Mock Model (Default)

# Option 1: Default behavior
python app/main.py

# Option 2: Explicit mock configuration
export USE_MOCK_MODEL=true
python app/main.py

Real Model

export USE_MOCK_MODEL=false
export HUGGINGFACE_TOKEN=your_token_here
export MODEL_PATH=your-org/sign-language-translator
python app/main.py

Docker Development

Mock Model

docker build -t sign-language-translator ./app
docker run -p 8080:8080 sign-language-translator

Real Model

docker build -t sign-language-translator ./app
docker run -p 8080:8080 \
  -e USE_MOCK_MODEL=false \
  -e HUGGINGFACE_TOKEN=your_token_here \
  -e MODEL_PATH=your-org/sign-language-translator \
  sign-language-translator

Deployment

Mock Model Deployment

module "cloudrun" {
  source      = "../../modules/cloudrun"
  project_id  = var.project_id
  region      = var.region
  environment = "dev"

  env_variables = {
    USE_MOCK_MODEL = "true"
  }
}

Production Model Deployment

module "cloudrun" {
  source      = "../../modules/cloudrun"
  project_id  = var.project_id
  region      = var.region
  environment = "dev"

  env_variables = {
    USE_MOCK_MODEL = "false"
    MODEL_PATH     = "your-org/sign-language-translator"
  }

  secrets = {
    HUGGINGFACE_TOKEN = {
      secret_id = "huggingface-token-dev"
      version   = "latest"
    }
  }
}

Testing the API

Health Check

curl http://localhost:8000/health

Process Video

# Using your own video
curl -X POST -F "video=@path/to/your/video.mp4" http://localhost:8000/process-sign-language

# Using the example ASL video
curl -X POST -F "video=@app/data/samples/asl_example.mp4" http://localhost:8000/process-sign-language

You can find an example ASL video in the repository at app/data/samples/asl_example.mp4. This example video is sourced from Pexels, a free stock video platform.

Environment Variables

Variable	Description	Default
USE_MOCK_MODEL	Use mock model instead of real model	true
HUGGINGFACE_TOKEN	Token for accessing HF model	None
MODEL_PATH	Path to HF model	your-org/sign-language-translator
PORT	Port for the application	8000

Infrastructure Components

Cloud Run service for the application
Artifact Registry for container images
Secret Manager for Hugging Face token
IAM configurations for service accounts
Monitoring and alerting setup

opencampus-sh / sign-language-translator

readme