**Open** · AmitMY opened this issue 1 year ago
We are done with our segmentation model: https://arxiv.org/abs/2310.13960
We should integrate it by:
- Removing free camera support; the user should be able to stop and restart the camera.
- Performing pose estimation and segmentation. Segments are stored as an array of arrays: sentences, and within them, signs.
- Showing segments in multiple ways:
  - When hovering over a segment, the video plays in a loop of only that segment.
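To make the "array of arrays" shape concrete, here is a minimal sketch of what such a structure could look like. The field names (`start_frame`, `end_frame`) and the helper function are illustrative assumptions, not the actual data model:

```python
# Hypothetical sketch: segments as an array of sentences,
# each sentence an array of sign segments with frame ranges.
segments = [
    [  # sentence 1
        {"start_frame": 0, "end_frame": 30},   # sign 1
        {"start_frame": 31, "end_frame": 55},  # sign 2
    ],
    [  # sentence 2
        {"start_frame": 60, "end_frame": 90},  # sign 1
    ],
]

def sentence_span(sentence):
    """Frame span covering a whole sentence, e.g. for loop playback on hover."""
    return sentence[0]["start_frame"], sentence[-1]["end_frame"]

print(sentence_span(segments[0]))  # -> (0, 55)
```

With this shape, the hover-to-loop behavior only needs the span of the hovered segment, whether it is a single sign or a whole sentence.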
Problem
Given a pose sequence, we would like to perform two types of segmentation.
- Sentence segmentation: every sentence should then be translated independently.
- Sign segmentation: every sign in a sentence should be transcribed to SignWriting independently.
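Segmentation models of this kind often emit per-frame BIO tags (Begin / Inside / Outside) that are then collapsed into spans. Assuming such an output format (the actual model may differ), the conversion could be sketched as:

```python
def bio_to_segments(tags):
    """Collapse per-frame BIO tags into (start, end) frame spans.

    A sketch of one common decoding scheme; the real model's output
    format may differ.
    """
    segments, start = [], None
    for i, tag in enumerate(tags):
        if tag == "B":
            if start is not None:       # previous segment ends here
                segments.append((start, i - 1))
            start = i                   # new segment begins
        elif tag == "O" and start is not None:
            segments.append((start, i - 1))
            start = None
    if start is not None:               # segment runs to the last frame
        segments.append((start, len(tags) - 1))
    return segments

print(bio_to_segments(["O", "B", "I", "I", "O", "B", "I"]))  # -> [(1, 3), (5, 6)]
```

Running the same decoding at two granularities (sentence tags and sign tags) would yield the nested sentences-and-signs structure described above.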
Description
We currently have such a segmentation model: https://github.com/sign-language-processing/transcription/tree/main/pose_to_segments. It works reasonably well for sentences, but not well at all for signs.
We should perhaps look into developing an autoregressive model like https://arxiv.org/pdf/2301.02214.pdf. That way, we could also perform segmentation live.
Alternatives
Use the existing model, which is bidirectional and therefore requires re-running on the whole sequence every time new frames arrive.
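The compute tradeoff between the two approaches can be sketched as follows. Assuming a fixed per-frame cost, re-running a bidirectional model after every incoming frame does quadratic total work, while a causal (autoregressive) model processes each frame once:

```python
def bidirectional_rerun_cost(num_frames, per_frame_cost=1):
    """Re-running a bidirectional model after each new frame:
    frame t requires reprocessing all t frames, so total work is quadratic."""
    return sum(t * per_frame_cost for t in range(1, num_frames + 1))

def autoregressive_cost(num_frames, per_frame_cost=1):
    """A causal model consumes each incoming frame exactly once: linear work."""
    return num_frames * per_frame_cost

print(bidirectional_rerun_cost(100))  # -> 5050
print(autoregressive_cost(100))       # -> 100
```

This is an idealized model (it ignores batching and incremental caching tricks), but it illustrates why the bidirectional alternative becomes expensive for live, frame-by-frame use.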
Additional context
No response