-
I get the following error when I run both a Hugging Face model and a faster-whisper model on the same GPU:
```bash
self.model = ctranslate2.models.Whisper(
^^^^^^^^^^^^^^^^^^^^^…
-
### **Problem summary**
To create a child cluster with controllers as VMs (not as pods) on vSphere with the help of CAPV, you need to create a `VSphereCluster` resource:
```yaml
apiVersion…
-
Hi. I'm trying to fine-tune NLLB on a new, unseen language according to the steps from [here](https://www.reddit.com/r/MachineLearning/comments/w4jg7q/d_hey_reddit_were_a_bunch_of_research_scientists/?a…
-
### Model description
Hi!
I recently trained a CLIP model with an NLLB text encoder to extend CLIP capabilities to 201 languages of the Flores-200 dataset. As far as the implementation goes, it is…
-
It would be great to have ggml support for Facebook's No Language Left Behind 200x200 translation model:
https://ai.facebook.com/research/no-language-left-behind/
https://huggingface.co/facebook…
-
I tried using Meta's `facebook/nllb-200-distilled-600M` model, but it seems that `hidden_states` is not being set on the `self.emb_model` output (line 65). I'm getting:
> ValueError: You have to sp…
-
As the second recipe after NLLB, write the w2v-BERT (and wav2vec2) pretraining recipe for users to check out. This will likely branch into several subtasks once we start working on it.
-
## 🐛 Bug
When translating from eng_Latn to zho_Hant, parts of the source text are always left untranslated. This does not happen with zho_Hans. Even yue_Hant gives better results than zho_Hant.
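
As a side note on the codes themselves: they follow the FLORES-200 pattern of a lowercase three-letter language code, an underscore, and a four-letter script code with a capital first letter (for example `zho_Hant`). The sketch below is only an illustration of normalizing the casing before passing a code to the tokenizer. `normalize_flores_code` is a hypothetical helper name, not part of any library, and it does not check that the code is one the model actually supports.

```python
import re

# FLORES-200 style code: three letters, an underscore, four letters.
# The casing is deliberately loose here so that the helper can repair it.
_CODE_RE = re.compile(r"^([A-Za-z]{3})_([A-Za-z]{4})$")


def normalize_flores_code(code: str) -> str:
    """Return the code in canonical casing, e.g. 'Zho_hans' becomes 'zho_Hans'.

    Hypothetical helper: it only fixes the casing and raises ValueError
    when the input does not have the xxx_Yyyy shape at all.
    """
    match = _CODE_RE.match(code.strip())
    if match is None:
        raise ValueError(f"not a FLORES-200 style code: {code!r}")
    lang, script = match.groups()
    # Language part is lowercase; script part has only its first letter capitalized.
    return f"{lang.lower()}_{script.capitalize()}"
```

For instance, `normalize_flores_code("yue_hant")` gives `yue_Hant`, which could then be used as the source or target language code.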
### To Reproduce
Steps …
-
I've been observing that for models that take a large number of steps to reach the early stopping criteria (~20k+ steps), increasing the learning rate significantly (5e-5 --> 2e-4) often cuts the numb…
-
### System Info
- `transformers` version: 4.42.0.dev0
- Platform: Windows-10-10.0.20348-SP0
- Python version: 3.9.7
- Huggingface_hub version: 0.23.3
- Safetensors version: 0.4.3
- Accelerate …