-
I'm using meta's NLLB for translation, now I have a situation where I have a stream of sentences in language A that I want to translate to language B
something akin to subtitles in movies or captions…
-
原文
1
00:00:04,042 --> 00:00:08,322
地球連邦軍並びにジオン公国の戦士に次ぐ
2
00:00:08,902 --> 00:00:10,742
我々はデラーズフリート
3
00:00:11,328 --> 00:00:13,488
デラーズフリート
4
00:00:13,488 --> 00:00:15,868
いわゆる一連戦争…
-
Hi,
For my research in low resource languages native for sri lanka (sin_Sinh and tam_Taml) we are following the great work done in NLLB paper by Facebook research. Apart from the nllb mined dataset, …
-
Hi everyone!
I tried to run the ReSeTox code, and discovered that it often produces cyclical hallucinations when I use the default decoding parameters from your code.
the code to reproduce m…
-
I followed the GPT-J tuto: https://opennmt.net/CTranslate2/guides/transformers.html#gpt-j
First I converted the model with this command:
`ct2-transformers-converter --model EleutherAI/gpt-j-6B -…
-
Is there a way to use Spanish instead of English, I recently setup the code, I will be trying to do it but any help on this would be awesome.
-
### System Info
Hello @LysandreJik,
Thanks a lot for your work on no language left behind.
Is there any plan to add the 54.4B Model?
Kindest regards
### Who can help?
_No response_
### I…
-
Environment:
- tokenizers: 0.13.3
- transformers: 4.28.1
- OS: Breaks on both Linux and Windows
Reproduction:
```python
from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_p…
-
Dataset https://huggingface.co/datasets/Muennighoff/flores200 has more than 40,000 configs. It's too much for our infrastructure for now. We should set a limit on it.
-
I made a wrapper around CTranslate2, an API server. It batches requests, supports running these batches on multiple gpus (sort of round-robin), requests can have different priority levels, requests ar…