-
This is the list of proposed tasks. It is to be extented. You can propose more tasks.
You can also find the previous lists here:
2021/2022: https://github.com/ClickHouse/ClickHouse/issues/29601
2…
-
Currently, ngrams options add 6 query parameters in the route, some of which (e.g., size, position) might also be used as field names on corpora, in which case they'd clash with associated filters. Al…
-
**Debugging checklist**
[x] Have you read the troubleshooting page (https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/troubleshooting.html) and searched the documentation to ensur…
-
Hi @maartenmarx ,
De vooruitgang van deze week is als volgt:
1. Het invoegen van de vertaalde landen is momenteel werkend, maar ik ben er achter gekomen dat het totaal inefficient is om per line d…
-
Hi, thanks for your great contributions!
When I run the command:
lm_eval --tasks xxx(sst2,hellaswag,mmlu) --model hf --model_args pretrained=/local/path/to/model --device cuda:1 --batch_size 20
…
-
## 🐛 Bug
**Describe the bug**
Whenever i try to download unsupervised learning dataset: EnWik9 i get error as shown below. I tried it 3 times and it failed with the same error every time.
-----…
-
**Describe the bug**
I am getting the following error:
Error scanning zip file "/content/MetArt/2024/test.zip": failed to lookup charset IBM424_ltr, language he
I narrowed down that if the jpg …
-
Hi @maartenmarx ,
De vooruitgang van deze week is als volgt:
- Alle ngrams tot en met 5 lang zijn nu geindexeerd, echter ben ik er achter gekomen dat er twee kleine problemen waren:
- Woorden d…
-
We sometimes keep track of tokens matched to dictionary patters but it is not easy (see https://github.com/quanteda/quanteda/issues/2063). `tokens_replace()` can be used add keys to original tokens (e…
-
Hey, I had an error in my training.
```
trainer = Trainer(order=6, max_vocab_size=100000, min_count=32)
trainer.train(w, workers=2, batch_size=1000)
```
but I got an error `AttributeError: Ca…