-
Dataloader name: `vimmrc/vimmrc.py`
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?vimmrc
| Dataset | vimmrc |
|-------------|---|
| Description | ViMMRC, a challenging mach…
-
### Describe the bug
I have a script downloading `amazon_reviews_multi`.
When the download starts, I get
```
Downloading data files: 0%| | 0/1 [00:00
AccessDeniedAccess DeniedAGJWS…
-
The full Hungarian wiki has ~4.3 GB of data, but only ~2.5 GB of unique string content:
> cat data/huwiki-latest-pages-meta-current.xml | sed 's/[\t ]/\n/g' | grep -v ^$ | sort | uniq | wc -m
>
> 25073845…
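
For readers who prefer to reproduce this count outside the shell, here is a minimal Python sketch of what the pipeline above appears to compute: split on tabs, spaces, and newlines, drop empty tokens, deduplicate, and count the characters of the unique tokens plus one newline each (as `wc -m` would see them). The function name `unique_token_chars` is hypothetical, and the sketch assumes the unique tokens fit in memory, which may not hold for the full dump.

```python
import re
from typing import Iterable

# Same separators as the sed expression 's/[\t ]/\n/g', plus the newline itself.
_SEPARATORS = re.compile(r"[\t \n]")


def unique_token_chars(lines: Iterable[str]) -> int:
    """Count characters of unique tokens, one trailing newline per token.

    Hypothetical helper: mirrors `sed | grep -v ^$ | sort | uniq | wc -m`.
    """
    seen = set()
    for line in lines:
        for token in _SEPARATORS.split(line):
            if token:  # equivalent to grep -v ^$
                seen.add(token)
    # wc -m also counts the newline that terminates each unique line.
    return sum(len(token) + 1 for token in seen)


# Usage (path taken from the command above):
# with open("data/huwiki-latest-pages-meta-current.xml", encoding="utf-8") as f:
#     print(unique_token_chars(f))
```

Iterating over the file object keeps only one line plus the set of unique tokens in memory, rather than the whole dump. Note that `len` counts Unicode code points, which matches `wc -m` only under a UTF-8 locale.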
-
# Training, evaluating, and interpreting topic models | Julia Silge
At the beginning of this year, I wrote a blog post about how to get started with the stm and tidytext packages for topic modeling. …
-
### Describe the bug
`get_eval_refs` returns a *string* instead of a *list* when loading a dataset that has been saved with the HF `save_to_disk` API.
This means that if you try to run an eval u…
-
### Cautions:
**Before starting the task, please refer to [Add data of ML-YouTube-Courses](https://github.com/orgs/ocademy-ai/projects/3/views/1?filterQuery=label%3Adata&pane=issue&itemId=36101499)…
-
### 🐛 Describe the bug
File: OLMo/olmo/train.py
In the following training loop, does pre-training stop after only 1 epoch?
```
@property
def max_epochs(self) -> int:
    if isinstance(se…
-
Table of Contents
- [Housekeeping](#housekeeping)
- [Named Concepts](#named-concepts)
- [Summary](#summary)
- [Reference-Level Explanation](#reference-level-explanation)
- [Alt…
-
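As the title says, the README left me a bit confused: does setting `with_lora` to true mean fine-tuning with LoRA? I could not find a parameter in the code that explicitly selects ptv2 fine-tuning. Could someone please clarify?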
-
The following error occurred while running the script finetune_moe.sh:
The model has moe layers, but None of the param groups are marked as MoE. Create a param group with 'moe' key set to True before…