-
Add relevant datasets/benchmarks with links to papers.
-
Thanks for the great work, I was wondering if there is a timeline for dataset and benchmark release.
-
- [x] A script to load a model and then run the model on evaluation benchmark datasets
- [x] Hellaswag
- [x] MMLU
- [ ] Winograde5
- [ ] ARC
- [ ] GSM8K
- [ ] In the YAML config file, specify …
-
Great Paper!
Is the GWS15k benchmark dataset available somewhere?
-
### Dataset name
NorBench
### Dataset link
https://github.com/ltgoslo/norbench
### Dataset languages
- [ ] Danish
- [ ] Swedish
- [X] Norwegian (Bokmål or Nynorsk)
- [ ] Icelandic
- [ ] Faroese
-…
-
### Issue
I need help getting the example code on the README.md to work. I am now concentrating on the Benchmark datasets (https://github.com/microsoft/torchgeo?tab=readme-ov-file#benchmark-datasets)…
-
-
hello i would like to know if the weights of this implementation have been released, or if it has been implemented in any system. the paper is interesting.
are there any benchmarks with other exist…
-
Hi.
I am unable to reproduce the benchmark results in the paper for test split in `distil-whisper/tedlium` using model `distil-whisper/distil-large-v2` when using `run_eval.py`. However, I am able…
-
### Dataset name
Einbuergerungstest
### Dataset link
https://www.lsa.umich.edu/german/hmr/231/LRC/Einbuergerungstest.html
### Dataset languages
- [ ] Danish
- [ ] Swedish
- [ ] Norwegian (Bokmål …