-
I'm working on a [C++ implementation of Plutus](https://github.com/sierkov/daedalus-turbo/tree/main/lib/dt/plutus) aimed at optimizing batch synchronization. We'd like to benchmark our implementation …
-
Hi,
Thank you for your amazing work on this paper. I found it truly insightful. I wanted to inquire about the release of the Universal NER Benchmark Data mentioned in the paper and outlined in Appe…
-
### Dataset name
Schibsted text tasks
### Dataset link
https://huggingface.co/collections/Schibsted/schibsted-text-tasks-66655bce94d0f40432519347
### Dataset languages
- [ ] Danish
- [X] Swedish
…
-
**Is your feature request related to a problem? Please describe.**
There are multiple artificial dataset creation functions. It should be clear which ones are most useful and when.
**Describe the sol…
-
Create a script to transform external datasets into the accepted format, and include the benchmark datasets ReDial and INSPIRED.
-
### System Info
GPU: Nvidia H100
Model: Llama3 8B
### Who can help?
@kaiyux
### Information
- [x] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially suppo…
-
### Your current environment
The output of `python collect_env.py`
```text
Your output of `python collect_env.py` here
```
### Model Input Dumps
_No response_
### 🐛 Describe the bug
…
-
Add relevant datasets/benchmarks with links to papers.
-
I propose adding a Model Evaluation and Benchmarking System to ML Nexus to help users assess their model performance on standardized datasets and compare it against benchmarked scores. This feature wo…