-
this is just a suggestion that this repo can add datasets and benchmarks, such as openai/gsm8k, lighteval/MATH etc... where CoT format solution is provided.
As you know, only some datasets have en…
-
Hey @docyx
Thanks a ton for the datasets and the gem! They’ve been super useful for my school project.
Quick question—any chance you could add compatibility and benchmark info for the necessary…
-
Add relevant datasets/benchmarks with links to papers.
-
Hi,
Thank you for your insightful work and this repository.
I was wondering if you plan to release the benchmarks proposed in the paper?
Thanks!
-
Hi,
Thank you for your amazing work on this paper. I found it truly insightful. I wanted to inquire about the release of the Universal NER Benchmark Data mentioned in the paper and outlined in Appe…
-
Hi GLOMAP team,
Thank you so much for the work! Since GLOMAP paper mentions benchmark results on Lamar datasets, I want to know how to integrate GLOMAP with Lamar pipeline so that I can benchmark o…
-
Hi
Really excellent work collating this benchmark (& all your excellent contributions to mutation effect prediction). My team and I have found the data really useful, and we're super excited to see…
-
Hi, I'm curious about how Nomos-v2 is collected? What's more, is it better than other datasets? Is there a benchmark on performance?
-
The above two datasets are also frequently used benchmarks in few-shot-learning
Do you have any plans to provide code support for that part?
-
### Is your feature request related to a problem or challenge?
JOB (Join Order Benchmark) was proposed by a research team from TUM in the paper ["How Good Are Query Optimizers, Really?"](https://w…