-
Hi,
I am interested in reproducing the results from the CIRR dataset. Could you please share the evaluation code for CIRR?
I saw the CIRR evaluation results from your paper.
Thank you.
-
The functionality defined in `src\utiliy` should be moved to a separate python package.
Perferably, we should include the evaluation functionality in CADET-Process.
This could encompass a tool tha…
-
Hi,
I am interested in reproducing the results from the DTIN dataset. Could you please share the evaluation code for DTIN?
Thank you.
-
Hi,
I would like to express my sincere gratitude for your excellent work. I'm trying to reproduce the subtype evaluation part and I'm wondering if there are any additional processing steps or scripts…
-
Thank you very much for the code! Can you provide the evaluation code of matlab version again?
-
Can you provide the evaluation code? When I tested it on the MMBenchmark with a 1B model, the performance was quite low, only around 19.
-
Hi. Thank you for the nice dataset. I am trying to find out how your evaluation code works in detail.
To evaluate on the test set, we have N=17536 samples/frames and C=7 conditions. So my assumptio…
-
Hi there
Thanks for sharing your work!
I wonder if you can share the pre-trained model and the evaluation codes for downstream tasks to ensure reproducibility.
Thank you.
-
Hi, may I know whether the evaluation code related to the metrics mentioned in the paper has been released? i.e.: FID
-
Code evaluation task/benchmark such as HumanEval and MBPP are missing from **lm-evaluation-harness**, but are present and maintained in **bigcode-evaluation-harness**.
https://github.com/bigcode-pr…