-
Hi, I would like to know the run command you used on the Gibson dataset and how the Gibson dataset configuration file is set up.
-
Thanks for the great work!
In the paper, for the CRA metric, the authors mentioned that "We train a content classifier as a feature extractor using [49] on a subset of the HumanML3D test set with a…
-
Currently, many evaluations of long-text models reference LongBench results. However, n-gram-based metrics do not truly reflect the quality of responses. Many papers have adopted the method of using G…
-
Hi, thanks for your excellent work! I have run this project, but I cannot achieve the results shown in your paper. Since I cannot find out why (the results are as follows), I wonder if you provide some…
-
Could you kindly provide the rendering code for evaluation?
-
Just want to confirm my understanding is right.
In the evaluation of SOT performance of Elysium from `otb.py`:
https://github.com/Hon-Wong/Elysium/blob/5e6d14ed6939cde3cbffaba5424d5d929c38e492/eva…
-
Thanks for such great work! Can I get the MD and LPIPS by just running run_eval.py?
Could you provide a more detailed description of training and evaluation in the README?
-
* Dart version and tooling diagnostic info (`dart info`)
```bash
If providing this information as part of reporting a bug, please review the information
below to ensure it only contains things you'…
-
icrot_hipporag.py includes a recall program.
I have a question about the evaluation process in the source code below.
The code below shows a title-level recall evaluation (meaning if sp is in some title == answer…
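
For readers unfamiliar with the term, title-level recall is commonly computed as the fraction of gold supporting-fact titles that appear among the top-k retrieved titles. The snippet below is only a minimal sketch of that general idea; the function name `title_recall_at_k` and its inputs are hypothetical and are not taken from icrot_hipporag.py, whose actual logic may differ.

```python
def title_recall_at_k(gold_titles, retrieved_titles, k):
    """Fraction of gold supporting-fact titles found in the top-k retrieved titles.

    Hypothetical helper for illustration; not the repository's implementation.
    """
    gold = set(gold_titles)
    if not gold:
        # No gold titles to recover; define recall as 0.0 to avoid dividing by zero.
        return 0.0
    top_k = set(retrieved_titles[:k])
    return len(gold & top_k) / len(gold)


# Illustrative data (made up for this sketch).
gold = ["Paris", "France"]
retrieved = ["Paris", "Berlin", "France", "Rome"]

recall_at_2 = title_recall_at_k(gold, retrieved, k=2)
recall_at_4 = title_recall_at_k(gold, retrieved, k=4)
```

Matching here is on exact title strings, so any normalization (case, whitespace) would have to happen before the call.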
-
### User story
As a challenge manager, in order to ensure that a valid form is available for the evaluators to use, I would like a form to be validated for errors and valid logic before it is saved.
…