-
Hi, I got an error when I ran the lm-eval command:
`Traceback (most recent call last):
File "/vol3/ctr/.conda/envs/hzx1/bin/lm_eval", line 8, in
sys.exit(cli_evaluate())
File "/vol3/ctr/l…
-
Trying to run the test.py and inference.py using [pretrained_models](https://drive.google.com/drive/folders/1NoDE3plZoqF_O00xei0woH5cPy67arXq?usp=sharing) returns 0.0 avg_precision and random keypoint…
-
Hope this message finds you well. Your work has been a great inspiration, and I look forward to potentially contributing to this area of research. While the paper provides a comprehensive methodology…
-
The current implementation lacks cross-validation, which could lead to biased results. Implementing k-fold cross-validation would provide a better assessment of model performance and generalization.
-
After generating a file for java: `generations_multiple-java.json`
and building a docker image for multiple eval: `sudo make DOCKERFILE=Dockerfile-multiple all`
I get the following error when runn…
-
**My observation**
- With https://github.com/ys-zong/VLGuard/blob/main/VLGuard_eval.py, I am able to reproduce results not too far from Table 2 for **VLGuard dataset**.
- However, **I cannot reprodu…
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…
-
move links from community resources to MED to better structure website by relevant content
-
-
Hi, everyone.
I’ve noticed a significant discrepancy between the evaluation results of the MM Math dataset and the results reported in the original paper.
In the original MM Math paper, GPT-4o…