-
Ways to measure how well the model is performing:
Perplexity
Human Eval (optional)
This is on feat/evaluation:
https://github.com/MichiganDataScienceTeam/F24-mini-copilot/tree/feat/evaluation
…
-
When I try to evaluate the quantized AWQ models using the video evalaution script, I'm getting FileNotFoundError.
```
FileNotFoundError: No such file or directory: "/hfhub/hub/models--Efficient-La…
-
Thanks for your great work! Could you additionally evaluate the Qwen2.5 and Qwen2 models? They also support 128K context length.
-
The MOT merely states that the model is pending evaluation without giving any information as to what this means and what it will take to change this.
This actually appears to be rooted in the fact th…
-
Dear author, the link to the T2M evaluation model does not grant me access. Could you please update it or grant permission?
![image](https://github.com/user-attachments/assets/ae44f465-de1b-4a7e-beac…
-
@castower, I hope you are having a great day and thank you for all of your fantastic work in preparing the lectures and explanations for us.
I just wanted to confirm that I am understanding lab 05 …
-
Hello
First of all, thank you for publishing this code. I'm having difficulty in evaluating the trained model. Adopting eval.py form SCAN seems not straight forward and I'm not sure whether I've do…
-
Hello,
do you have teh code for model evaluation
Regards,
-
![image](https://github.com/user-attachments/assets/77dd98ab-8a0c-4c40-933e-65f9f6e128f6)
![image](https://github.com/user-attachments/assets/3f61c5dc-2ed9-48dd-895d-a4790c4a4e2c)
During the train…
-
As discussed in our previous meeting, in our ML model interface we should distinguish between parameters used for training and evaluation.
- Allows for different configurations being used in the re…