-
Hi guys, thanks for open-sourcing this great work!
It seems LLama3 is using “right” padding and using “eos_token“ as the “padding_token”. Could you help verify that if I want to train this model, wh…
-
#### Describe the workflow you want to enable
If the target is discrete multiclass, but ordinal (ordered) in nature (e.g. Likert scale, user ratings, preference levels), as opposed to nominal, wo…
-
Thanks for releasing models with reduced size https://twitter.com/xenovacom/status/1698742891118493905 .
I was thinking of further reduction using compression algorithm like brotli. I have tested c…
-
I am trying to run tutorial notebook in my local computer. Every step is fine until i face the error when running cell for Train and Evaluation. It says IndexError: Target 15 is out of bounds. I don't…
-
Thank for your great work !
When I remove the note at the line of 264 picture below to use the API of _decode_, I get the reconstructed picture
with mosaic patches, and I do not know what someth…
-
Dear Eagle Team:
Hello, and thank you very much for your excellent work for the community. Recently, while attempting to replicate Eagle, I encountered some issues that I have been unable to resolv…
-
A short discussion among @drastogi4, Moet and @minxu74 on Oct. 14, 2020, we discussed that:
- the heat map generated by Moet contains more than 75 indices describing both mean and extreme performan…
-
## Description
Slyforce@ [has reported](https://discuss.mxnet.io/t/mx-nd-argmax-slow-on-gpu-with-high-reduction-dimensions/1231) a slow performance of argmax compared to max. I've tried it on EC2 mac…
-
What is the method that makes the visual of the feature of bins?T-Sne?
-
### What is the feature?
Current RTMO is designed for one class human keypoint detection. I try to train it for multi-class keypoint detection, but get a error
../aten/src/ATen/native/cuda/Scatter…