-
### Description & Motivation
I have the same issue as https://github.com/Lightning-AI/pytorch-lightning/issues/17856 when training a DCGAN with Fabric + DeepSpeed.
The official example works fine with…
-
## ❓ Questions and Help
#### What is your question?
**If we fine-tune RoBERTa on RTE (or another GLUE dataset) and then perform inference at test time on the dev set, is it guaranteed that t…
-
I realized after working on #1087 that other functions in some files also have failing doctests / code examples in their documentation. In addition, the idea of running doctests in C…
-
Hi, I am using some waveforms that are not included in LALsuite (or in LALsuite-extra) to analyze several GW events. However, I ran into some trouble when trying to use them in bilby as I am following…
-
Hi - I've been doing a lot of work lately with interpolation in latent space, and I think linear interpolation might not be the best interpolation operator for high-dimensional spaces. Though admitted…
-
### Community Note
* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the…
-
Hi
Thanks for publishing the code in PyTorch! I have a few questions, however.
[1] For the loss associated with the auxiliary classifier fc, you are using `NLL Loss`, but the last layer is `Softma…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
### model
# model_name_or_path: /mnt/nas/shanzhi/eval_models/Qwen2-7B
model_name_or_path: /mnt/nas/liya…
-
**Describe the bug**
This test failed during a random test run. I was using the `removenan` branch, so the failure possibly doesn't exist on main.
```
__________________________________________________________________________…
-
Hi!
Thank you for your work on this project!
I'm training the s-model on a custom dataset, and I’ve encountered an issue after several successful epochs. Up until the 12th epoch, training seems to…