-
When training the MLLM, we want to unfreeze the Oryx ViT. We're using DeepSpeed ZeRO-3, and there are issues with gradient backpropagation.
![20241101-150723](https://github.com/user-attachments/…
-
I'm not satisfied by any of the explanations I have seen of backpropagation, and I want to see if I can do better.
-
Hi,
I have implemented the forward pass using a convolution + sigmoid_fwd activation and am now working on the backpropagation of the graph. However, according to the [document](https://docs.nvidia.…
-
Hi.
[ Disclaimer: I am pretty new - to github, ANNs and NEAT as well. ]
I was wondering if executing backpropagation with a certain scheme might help training.
Is it currently possible to do so? …
-
- [x] Optimise-then-discretise
- [ ] Reversible solvers
- [ ] Interpolating adjoints
- [x] Checkpointing?
- [ ] Quadrature?
-
### What is happening and why is it wrong?
The [`num_unique_paulis` field](https://qiskit.github.io/qiskit-addon-obp/stubs/qiskit_addon_obp.utils.metadata.SliceMetadata.html#qiskit_addon_obp.utils.me…
-
## Description and motivation
I'm trying to make inference with different timesteps on a neural network that's trained with Feedback Alignment from biotorch, however it is showing the same accuracy…
-
I want to use `mkpts_0, mkpts_1 = xfeat.match_xfeat(im1, im2, top_k = 4096)` to **optimize** different targets based on the distance between matches.
However, even if I remove the inference stuff, th…
-
-
Hello, Alexey!
Do you have plans to implement such feature, so it will be possible to visualise what the network has learnt, or which parts of an input image are most characteristic of certain class?…