-
I was able to run the playground demo as late as last Friday, although at the time, there was also dependency conflicts as I run first code block to install libraries (as shown below)
```
ERROR: p…
-
I am building GPT kinda model in Equinox, and right now the forward pass is extremely slow compared to my torch implementation. I think this is one of the cases where I would like to attach a profiler…
-
The newest version of jax seems to require jaxlib v0.3.7, which breaks the trainer script:
```bash
$ ./run_pretrain.sh
2022-04-16 23:34:03.151271: W tensorflow/stream_executor/platform/default/dso…
-
The Simultaneous Perturbation Stochastic Approximation (SPSA) optimisation method is a faster optimisation method.
> If the number of terms being optimized is p, then the finite-difference method…
-
I'm having a hard time getting `optax.MultiSteps` to work. With my own model I was getting an error that made it seem like the optimizers tree structure was changing between updates, so to simplify th…
-
I noticed that there are [no other choices of optimizers other than scale_by_adafactor()](https://github.com/google-research/big_vision/blob/c62890a3e4487b1d6751794b090138b9da5d18e1/big_vision/optax.p…
-
Hi Ye,
[hamy12398](https://github.com/hamy12398), my student, and I are trying to make scVI-3D working. We are encountering the same issue on different computers.
First, installation has torch v…
-
### Description
Hi,
I've been trying to port Tensor Flow code in Google Colab (free version) to JAX but the execution time is 5 times slower:
The original TF code takes 4 seconds to run:
htt…
-
**Describe the bug**
AdamW implementation (see [here](https://github.com/NVIDIA/apex/blob/a7de60e57f0534266841e1733262601ad76aaa74/csrc/multi_tensor_adam.cu#L333)) does not truly decouple the weight…
-
Hi,
I'm playing around with clients learning rate but I cannot find a clean way of modifying it.
Basically, I need to change the LR following a schedule based on the current round.
Is that possi…