-
Dear team, thanks for releasing the code of this powerful tool.
I tested it on a few complex targets of my interest; the results are very promising. However, it failed interfering very large protei…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [+] I am using the latest TensorFlow Model Garden release and TensorFlow 2.
- [+] I am reporting…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What would your feature do ?
Google offers the Google Cloud TPU VM, an e…
-
Thank you so much again for making this resource available!
This is basically a duplicate of #646 but updated since the issues are much more intermittent.
## Expected Behavior
Consistently re…
-
Traceback (most recent call last):
File "/usr/local/lib/python3.5/dist-packages/tensorflow/python/framework/op_def_library.py", line 510, in _apply_op_helper
preferred_dtype=default_dtype)
…
-
### Description
When I run the following code block, it produces different output with and without sharding:
```
import jax
import jax.numpy as jnp
import numpy as np
mesh = jax.sharding.Mesh(np…
-
Is there a plan to support PEFT methods like LoRA training in maxtext to support larger model fine-tuning / continue pretraining so that bigger models like LLaMA-3-70B can be trainined even with small…
-
### Description
My program fails silently when using `at[index].set(value)` in the multi-host setting with `shard_map` and `jit`.
Here is a minimal example to reproduce the error:
```python
…
-
It [has been pointed out by Wojtek Czarnowski](https://groups.google.com/a/tensorflow.org/d/msg/swift/NyAg6MV5PT0/0KjPYSGdBQAJ) that in specific cases the Transformer model (or components used within …
-
I just tried running the tests using `sudo python setup.py test` and got the following error, which is seems related to the fact that the `shard_to_cpu` parameter was [removed a while ago](https://git…