-
How can one use this project to fine-tune using a TPU-v4 instance?
I tried everything, but always get errors.
Most commonly:
UserWarning: cloud_tpu_init failed: **KeyError('v4-8')**
This a JAX…
-
### Community Note
* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the commu…
-
``` bash
markusheimerl@t1v-n-a16d1e4e-w-0:~/gimli$ cd ~/gemma_cktp/ && curl -o archive.tar.gz "https://storage.googleapis.com/kaggle-models-data/5305/11357/bundle/archive.tar.gz?X-Goog-Algorithm=GO…
-
### Bug description
On a TPU VM, using `WandbLogger` causes training to crash. I am using the nightly build which I know states "no guarantees", so apologies in advance if this is currently being w…
-
While trying to run the following code on tpu-vm, it didn't work.
```python
tf: 2.15
keras: 3.0.5
tpu = tf.distribute.cluster_resolver.TPUClusterResolver(tpu="local")
strategy = tf.distribute…
innat updated
4 months ago
-
For GPUs, I (and I think most folks) am used to debugging memory usage and performance usage using `nvidia-smi`. TPU's don't have a great equivalent for this. Right now, we log a bunch of other system…
-
### System Info
```Shell
accelerate
```
### Information
- [ ] The official example scripts
- [X] My own modified scripts
### Tasks
- [ ] One of the scripts in the examples/ folder …
-
Does OpenNMT-tf support training on Cloud TPUs?
-
### 🚀 The feature, motivation, and pitch
trlX uses HuggingFace accelerate under the hood. Accelerate has the capability to leverage Google's TPUs for faster training. I'm interested in supporting trl…
-
你好作者:
想问一下我在运行训练代码的时候过了一个小时还是这样:
I0315 19:24:30.427282 140093487493760 tpu_estimator.py:2308] examples/sec: 13.3812
INFO:tensorflow:global_step/sec: 0.213636
I0315 19:24:35.107750 14009348…
XD-mu updated
3 weeks ago