-
I am trying to save all coherence and phase data into a GeoTIFF using sbas.export_geotiff.
decimator = sbas.decimator(resolution=15)
corr_sbas = decimator(ds_sbas.correlation)
sbas.export_geotiff…
-
I have tried #8223 on a ~3.4TB gzipped Parquet dataset.
I tried four runs so far, with two different behaviours
- First I tried the whole dataset. I got to the last step (`to_parquet`), but then r…
-
### System Info
```shell
+-----------------------------------------------------------------------------+
| HL-SMI Version: hl-1.17.0-fw-51.1.0 |
| Driver Ver…
-
**What would you like to be added**:
Generally,
- if user use object stores, they can use fluid as distributed caching system
- if user use oci images, they can use dragonfly for p2p acceler…
-
### System Info
- `transformers` version: 4.45.0.dev0
- Platform: Linux-5.15.0-105-generic-x86_64-with-glibc2.35
- Python version: 3.10.14
- Huggingface_hub version: 0.24.5
- Safetensors version:…
-
我有两张gpu,一张P40一张1070,架构都是帕斯卡,正常推理使用没问题,但是训练的时候出现以下情况:
Traceback (most recent call last):
File "/mnt/e/rwkv/./finetune/lora/v6/train.py", line 540, in
trainer.fit(model, data_loader)
File "/usr/loc…
-
**Describe the issue**:
FastAPI is known to can't be pickled by default pickle and dill.
Click [here](https://github.com/encode/starlette/discussions/2308) for the details.
However, Ray pickle…
-
### 🐛 Describe the bug
Torch compile stochastically fails during multinode training with FileNotFoundError in torch._dynamo. Full stack trace below. This is a difficult bug to provide a minimal rep…
-
My tests indicate that, for all but very small files of only a few MBs, dbz2 works in parallel on a single node but appears to persistently fail when distributed over multiple nodes in a HPC cluster. …
-
### 🐛 Describe the bug
I’m encountering a runtime error while using PyTorch with the RPC backend on my system. The error message is confusing, and I can't find it documented anywhere on the internet.…