-
**Build Scans:**
- [elasticsearch-periodic #4529 / 8.1.3_bwc](https://gradle-enterprise.elastic.co/s/hazlznbxs36sk)
- [elasticsearch-pull-request #37407 / 8.1.3_bwc](https://gradle-enterprise.elastic.…
-
Hi,
Thank for such an wonderful repo. I was trying to train the model with a custom dataset using the lora script and getting the below error:
```
[2024-10-29 17:59:25,985] [INFO] [real_accelerat…
-
### System Info
- Platform: Linux-5.10.227-219.884.amzn2.x86_64-x86_64-with-glibc2.26
- Python version: 3.10.15
- PyTorch version: 2.5.1
- CUDA device(s): Tesla T4, Tesla T4, Tesla T4, Tesla T4
-…
-
I'm trying to specify resources for builtin dask functions such a `dd.read_csv`, with an end goal of running certain functions on "CPU workers" and other functions on "GPU workers". Here's a minimal e…
-
- [x] #159
- [x] Create a README under `docs/` explaining how to manage documentation
- [x] organize RST files in subfolders reflecting the order of the navigation
- [x] Add python script for tutor…
-
Opening this mostly for discussion.
Say all your data lives in ``. After doing some selecting/filtering/transforming, you want to export your data out of the DB and into a different distributed sys…
-
### Willingness to contribute
No. I cannot contribute this feature at this time.
### Proposal Summary
This feature request proposes to add support for logging FullyShardedDataParallel models …
-
Below queries rely on cuML models from for ML GPU . Depending on the performance we need to decide b/w Distributed (dask-ml) vs non distributed (sklearn) implementation for the ML portion of these q…
-
**What happened**:
When dealing with large NumPy arrays, scatter starts to fail with a strange error: `Exception: too many values to unpack (expected 1)` rather than telling me the array is t…
-
**Describe the issue**:
I am running into an issue with deploying dask using LocalCUDACluster() on an HPC. I am trying to do RandomForest, and the amount of data I am inputting exits the limit of a si…