-
While sparse arrays are [supported in dask](http://dask.pydata.org/en/latest/array-sparse.html), this issue aims to open the discussion on how this could be applied in the the context of dask-ml.
…
-
### System Info / 系統信息
![Uploading 5.PNG…]()
### Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?
- [X] docker / docker
- [ ] pip install / 通过 pip install 安装
- [ ] installation from s…
-
## Description
Whenever I run this code, the dask job crashes and all the workers get lost and then the task just hangs forever. While if I provide small size files then the same code works fine. (
-
官方最新docker镜像部署 xinference 0.10.3
当replica=2,GPU id 设置为2、3 时报错,详细情况如下:
024-05-31 03:04:47,135 xinference.core.worker 94 INFO You specify to launch the model: custom-chatglm3-6b-128k on GPU…
-
2024-08-31 12:49:22,685] [INFO] [real_accelerator.py:203:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-08-31 12:49:22,685] [INFO] [real_accelerator.py:203:get_accelerator] Setti…
-
### Describe the bug
Hello, I implemented my own custom pipeline referring StableDiffusionPipeline (RepDiffusionPipeline), but there are some issues
I called "accelerator.prepare" properly, and ma…
-
### 🔖 Feature description
We've seen marketing from Unsloth that optimized triton kernels for various operations can significantly improve both the speed and memory efficiency of fine-tuning LoRA a…
-
We've recently added support for column-wise data split (feature parallelism) and vertical federated learning (#8424), but the user interface in python is limited to text inputs and numpy arrays (#936…
-
### 🚀 The feature, motivation and pitch
The current implementation invokes h2d/d2h on the same stream as compute, essentially blocking the GPU from doing other computation. A straightforward approa…
-
## Description
In the model wide&deep,**a categorical feature 'sex' has three values '0,1,2'.** Then feed it into SparseEmbedding layer, just like the code below.
```
embed_weight = mx.symbol.Varia…