-
**Is your feature request related to a problem? Please describe.**
Tried to run custom 40B model, whose weights can be loaded with 2 80GB GPU's VRAM.
lmcache is able to load small models with in sin…
-
-
Python 3.11.10 | packaged by conda-forge | (main, Sep 22 2024, 14:10:38) [GCC 13.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from vllm import SamplingPar…
-
你好,我运行`sh tools/dist_train.sh projects/configs/co_dino_vit/co_dino_5scale_vit_large_coco.py 1`时,会如下错误,我看之前也有人报这个错,请问这个问题如何解决?谢谢
![image](https://github.com/user-attachments/assets/8d4855df-16ff-4c5a-…
-
### Expected behavior
JMeter Listeners on maser machine not showing mercies from slave
### Actual behavior
When running JMeter Distributed Load Test. The Slave machine jmeter- server logs shows …
-
Right now to implement distributed tracing with the JS SDK, it is a bit confusing. The docs jump back and forth between different concepts: https://docs.sentry.io/platforms/javascript/distributed-trac…
-
用一张3090训练会出现如下的问题,我的训练命令是python train.py -c configs/dfine/dfine_hgnetv2_l_coco.yml,请问是否有配置选项可以关闭分布式功能。或者说能使用单卡训练dfine吗?
Traceback (most recent call last):
File "/workspace/D-FINE/src/nn/back…
-
@ericneiva
The moment fitting integration is not currently working on distributed.
I have done some progress in my fork [distributed_moment_fitting](https://github.com/pmartorell/GridapEmbedde…
-
We just explored osctrl-admin and found that we can add a custom tag to each node/device. However, after added the custom tag, we cannot run a distributed query based on this tag. It would be great he…
-
### Your current environment
kuberay,vllm 0.4.0
L40 GPU server *2, each one with L40*8, CX6 IB card 200G*2
### How would you like to use vllm
I plan to use KubeRay to implement multi-node distri…